ArXiv TLDR

BEACON: A Multimodal Dataset for Learning Behavioral Fingerprints from Gameplay Data

🐦 Tweet
2605.10867

Ishpuneet Singh, Gursmeep Kaur, Uday Pratap Singh Atwal, Guramrit Singh, Gurjot Singh + 1 more

cs.CRcs.AIcs.CVcs.LGcs.NI

TLDR

BEACON is a large, multimodal dataset from competitive Valorant gameplay for continuous authentication and behavioral fingerprinting research.

Key contributions

  • Introduces BEACON, a 430GB multimodal dataset from 102+ hours of competitive Valorant gameplay.
  • Captures diverse signals: mouse, keystrokes, network, screen, hardware, and in-game context.
  • Leverages high cognitive & motor demands of esports for robust behavioral biometrics research.
  • Supports research in continuous authentication, behavioral profiling, user drift, and multimodal learning.

Why it matters

This paper addresses the need for robust datasets in continuous authentication. BEACON's large scale and multimodal nature, especially from high-stakes esports, provide a unique benchmark. It will drive advancements in behavioral fingerprinting and security models.

Original Abstract

Continuous authentication in high-stakes digital environments requires datasets with fine-grained behavioral signals under realistic cognitive and motor demands. But current benchmarks are often limited by small scale, unimodal sensing or lack of synchronised environmental context. To address this gap, this paper introduces BEACON ( Behavioral Engine for Authentication \& Continuous Monitoring), a large-scale multimodal dataset that captures diverse skill tiers in competitive \textit{Valorant} gameplay. BEACON contains approximately 430 GB of synchronised modality data (461 GB total on-disk including auxiliary \textit{Valorant} configuration captures) from 79 sessions across 28 distinct players, estimated at 102.51 hours of active gameplay, including high-frequency mouse dynamics, keystroke events, network packet captures, screen recordings, hardware metadata, and in-game configuration context. BEACON leverages the high precision motor skills and high cognitive load that are inherent to tactical shooters, making it a rigorous stress test for the robustness of behavioral biometrics. The dataset allows for the study of continuous authentication, behavioral profiling, user drift and multimodal representation learning in a high-fidelity esports setting. The authors release the dataset and code on Hugging Face and GitHub to create a reproducible benchmark for evaluating next-generation behavioral fingerprinting and security models

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.