DECKER: Domain-invariant Embedding for Cross-Keyboard Extraction and Recognition
Bikrant Bikram Pratap Maurya, Nitin Choudhury, Daksh Agarwal, Arun Balaji Buduru
TLDR
DECKER is a new framework that uses a large dataset (HEAR) to improve acoustic side-channel attacks, making keystroke inference robust across diverse keyboards.
Key contributions
- Introduces HEAR, a large dataset for acoustic side-channel attacks (ASCA) across 37 keyboards and 53 users.
- Proposes DECKER, a domain-invariant framework for robust keystroke inference across diverse keyboards.
- DECKER employs keyboard signature normalization, domain-adversarial disentanglement, and contrastive alignment.
- Integrates an LLM-based post-processing layer to refine keystroke sequences using linguistic context.
Why it matters
This paper addresses the critical security risk of acoustic side-channel attacks by demonstrating their effectiveness across diverse keyboards, users, and noisy environments. DECKER and the HEAR dataset provide a robust benchmark, highlighting the practical threat and the need for improved countermeasures against keystroke inference.
Original Abstract
Acoustic side-channel attacks (ASCA) on keyboards pose a significant security risk, as keystrokes can be inferred from typing acoustics, revealing sensitive information. Prior ASCA studies are limited by small-scale datasets with restricted diversity in users, keyboards, and environments, constraining analysis across devices, microphones, and noise conditions. We introduce HEAR, a dataset designed to study ASCA along three axes: keyboard generalization, noise adaptation, and user bias. HEAR contains recordings from 53 participants using 37 laptop keyboards, collected in three realistic settings: (1) external microphone capture, (2) device microphone capture without network noise, and (3) VoIP-based streaming capture. This enables controlled evaluation across users, keyboards, and environments. On HEAR, we establish an ASCA benchmark spanning conventional features and pre-trained representations from raw audio and spectrograms in unimodal and multimodal settings. We propose DECKER, a domain-invariant keystroke inference framework with four stages: (1) Keyboard Signature Normalization to reduce device coloration, (2) domain-adversarial disentanglement to suppress keyboard identity, (3) supervised cross-keyboard contrastive alignment to enforce key consistency, and (4) Acoustic Style Randomization to synthesize unseen keyboard responses. We further explore sentence-level inference using an LLM-based post-processing layer to refine keystroke sequences via linguistic context. Results on HEAR show DECKER improves keystroke identification over strong baselines, particularly in cross-keyboard and cross-user settings, with further gains from language-model rectification. These findings highlight that ASCA remains effective across diverse users, devices, and noisy environments, underscoring its practical security risk.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.