ArXiv TLDR

The Deepfakes We Missed: We Built Detectors for a Threat That Didn't Arrive

arXiv:2605.12075

Shaina Raza

cs.CR, cs.AI

TLDR

Deepfake detection research is misaligned: it remains focused on public-figure manipulation, while the real threats are Non-Consensual Intimate Imagery (NCII), voice-clone scams, and emotional-manipulation fraud.

Key contributions

  • Deepfake detection research remains organized around an outdated threat model centered on public-figure manipulation.
  • Real-world deepfake harms are dominated by peer-generated NCII, voice-clone scam calls, and emotional-manipulation fraud.
  • The predicted large-scale public-figure deepfake catastrophe did not materialize during the 2024 global information environment.
  • Proposes rebalancing the ML research agenda toward the harm categories that are actually growing.

Why it matters

This paper identifies a critical misalignment in deepfake defense and redirects research attention toward the threats that are actually growing, such as NCII and voice-clone scams. Closing this gap is essential for effective deepfake mitigation and for protecting people from real-world harms.

Original Abstract

Nearly a decade of Machine Learning (ML) research on deepfake detection has been organized around a threat model inherited from 2017–2019, revolving around face-swap and talking-head manipulation of public figures, motivated by concerns about large-scale misinformation and video-evidence fraud. This position paper argues that the threat the field prepared for did not arrive, and the threats that did arrive are substantially different. An accounting of deepfake incidents in 2022–2026 shows that the dominant observed harms are peer-generated Non-Consensual Intimate Imagery (NCII), voice-clone scam calls targeting families and finance workers, and emotional-manipulation fraud. The predicted large-scale public-figure deepfake catastrophe did not materialize during the 2024 global information environment despite extensive preparation. Meanwhile, research effort, benchmarks, and detection methods remain concentrated on the inherited threat model. The central claim of this paper is that this misalignment is now the dominant bottleneck on real-world deepfake defense, not model capability. We argue the ML research community should substantially rebalance its research agenda toward the harm categories that are actually growing. We support this position with empirical accounting of research effort and harm distribution, identify the structural reasons the misalignment persists, and outline three concrete technical research agendas for the under-defended harm categories.
