ArXiv TLDR

MetaCloak-JPEG: JPEG-Robust Adversarial Perturbation for Preventing Unauthorized DreamBooth-Based Deepfake Generation

🐦 Tweet
2604.18537

Tanjim Rahaman Fardin, S M Zunaid Alam, Mahadi Hasan Fahim, Md Faysal Mahfuz

cs.CV

TLDR

MetaCloak-JPEG introduces a JPEG-robust adversarial perturbation to prevent unauthorized DreamBooth deepfake generation by integrating a differentiable JPEG layer.

Key contributions

  • Introduces MetaCloak-JPEG, a novel adversarial perturbation robust to JPEG compression for deepfake prevention.
  • Incorporates a Differentiable JPEG (DiffJPEG) layer using Straight-Through Estimator to handle JPEG's non-differentiable `round()`.
  • Employs a JPEG-aware EOT distribution and a curriculum quality-factor schedule within a bilevel meta-learning loop.
  • Achieves 91.3% JPEG survival rate and outperforms PhotoGuard across all JPEG quality factors.

Why it matters

This paper addresses a critical vulnerability in deepfake protection: the failure of existing methods to account for social media's JPEG compression. By introducing MetaCloak-JPEG, it offers a robust defense against unauthorized DreamBooth-based deepfake generation, significantly improving privacy and security for online images.

Original Abstract

The rapid progress of subject-driven text-to-image synthesis, and in particular DreamBooth, has enabled a consent-free deepfake pipeline: an adversary needs only 4-8 publicly available face images to fine-tune a personalized diffusion model and produce photorealistic harmful content. Current adversarial face-protection systems -- PhotoGuard, Anti-DreamBooth, and MetaCloak -- perturb user images to disrupt surrogate fine-tuning, but all share a structural blindness: none backpropagates gradients through the JPEG compression pipeline that every major social-media platform applies before adversary access. Because JPEG quantization relies on round(), whose derivative is zero almost everywhere, adversarial energy concentrates in high-frequency DCT bands that JPEG discards, eliminating 60-80% of the protective signal. We introduce MetaCloak-JPEG, which closes this gap by inserting a Differentiable JPEG (DiffJPEG) layer built on the Straight-Through Estimator (STE): the forward pass applies standard JPEG compression, while the backward pass replaces round() with the identity. DiffJPEG is embedded in a JPEG-aware EOT distribution (~70% of augmentations include DiffJPEG) and a curriculum quality-factor schedule (QF: 95 to 50) inside a bilevel meta-learning loop. Under an l-inf perturbation budget of eps=8/255, MetaCloak-JPEG attains 32.7 dB PSNR, a 91.3% JPEG survival rate, and outperforms PhotoGuard on all 9 evaluated JPEG quality factors (9/9 wins, mean denoising-loss gain +0.125) within a 4.1 GB training-memory budget.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.