Geometry-Aware Localized Watermarking for Copyright Protection in Embedding-as-a-Service
Zhimin Chen, Xiaojie Liang, Wenbo Xu, Yuxuan Liu, Wei Lu
TLDR
GeoMark is a new geometry-aware watermarking framework for Embedding-as-a-Service, offering robust copyright protection against various attacks.
Key contributions
- Introduces GeoMark, a geometry-aware localized watermarking framework for EaaS copyright protection.
- Uses natural in-manifold embeddings as shared watermark targets and geometry-separated anchors.
- Activates watermark injection only within adaptive local neighborhoods, decoupling triggering from attribution.
- Achieves robust copyright verification against paraphrasing, dimensional perturbation, and CSE attacks.
Why it matters
Embedding-as-a-Service is crucial but vulnerable to copyright infringement. Existing watermarking methods struggle with robustness and false positives. GeoMark provides a novel solution by localizing watermarking and centralizing attribution, ensuring robust protection while maintaining utility.
Original Abstract
Embedding-as-a-Service (EaaS) has become an important semantic infrastructure for natural language and multimedia applications, but it is highly vulnerable to model stealing and copyright infringement. Existing EaaS watermarking methods face a fundamental robustness--utility--verifiability tension: trigger-based methods are fragile to paraphrasing, transformation-based methods are sensitive to dimensional perturbation, and region-based methods may incur false positives due to coincidental geometric affinity. To address this problem, we propose GeoMark, a geometry-aware localized watermarking framework for EaaS copyright protection. GeoMark uses a natural in-manifold embedding as a shared watermark target, constructs geometry-separated anchors with explicit target--anchor margins, and activates watermark injection only within adaptive local neighborhoods. This design decouples where watermarking is triggered from what ownership is attributed to, achieving localized triggering and centralized attribution. Experiments on four benchmark datasets show that GeoMark preserves downstream utility and geometric fidelity while maintaining robust copyright verification under paraphrasing, dimensional perturbation, and CSE (Clustering, Selection, Elimination) attacks, with improved verification stability and low false-positive risk.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.