Efficient Fuzzy Private Set Intersection from Secret-shared OPRF
Xinpeng Yang, Meng Hao, Chenkai Weng, Robert H. Deng, Yonggang Wen + 1 more
TLDR
This paper introduces highly efficient Fuzzy Private Set Intersection protocols for Lp distances, using symmetric-key operations and secret-shared OPRF.
Key contributions
- Proposes efficient FPSI protocols for Lp distance metrics using symmetric-key operations, avoiding expensive HE.
- Achieves linear communication and computation complexity in set sizes, dimension, and distance threshold.
- Introduces a novel oblivious programmable PRF with secret-shared outputs as a core building block.
- Incorporates a prefix technique to reduce distance threshold dependence to logarithmic, suitable for large δ.
Why it matters
This work significantly advances Fuzzy Private Set Intersection, a critical privacy-preserving technique. By leveraging symmetric-key operations and novel cryptographic primitives, it overcomes the practical limitations of prior approaches, making FPSI viable for real-world applications requiring efficient and secure data comparison.
Original Abstract
Private set intersection (PSI) enables a sender holding a set $Q$ of size $m$ and a receiver holding a set $W$ of size $n$ to securely compute the intersection $Q \cap W$. Fuzzy PSI (FPSI) is a PSI variant where the receiver learns the items $q \in Q$ for which there exists some $w \in W$ satisfying $\mathsf{dist}(q, w) \le δ$ under a given distance metric. Although several FPSI works are proposed for $L_{p}$ distance metrics with $p \in [1, \infty]$, they either heavily rely on expensive homomorphic encryptions, or incur undesirable complexity, e.g., exponential to the element dimension, both of which lead to poor practical efficiency. In this work, we propose efficient FPSI protocols for $L_{p \in [1, \infty]}$ distance metrics, primarily leveraging significantly cheaper symmetric-key operations. Our protocols achieve linear communication and computation complexity in the set sizes $m,n$, the dimension $d$, and the distance threshold $δ$. Our core building block is an oblivious programmable PRF with secret-shared outputs, which may be of independent interest. Furthermore, we incorporate a prefix technique that reduces the dependence on the distance threshold $δ$ to logarithmic, which is particularly suitable for large $δ$. We implement our FPSI protocols and compare them with state-of-the-art constructions. Experimental results demonstrate that our protocols consistently and significantly outperform existing works across all settings. Specifically, our protocols achieve a speedup of $12{\sim}145\times$ in running time and a reduction of $3{\sim}8\times$ in communication cost compared to Gao et al.~(ASIACRYPT'24) and a speedup of $9{\sim}80\times$ in running time and a reduction of $5{\sim}19\times$ in communication cost compared to Dang et al.~(CCS'25).
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.