Juan Zhai

2 papers · Latest: May 5, 2026

POSTCONDBENCH: Benchmarking Correctness and Completeness in Formal Postcondition Inference

This paper introduces POSTCONDBENCH, a new benchmark for evaluating LLM-generated program postconditions for both correctness and completeness.

2605.03356 · May 5, 2026

Cryptography & Security

Train in Vain: Functionality-Preserving Poisoning to Prevent Unauthorized Use of Code Datasets

FunPoison is a functionality-preserving poisoning method designed to prevent unauthorized use of code datasets for training CodeLLMs while keeping the poisoned code compilable.

2604.22291 · Apr 24, 2026
