Juan Zhai
2 papers ยท Latest:
Software Engineering
POSTCONDBENCH: Benchmarking Correctness and Completeness in Formal Postcondition Inference
This paper introduces POSTCONDBENCH, a new benchmark for evaluating LLM-generated program postconditions for both correctness and completeness.
2605.03356
Cryptography & SecurityTrain in Vain: Functionality-Preserving Poisoning to Prevent Unauthorized Use of Code Datasets
FunPoison introduces a functionality-preserving poisoning method to prevent unauthorized use of code datasets for training CodeLLMs, maintaining compilability.
2604.22291
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.