Zhenyu Chen
3 papers ยท Latest:
Software Engineering
Breaking, Stale, or Missing? Benchmarking Coding Agents on Project-Level Test Evolution
TEBench is the first project-level benchmark for evaluating coding agents on test evolution, revealing limitations in handling stale and missing tests.
2605.06125
Cryptography & SecurityTrain in Vain: Functionality-Preserving Poisoning to Prevent Unauthorized Use of Code Datasets
FunPoison introduces a functionality-preserving poisoning method to prevent unauthorized use of code datasets for training CodeLLMs, maintaining compilability.
2604.22291
Software EngineeringLog-based, Business-aware REST API Testing
LoBREST is a log-based, business-aware REST API testing technique that uses historical request logs to test complex functionalities.
2604.08007
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.