Isaac David
2 papers ยท Latest:
Software Engineering
CrackMeBench: Binary Reverse Engineering for Agents
CrackMeBench is a new benchmark for evaluating language models on binary reverse engineering tasks, focusing on recovering validation logic from executables.
2605.10597
Cryptography & SecurityPatch2Vuln: Agentic Reconstruction of Vulnerabilities from Linux Distribution Binary Patches
Patch2Vuln uses a language model agent to reconstruct vulnerabilities from Linux binary patches, evaluated on Ubuntu packages.
2605.06601
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.