Yiran Zhang

3 papers · Latest: April 24, 2026

RealBench: A Repo-Level Code Generation Benchmark Aligned with Real-World Software Development Practices

RealBench is a new benchmark for repo-level code generation, using structured designs (UML) to better align LLM evaluation with real-world software development.

2604.22659 · Apr 24, 2026

Software Engineering

Bridging the Gap between User Intent and LLM: A Requirement Alignment Approach for Code Generation

REA-Coder improves LLM code generation by iteratively aligning user requirements, addressing the common issue of LLMs misunderstanding prompts.

2604.16198 · Apr 17, 2026

Cryptography & Security

RLSpoofer: A Lightweight Evaluator for LLM Watermark Spoofing Resilience

RLSpoofer is a lightweight, black-box, RL-based attack that exposes the fragility of LLM watermarking, achieving a high spoofing success rate with minimal data.

2604.11546 · Apr 13, 2026
