Eray Tüzün

2 papers · Latest: April 27, 2026

Evaluation of LLM-Based Software Engineering Tools: Practices, Challenges, and Future Directions

This paper examines the unique challenges and current practices in evaluating LLM-based software engineering tools, proposing future directions for robust assessment.

2604.24621 · Apr 27, 2026

Software Engineering

Understanding the Limits of Automated Evaluation for Code Review Bots in Practice

Automated evaluation of LLM-powered code review bots in industrial settings shows moderate alignment with human labels, limited by contextual factors.

2604.24525 · Apr 27, 2026
