Eray Tüzün
2 papers · Latest:
Software Engineering
Evaluation of LLM-Based Software Engineering Tools: Practices, Challenges, and Future Directions
This paper examines the unique challenges and current practices in evaluating LLM-based software engineering tools, proposing future directions for robust assessment.
2604.24621
Software EngineeringUnderstanding the Limits of Automated Evaluation for Code Review Bots in Practice
Automated evaluation of LLM-powered code review bots in industrial settings shows moderate alignment with human labels, limited by contextual factors.
2604.24525
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.