ArXiv TLDR

Cuiyun Gao

6 papers ยท Latest:

Software Engineering

Schedule-and-Calibrate: Utility-Guided Multi-Task Reinforcement Learning for Code LLMs

ASTOR is a multi-task RL framework for code LLMs that uses utility-driven data scheduling and policy optimization, outperforming specialists.

2605.06111
Software Engineering

When Model Editing Meets Service Evolution: A Knowledge-Update Perspective for Service Recommendation

EVOREC is a framework for service recommendation that uses model editing and constrained decoding to adapt to evolving services and overcome outdated facts.

2604.26686
Software Engineering

Cascaded Code Editing: Large-Small Model Collaboration for Effective and Efficient Code Editing

This paper proposes Cascaded Code Editing, combining large models for edit sketch generation and small models for efficient application.

2604.19201
Software Engineering

On the Effectiveness of Context Compression for Repository-Level Tasks: An Empirical Investigation

This paper empirically investigates context compression for repository-level code tasks, finding it effective for performance and efficiency.

2604.13725
Software Engineering

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios

This paper introduces CLI-Tool-Bench, a new benchmark for evaluating LLM-based 0-to-1 software generation, revealing current models struggle with end-to-end CLI tool creation.

2604.06742
Software Engineering

Dependency-Guided Repository-Level C-to-Rust Translation with Reinforcement Alignment

DepTrans is a new framework that automates C-to-Rust code migration using reinforcement learning and dependency-guided refinement, achieving high accuracy.

2604.02852

๐Ÿ“ฌ Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week โ€” summarized, scored, and delivered to your inbox every Monday.