Feng Luo
2 papers ยท Latest:
Natural Language Processing
Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
This paper identifies and solves length inflation in on-policy distillation (OPD) for LLMs, improving training stability and performance.
2604.08527
Software EngineeringDependency-Guided Repository-Level C-to-Rust Translation with Reinforcement Alignment
DepTrans is a new framework that automates C-to-Rust code migration using reinforcement learning and dependency-guided refinement, achieving high accuracy.
2604.02852
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.