Feng Luo

2 papers · Latest: April 9, 2026

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

This paper identifies and solves length inflation in on-policy distillation (OPD) for LLMs, improving training stability and performance.

2604.08527Apr 9, 2026

Software Engineering

Dependency-Guided Repository-Level C-to-Rust Translation with Reinforcement Alignment

DepTrans is a new framework that automates C-to-Rust code migration using reinforcement learning and dependency-guided refinement, achieving high accuracy.

2604.02852Apr 3, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.