Yanjie Fu

2 papers · Latest: April 16, 2026

Metric-agnostic Learning-to-Rank via Boosting and Rank Approximation

This paper introduces a novel metric-agnostic Learning-to-Rank framework that uses a differentiable loss and gradient boosting for improved, generalizable ranking.

2604.15101Apr 16, 2026

Artificial Intelligence

StaRPO: Stability-Augmented Reinforcement Policy Optimization

StaRPO is a new RL framework that improves LLM reasoning by incorporating stability metrics (ACF, PE) to enhance logical consistency and accuracy.

2604.08905Apr 10, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.