Yanjie Fu
2 papers · Latest:
Information Retrieval
Metric-agnostic Learning-to-Rank via Boosting and Rank Approximation
This paper introduces a novel metric-agnostic Learning-to-Rank framework that uses a differentiable loss and gradient boosting for improved, generalizable ranking.
2604.15101
Artificial Intelligence
StaRPO: Stability-Augmented Reinforcement Policy Optimization
StaRPO is a new RL framework that improves LLM reasoning by incorporating stability metrics (ACF, PE) to enhance logical consistency and accuracy.
2604.08905