Yang Yu

4 papers · Latest: May 7, 2026

MedHorizon: Towards Long-context Medical Video Understanding in the Wild

MedHorizon introduces a new benchmark for long-context medical video understanding, revealing that current MLLMs struggle with sparse evidence retrieval and clinical reasoning.

2605.06537 · May 7, 2026

Validity and Limits of Low Order Hybridization Expansion Approaches for Multi-Orbital Systems

The accuracy of low-order hybridization expansion methods in multi-orbital systems is limited by the least correlated orbital, which suppresses features.

2605.02228 · May 4, 2026

Software Engineering

Mono2Sls: Automated Monolith-to-Serverless Migration via Multi-Stage Pipeline with Static Analysis

Mono2Sls automates monolith-to-serverless migration using a static analysis-guided LLM agent pipeline, achieving high deployment success and correctness.

2604.24550 · Apr 27, 2026

General Economics

On Benchmark Hacking in ML Contests: Modeling, Insights and Design

This paper models benchmark hacking in ML contests, revealing strategic effort allocation and reward impacts on true generalization.

2604.22230 · Apr 24, 2026
