Yang Yu
4 papers ยท Latest:
Computer Vision
MedHorizon: Towards Long-context Medical Video Understanding in the Wild
MedHorizon introduces a new benchmark for long-context medical video understanding, revealing current MLLMs struggle with sparse evidence retrieval and clinical reasoning.
2605.06537
Validity and Limits of Low Order Hybridization Expansion Approaches for Multi-Orbital Systems
Low-order hybridization expansion methods' accuracy in multi-orbital systems is limited by the least correlated orbital, which suppresses features.
2605.02228
Software EngineeringMono2Sls: Automated Monolith-to-Serverless Migration via Multi-Stage Pipeline with Static Analysis
Mono2Sls automates monolith-to-serverless migration using a static analysis-guided LLM agent pipeline, achieving high deployment success and correctness.
2604.24550
General EconomicsOn Benchmark Hacking in ML Contests: Modeling, Insights and Design
This paper models benchmark hacking in ML contests, revealing strategic effort allocation and reward impacts on true generalization.
2604.22230
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.