Hao Li

5 papers · Latest: May 12, 2026

Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models

GAP proposes a granular alignment paradigm to stabilize visual latent reasoning in MLLMs by addressing feature-space mismatches, improving performance.

2605.12374May 12, 2026

Galaxies & Cosmology

A Universal Dance of Galactic Disks: Ubiquitous Precession and Its Implications

Galactic disk precession is ubiquitous, driven by tidal torques, and significantly impacts galaxy evolution, including warps and satellite alignment.

2605.00349May 1, 2026

Computer Vision

PhysInOne: Visual Physics Learning and Reasoning in One Suite

PhysInOne is a new large-scale dataset with 2 million videos and detailed annotations for training AI in physics-grounded visual reasoning.

2604.09415Apr 10, 2026

Software Engineering

Do AI Coding Agents Log Like Humans? An Empirical Study

AI coding agents log differently than humans, often less, and struggle to follow explicit logging instructions, requiring human intervention.

2604.09409Apr 10, 2026

Robotics

ViVa: A Video-Generative Value Model for Robot Reinforcement Learning

ViVa is a video-generative value model that improves robot reinforcement learning by using a pretrained video generator to estimate future dynamics and task value.

2604.08168Apr 9, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.