Hao Zhao

5 papers · Latest: May 7, 2026

Relit-LiVE: Relight Video by Jointly Learning Environment Video

Relit-LiVE relights videos consistently and stably without camera pose, by using raw images and jointly predicting environment videos.

2605.06658May 7, 2026

Computer Vision

Unified Map Prior Encoder for Mapping and Planning

UMPE is a Unified Map Prior Encoder that effectively fuses diverse map priors with BEV features for improved autonomous driving mapping and planning.

2605.02762May 4, 2026

Computer Vision

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

UniVidX is a unified multimodal framework that leverages video diffusion priors for versatile video generation across diverse tasks with strong performance.

2605.00658May 1, 2026

ASAP: An Azimuth-Priority Strip-Based Search Approach to Planar Microphone Array DOA Estimation in 3D

ASAP is a novel azimuth-priority strip-based search approach for fast and accurate 3D DOA estimation using planar microphone arrays.

2604.25387Apr 28, 2026

Computer Vision

LottieGPT: Tokenizing Vector Animation for Autoregressive Generation

LottieGPT is the first framework that tokenizes and autoregressively generates editable vector animations from prompts using a new tokenizer and large dataset.

2604.11792Apr 13, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.