Hao Zhao
5 papers · Latest:
Relit-LiVE: Relight Video by Jointly Learning Environment Video
Relit-LiVE relights videos consistently and stably without requiring camera pose, using raw images and jointly predicting environment videos.
Unified Map Prior Encoder for Mapping and Planning
UMPE is a Unified Map Prior Encoder that effectively fuses diverse map priors with BEV features for improved autonomous driving mapping and planning.
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
UniVidX is a unified multimodal framework that leverages video diffusion priors for versatile video generation across diverse tasks with strong performance.
ASAP: An Azimuth-Priority Strip-Based Search Approach to Planar Microphone Array DOA Estimation in 3D
ASAP is a novel azimuth-priority strip-based search approach for fast and accurate 3D DOA estimation using planar microphone arrays.
LottieGPT: Tokenizing Vector Animation for Autoregressive Generation
LottieGPT is the first framework that tokenizes and autoregressively generates editable vector animations from prompts using a new tokenizer and large dataset.