Lingdong Kong

5 papers · Latest: May 13, 2026

OmniLiDAR: A Unified Diffusion Framework for Multi-Domain 3D LiDAR Generation

OmniLiDAR is a unified diffusion framework that generates 3D LiDAR scans across eight diverse domains using text conditioning, addressing single-domain limitations.

2605.13815May 13, 2026

Computer Vision

Masked Generative Transformer Is What You Need for Image Editing

EditMGT, a novel Masked Generative Transformer, offers faster, more precise image editing by localizing changes, outperforming diffusion models.

2605.10859May 11, 2026

Computer Vision

Is Your Driving World Model an All-Around Player?

WorldLens is a new benchmark, dataset, and agent for evaluating driving world models beyond visual realism, focusing on physical and behavioral fidelity.

2605.10858May 11, 2026

Artificial Intelligence

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

This paper introduces a "levels x laws" taxonomy for agentic world models, synthesizing over 400 works and outlining a roadmap for future development.

2604.22748Apr 24, 2026

Computer Vision

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

OneVL introduces a unified VLA and World Model framework, achieving state-of-the-art latent Chain-of-Thought reasoning at real-time speed.

2604.18486Apr 20, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.