Xian Sun
2 papers ยท Latest:
Computer Vision
Masked Generative Transformer Is What You Need for Image Editing
EditMGT, a novel Masked Generative Transformer, offers faster, more precise image editing by localizing changes, outperforming diffusion models.
2605.10859
Computer VisionIs Your Driving World Model an All-Around Player?
WorldLens is a new benchmark, dataset, and agent for evaluating driving world models beyond visual realism, focusing on physical and behavioral fidelity.
2605.10858
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.