Zhou Zhao

2 papers · Latest: April 27, 2026

Diffusion Model as a Generalist Segmentation Learner

DiGSeg repurposes diffusion models for versatile, text-conditioned segmentation across diverse domains without custom architectures.

Figma2Code automates design-to-code by leveraging rich multimodal Figma data, creating a new task and dataset to benchmark MLLMs.

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.