Zhou Zhao
2 papers ยท Latest:
Computer Vision
Diffusion Model as a Generalist Segmentation Learner
DiGSeg repurposes diffusion models for versatile, text-conditioned segmentation across diverse domains without custom architectures.
2604.24575
Software EngineeringFigma2Code: Automating Multimodal Design to Code in the Wild
Figma2Code automates design-to-code by leveraging rich multimodal Figma data, creating a new task and dataset to benchmark MLLMs.
2604.13648
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.