UIBenchKit: A unified toolkit for design-to-code model evaluation

May 13, 20262605.13141

Chinh T. Le, Trevor Ong Yee Siang, Jingyu Xiao, Yuxuan Wan, Yintong Huo

cs.SE

TLDR

UIBenchKit is an open-source toolkit that unifies the evaluation of design-to-code models, simplifying comparisons and accelerating research.

Key contributions

Unifies evaluation for design-to-code models, addressing lack of standardization.
Abstracts environment setup, model inference, and code rendering complexities.
Offers a plug-and-play architecture for consistent method comparison.
Provides an analytical interface for multi-metric benchmarking.

Why it matters

The absence of a standardized evaluation platform has limited progress in design-to-code generation. UIBenchKit bridges this gap by offering a unified, consistent environment, enabling fair comparisons and accelerating future research and innovation in web engineering.

Original Abstract

Recent years have seen substantial progress in automated design-to-code generation, with many methods proposed for generating HTML and CSS from webpage screenshots. However, the absence of a standardized evaluation platform makes it difficult to compare these methods fairly, limiting both practical adoption and systematic research progress. To bridge this gap, we introduce UIBenchKit, an open-source, integrated toolkit designed to unify the evaluation of design-to-code tasks. UIBenchKit abstracts the complexities of environment setup, model inference, and code rendering, offering researchers a plug-and-play architecture to compare various methods under consistent settings. In addition, it offers an analytical interface for comparison across multiple metrics. Using UIBenchKit, we conduct a benchmarking study of existing tools and derive several findings that highlight directions for future improvement. By providing a streamlined environment for both experimentation and evaluation, UIBenchKit aims to accelerate future benchmarking and innovations in web engineering. The evaluation platform and toolkit are available at the project page https://www.uibenchkit.com/.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers