ReVis: Towards Reusable Image-Based Visualizations with MLLMs
Xiaolin Wen, Changlin Li, Manusha Karunathilaka, Can Liu, Fangzhuo Jin + 1 more
TLDR
ReVis is a human-AI system that uses MLLMs and a DSL to parse and reproduce image-based visualizations, enabling flexible reuse and customization.
Key contributions
- Enables flexible reuse of image-based visualizations via a human-AI collaboration approach.
- Introduces a generic Domain-Specific Language (DSL) to model, decompose, and reproduce complex visualizations.
- Uses an MLLM-based pipeline to parse image-based visualizations into the DSL, capturing structures and encodings.
- Offers an interactive interface for users to upload images, inspect results, update data, and customize visuals.
Why it matters
This paper addresses the challenge of reusing image-based visualizations, which are currently difficult to adapt. ReVis provides a novel human-AI system that simplifies this process, making complex designs accessible for customization and new data. This significantly reduces the expertise and time needed for visualization practitioners.
Original Abstract
Many expressive visualizations are shared online only as bitmap images, making them difficult to redesign or adapt to new data. Reusing such image-based visualizations requires substantial expertise and is often time-consuming, even for experienced visualization practitioners. Existing work on reproducing visualizations often relies on structured SVG or specifications, supports limited visualization types, and offers limited flexibility for customization. To address these challenges, we present ReVis, a human-AI collaboration approach that enables flexible reuse of image-based visualizations. First, a generic Domain-Specific language (DSL) is proposed to model complex visualizations and support both visualization decomposition and reproduction. Then, ReVis employs an MLLM-based pipeline to parse an image-based visualization into the DSL, delineating its core visual structures and data-to-encoding mappings, and further reproduces the visualization from the DSL. Finally, ReVis includes an interactive interface to allow users to upload visualization images, inspect reproduced results, update the underlying data, and customize visual encodings. A gallery of 40 visualizations demonstrates the expressiveness of the DSL, and a quantitative study evaluates the reproduction quality of ReVis on these examples. Two usage scenarios and user interviews with 16 visualization practitioners demonstrate the effectiveness of ReVis.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.