Navigating the Conceptual Multiverse

April 20, 20262604.17815

Andre Ye, Jenny Y. Huang, Alicia Guo, Rose Novick, Tamara Broderick + 1 more

cs.HCcs.CLcs.CY

TLDR

The Conceptual Multiverse is an interactive system that reveals and allows users to navigate the hidden conceptual decisions made by language models.

Key contributions

Introduces the "conceptual multiverse" to make hidden LM decisions transparent and navigable.
Represents conceptual choices (framing, values) as an interactive, inspectable decision space.
Develops a rigorous verification framework aligned with expert domain reasoning norms.
Demonstrated utility across philosophy, alignment, and poetry for deeper problem understanding.

Why it matters

This paper addresses a critical limitation of language models: their opaque decision-making process. By providing a transparent and interactive system, it empowers users to understand, critique, and refine LM outputs. This enhances trust, utility, and the ability to develop a deeper understanding of complex problems.

Original Abstract

When language models answer open-ended problems, they implicitly make hidden decisions that shape their outputs, leaving users with uncontextualized answers rather than a working map of the problem; drawing on multiverse analysis from statistics, we build and evaluate the conceptual multiverse, an interactive system that represents conceptual decisions such as how to frame a question or what to value as a space users can transparently inspect, intervenably change, and check against principled domain reasoning; for this structure to be worth navigating rather than misleading, it must be rigorous and checkable against domain reasoning norms, so we develop a general verification framework that enforces properties of good decision structures like unambiguity and completeness calibrated by expert-level reasoning; across three domains, the conceptual multiverse helped participants develop a working map of the problem, with philosophy students rewriting essays with sharper framings and reversed theses, alignment annotators moving from surface preferences to reasoning about user intent and harm, and poets identifying compositional patterns that clarified their taste.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers