Functional Misalignment in Human-AI Interactions on Digital Platforms

April 13, 20262604.11459

cs.CYcs.HC

TLDR

This paper introduces 'functional misalignment' to explain how AI optimizing for predictable behavior, not human goals, causes negative societal outcomes.

Key contributions

Introduces "functional misalignment" as a unifying framework for human-AI interactions.
Details three mechanisms causing misalignment: reactive bias, feedback loops, and collective dynamics.
Explains how accurate individual-level predictions can produce adverse societal outcomes.
Proposes a research agenda to study and mitigate these identified effects.

Why it matters

This paper offers a crucial framework to understand why successful AI systems can lead to negative societal outcomes like mental health issues and polarization. By identifying 'functional misalignment,' it provides a unifying lens to analyze these problems and outlines a path for future research and mitigation strategies.

Original Abstract

Algorithmic systems, particularly social media recommenders, have achieved remarkable success in predicting behavior. By optimizing for observable signals such as clicks, views, and engagement, these systems effectively capture user attention and guide interaction. Yet their widespread adoption has coincided with troubling outcomes, including rising mental health concerns, increasing polarization, and erosion of trust. This paper argues that these effects are consequences of a structural functional misalignment between what algorithms optimize - predictable behavior - and the human goals these predictions are intended to serve. We propose that this misalignment arises through three mechanisms: (1) a bias toward modeling fast, reactive behavioral signals over reflective judgment, (2) feedback loops that couple user behavior with algorithmic learning, and (3) emergent collective dynamics that amplify these effects at scale. Together, these mechanisms explain how accurate individual-level predictions can produce adverse societal outcomes. We present functional misalignment as a unifying framework and outline a research agenda for studying and mitigating its effects in human-AI interaction systems.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers