Functional Misalignment in Human-AI Interactions on Digital Platforms
TLDR
This paper introduces 'functional misalignment' to explain how AI optimizing for predictable behavior, not human goals, causes negative societal outcomes.
Key contributions
- Introduces "functional misalignment" as a unifying framework for human-AI interactions.
- Details three mechanisms causing misalignment: reactive bias, feedback loops, and collective dynamics.
- Explains how accurate individual-level predictions can produce adverse societal outcomes.
- Proposes a research agenda to study and mitigate these identified effects.
Why it matters
This paper offers a crucial framework to understand why successful AI systems can lead to negative societal outcomes like mental health issues and polarization. By identifying 'functional misalignment,' it provides a unifying lens to analyze these problems and outlines a path for future research and mitigation strategies.
Original Abstract
Algorithmic systems, particularly social media recommenders, have achieved remarkable success in predicting behavior. By optimizing for observable signals such as clicks, views, and engagement, these systems effectively capture user attention and guide interaction. Yet their widespread adoption has coincided with troubling outcomes, including rising mental health concerns, increasing polarization, and erosion of trust. This paper argues that these effects are consequences of a structural functional misalignment between what algorithms optimize - predictable behavior - and the human goals these predictions are intended to serve. We propose that this misalignment arises through three mechanisms: (1) a bias toward modeling fast, reactive behavioral signals over reflective judgment, (2) feedback loops that couple user behavior with algorithmic learning, and (3) emergent collective dynamics that amplify these effects at scale. Together, these mechanisms explain how accurate individual-level predictions can produce adverse societal outcomes. We present functional misalignment as a unifying framework and outline a research agenda for studying and mitigating its effects in human-AI interaction systems.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.