ArXiv TLDR

Large Language Models Exhibit Normative Conformity

arXiv: 2604.19301

Mikako Bito, Keita Nishimoto, Kimitaka Asatani, Ichiro Sakata

cs.AI · cs.MA · cs.NE

TLDR

LLMs exhibit normative conformity, distinct from informational conformity, which leaves multi-agent systems vulnerable to manipulation and suggests that separate internal norm mechanisms are at work.

Key contributions

  • Introduces the social-psychological distinction between informational and normative conformity in LLMs (see the sketch after this list).
  • Finds that up to five of the six evaluated LLMs exhibit normative conformity.
  • Demonstrates that the target of an LLM's normative conformity can be steered by manipulating subtle aspects of the social context.
  • Suggests that distinct internal mechanisms drive the two types of conformity.
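
The paper's actual task materials are not given here; the following is a minimal sketch of the two-condition design described in the abstract, assuming an Asch-style judgment task, a unanimous wrong majority of confederate agents, and a hypothetical `query_model` helper in place of a real LLM client. Comparing conformity rates across the two framings is what separates informational from normative conformity.

```python
# Hypothetical harness illustrating the two-condition conformity probe.
# `query_model`, the task, and the framings are illustrative stand-ins,
# not the authors' actual materials.

def query_model(prompt: str) -> str:
    """Stand-in for any chat-completion call; wire up a real client here."""
    raise NotImplementedError

TASK = "Which comparison line matches the reference line in length: A, B, or C?"
CORRECT = "B"
CONFEDERATES = ["A", "A", "A"]  # unanimous wrong majority

FRAMINGS = {
    # Informational condition: accuracy is the stated motivation.
    "informational": "Your only goal is to give the most accurate answer.",
    # Normative condition: group acceptance is the stated motivation.
    "normative": "Your goal is to be accepted by the group and avoid conflict.",
}

def conforms(framing: str) -> bool:
    """True if the model abandons the correct answer for the majority's."""
    peers = "\n".join(f"Agent {i + 1} answered: {a}"
                      for i, a in enumerate(CONFEDERATES))
    prompt = f"{FRAMINGS[framing]}\n\n{TASK}\n{peers}\n\nAnswer with A, B, or C:"
    answer = query_model(prompt).strip().upper()[:1]
    return answer == CONFEDERATES[0] and answer != CORRECT
```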

Why it matters

This work reveals that LLM-based multi-agent systems are vulnerable to manipulation by a small number of malicious users because of LLMs' tendency to conform. It also offers initial insight into how 'norms' are implemented in LLMs and how they shape group dynamics.

Original Abstract

The conformity bias exhibited by large language models (LLMs) can pose a significant challenge to decision-making in LLM-based multi-agent systems (LLM-MAS). While many prior studies have treated "conformity" simply as a matter of opinion change, this study introduces the social psychological distinction between informational conformity and normative conformity in order to understand LLM conformity at the mechanism level. Specifically, we design new tasks to distinguish between informational conformity, in which participants in a discussion are motivated to make accurate judgments, and normative conformity, in which participants are motivated to avoid conflict or gain acceptance within a group. We then conduct experiments based on these task settings. The experimental results show that, among the six LLMs evaluated, up to five exhibited tendencies toward not only informational conformity but also normative conformity. Furthermore, intriguingly, we demonstrate that by manipulating subtle aspects of the social context, it may be possible to control the target toward which a particular LLM directs its normative conformity. These findings suggest that decision-making in LLM-MAS may be vulnerable to manipulation by a small number of malicious users. In addition, through analysis of internal vectors associated with informational and normative conformity, we suggest that although both behaviors appear externally as the same form of "conformity," they may in fact be driven by distinct internal mechanisms. Taken together, these results may serve as an initial milestone toward understanding how "norms" are implemented in LLMs and how they influence group dynamics.
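
The abstract mentions an analysis of internal vectors associated with each conformity type but does not specify the method. As a hedged illustration only, the sketch below uses difference-of-means activation directions, a standard probing technique assumed here rather than taken from the paper: extract hidden states from conforming and non-conforming runs in each condition, form a unit direction per condition, and compare the two.

```python
# Hypothetical probe for "distinct internal mechanisms": a
# difference-of-means direction over hidden states per conformity
# condition. Assumes per-example hidden-state vectors are available;
# the authors' exact method may differ.
import numpy as np

def conformity_direction(conform_acts: np.ndarray,
                         independent_acts: np.ndarray) -> np.ndarray:
    """Difference-of-means direction between conforming and
    non-conforming runs, unit-normalized."""
    d = conform_acts.mean(axis=0) - independent_acts.mean(axis=0)
    return d / np.linalg.norm(d)

def mechanism_overlap(info_dir: np.ndarray, norm_dir: np.ndarray) -> float:
    """Cosine similarity of the two unit directions."""
    return float(info_dir @ norm_dir)
```

Under these assumptions, a low cosine similarity between the informational and normative directions would be consistent with the paper's suggestion that the two behaviors, though externally identical, are driven by distinct internal mechanisms.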
