Do Emotions in Prompts Matter? Effects of Emotional Framing on Large Language Models
Minda Zhao, Yutong Yang, Chufei Peng, Rachel Gonsalves, Weiyue Li + 3 more
TLDR
Emotional framing in LLM prompts typically causes small, variable changes, but adaptive emotional prompting can yield more reliable gains.
Key contributions
- Static emotional prefixes usually produce only small accuracy changes across LLM tasks.
- Effects of emotional framing are more variable in socially grounded tasks.
- Stronger emotional wording or human-written prefixes show similar modest changes.
- Introduces EmotionRL, an adaptive framework, yielding more reliable gains than fixed emotional prompts.
Why it matters
This paper clarifies that emotional tone is a weak, input-dependent signal for LLMs, not a dominant driver. It demonstrates that while fixed emotional prompts offer limited benefit, adaptive strategies can effectively leverage this signal, guiding future prompt engineering and LLM development.
Original Abstract
Emotional tone is pervasive in human communication, yet its influence on large language model (LLM) behaviour remains unclear. Here, we examine how first-person emotional framing in user-side queries affect LLM performance across six benchmark domains, including mathematical reasoning, medical question answering, reading comprehension, commonsense reasoning and social inference. Across models and tasks, static emotional prefixes usually produce only small changes in accuracy, suggesting that affective phrasing is typically a mild perturbation rather than a reliable general-purpose intervention. This stability is not uniform: effects are more variable in socially grounded tasks, where emotional context more plausibly interacts with interpersonal reasoning. Additional analyses show that stronger emotional wording induces only modest extra change, and that human-written prefixes reproduce the same qualitative pattern as LLM-generated ones. We then introduce EmotionRL, an adaptive emotional prompting framework that selects emotional framing adaptively for each query. Although no single emotion is consistently beneficial, adaptive selection yields more reliable gains than fixed emotional prompting. Together, these findings show that emotional tone is neither a dominant driver of LLM performance nor irrelevant noise, but a weak and input-dependent signal that can be exploited through adaptive control.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.