ArXiv TLDR

Neurological Plausibility of AI-Generated Music for Commercial Environments: An In-Silico Cortical Investigation Using Wubble and TRIBE v2

🐦 Tweet
2604.04025

Shaad Sufi

q-bio.NCcs.SD

TLDR

This study uses an in-silico model to show that AI-generated music, when prompt-conditioned, can systematically shift predicted brain activity patterns.

Key contributions

  • Developed an in-silico framework combining Wubble (AI music) and TRIBE v2 (brain model) for neural pre-screening.
  • Found prompt-conditioned AI music systematically shifts predicted cortical activity patterns in auditory-temporal-prefrontal regions.
  • High-arousal, major-pop AI music elicited the strongest predicted whole-cortex and prefrontal responses.
  • Predicted cortical maps showed distinct spatial organization for low vs. high arousal AI music conditions.

Why it matters

This paper offers a novel in-silico method to pre-screen AI-generated music for commercial use based on predicted brain responses. It suggests AI music can plausibly influence brain states related to attention and value, paving the way for biologically informed music optimization.

Original Abstract

Background music shapes attention, affect, and approach behavior in commercial environments, yet the neural plausibility of AI-generated music for such settings remains poorly characterized. We present an in-silico pilot study that combines Wubble, a generative music system, with TRIBE v2, a publicly released whole-brain encoding model, to estimate cortical response profiles for prompt-conditioned retail music. Five fully instrumental tracks were generated to span low-to-high arousal, sparse-to-dense arrangement, and neutral-to-positive valence prompts, then analyzed with audio-only TRIBE v2 inference on loudness-normalized waveforms. Analysis focused on fsaverage5 cortical predictions summarized over auditory, superior temporal, temporo-parietal, and inferior frontal HCP parcels. The fast bright major-pop condition produced the largest whole-cortex mean activation (0.0402), the strongest prefrontal ROI composite response (0.0704), and the highest parcel means in IFJa (0.1102), IFJp (0.0995), A5 (0.0188), and area 45 (0.0015). Pairwise spatial correlations ranged from 0.787 to 0.974, indicating that prompt variation modulated predicted cortical states rather than yielding a single undifferentiated response profile. Predicted cortical surface maps further revealed visually distinct spatial organization between low-arousal and high-arousal conditions. These results support a cautious claim of cortical neurological plausibility: prompt-conditioned AI music can systematically shift predicted auditory-temporal-prefrontal patterns relevant to salience and valuation. Although the study does not establish subcortical reward engagement or consumer behavior, it provides a reproducible framework for neural pre-screening and pre-optimization of commercial music generation against biologically informed cortical proxies.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.