NeuralSet: A High-Performing Python Package for Neuro-AI

arXiv:2605.03169

Jean-Rémi King, Corentin Bel, Linnea Evanson, Julien Gadonneix, Sophia Houhamdi + 23 more

q-bio.NC

TLDR

NeuralSet is a Python package that unifies the processing of diverse neural recordings and experimental stimuli for neuro-AI research, and scales from local prototyping to cluster execution.

Key contributions

  • Unifies diverse neural recordings (fMRI, M/EEG, spikes) and complex stimuli (text, audio, video).
  • Decouples metadata from lazy, memory-efficient data extraction for massive datasets.
  • Harmonizes neuroscientific preprocessing with pretrained deep learning embeddings.
  • Offers a PyTorch-ready interface that scales from local prototyping to HPC clusters.
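
The second and fourth points describe a general pattern that can be sketched without NeuralSet's actual API, which this summary does not show. In the minimal sketch below, every name (`Event`, `LazyRecordingDataset`, `fake_loader`) is hypothetical: a lightweight metadata table lists the segments, and signal data are read only when an item is requested. The class implements `__len__` and `__getitem__`, the map-style protocol that PyTorch's `DataLoader` accepts.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class Event:
    """Metadata only: where a segment lives, not the signal itself (hypothetical schema)."""
    recording: str   # identifier of the recording, e.g. a file path
    start: int       # first sample of the segment
    length: int      # number of samples in the segment
    stimulus: str    # label of the stimulus presented during the segment


class LazyRecordingDataset:
    """Map-style dataset: data are loaded per item, never all at once."""

    def __init__(self, events: Sequence[Event],
                 loader: Callable[[str, int, int], List[float]]):
        self.events = list(events)
        self.loader = loader
        self.loads = 0  # counts loader calls, to make the laziness visible

    def __len__(self) -> int:
        return len(self.events)

    def __getitem__(self, index: int) -> Tuple[List[float], str]:
        event = self.events[index]
        self.loads += 1
        signal = self.loader(event.recording, event.start, event.length)
        return signal, event.stimulus


def fake_loader(recording: str, start: int, length: int) -> List[float]:
    """Stand-in for reading samples from disk (assumption for illustration)."""
    return [float(start + i) for i in range(length)]


events = [
    Event("sub-01_run-1", 0, 4, "word"),
    Event("sub-01_run-1", 100, 4, "image"),
]
dataset = LazyRecordingDataset(events, fake_loader)
signal, stimulus = dataset[1]  # only this one segment is loaded
```

Building the dataset touches no signal data; the metadata table alone determines its length, so the memory cost stays proportional to the items actually requested.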

Why it matters

Neuro-AI tools are currently siloed by recording modality and built for small-scale, in-memory workflows. NeuralSet replaces that patchwork with a single memory-efficient, PyTorch-ready interface, which removes manual data wrangling and makes massive, naturalistic datasets practical to combine with modern AI models.

Original Abstract

Artificial intelligence (AI) is increasingly central to understanding how the brain processes information. However, the integration of neuroscience and modern AI is bottlenecked by a fragmented software ecosystem. Current tools are siloed by recording modality and optimized for small-scale, in-memory workflows, limiting the use of massive, naturalistic datasets. Here, we introduce NeuralSet, a Python framework that efficiently unifies the processing of diverse neural recordings (including fMRI, M/EEG, and spikes) and complex experimental stimuli (such as text, audio, and video). By decoupling experimental metadata from lazy, memory-efficient data extraction, NeuralSet harmonizes standard neuroscientific preprocessing pipelines with pretrained deep learning embeddings. This approach provides a single PyTorch-ready interface that scales seamlessly from local prototyping to high-performance cluster execution. By eliminating manual data wrangling and ensuring full computational provenance, NeuralSet establishes a scalable, unified infrastructure for the next generation of neuro-AI research.
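
The abstract does not say how computational provenance is recorded. One common approach, shown here purely as an assumption and not as NeuralSet's method, is to derive a deterministic identifier from each processing step and its parameters, so that any cached result can be traced back to the exact settings that produced it. The function `provenance_key` and the parameter names are hypothetical.

```python
import hashlib
import json


def provenance_key(step: str, params: dict) -> str:
    """Return a deterministic identifier for a processing step and its parameters."""
    # sort_keys makes the serialization independent of dictionary ordering
    payload = json.dumps({"step": step, "params": params}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()[:16]


key_a = provenance_key("bandpass", {"low_hz": 0.5, "high_hz": 30.0})
key_b = provenance_key("bandpass", {"high_hz": 30.0, "low_hz": 0.5})  # same settings, different order
key_c = provenance_key("bandpass", {"low_hz": 1.0, "high_hz": 30.0})  # one setting changed
```

Identical settings give the same key regardless of the order in which they are written, while any change to a parameter gives a different key.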
