ArXiv TLDR

NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures

🐦 Tweet
2604.28059

Muhammad Ihsan Al Hafiz, Artur Podobas

cs.ARcs.DCcs.NE

TLDR

NeuroRing is a scalable multi-FPGA SNN accelerator using a stream-dataflow and bidirectional ring topology for efficient, faster-than-real-time simulation.

Key contributions

  • Introduces NeuroRing, a modular SNN accelerator on FPGAs using stream-dataflow and a bidirectional ring.
  • Achieves faster-than-real-time execution (RTF 0.83) for full-scale cortical microcircuits.
  • Demonstrates strong and weak scaling with competitive energy efficiency across multiple FPGAs.
  • Integrates with NEST simulator, supporting flexible single- and multi-FPGA deployments.

Why it matters

Large-scale SNNs are challenging due to communication overhead. NeuroRing offers a flexible, high-performance FPGA-based solution, enabling faster neuroscience simulations and broader event-driven applications. This advances energy-efficient AI research.

Original Abstract

Spiking neural networks (SNNs) are a promising paradigm for energy-efficient event-driven computation, but large-scale SNN execution remains challenging because sparse spike communication and synchronization can dominate runtime. Existing solutions across CPU, GPU, ASIC, and FPGA platforms offer different trade-offs between programmability, efficiency, and scalability. To address this gap, we present NeuroRing, a modular and scalable SNN accelerator based on a stream-dataflow architecture and a bidirectional ring topology, implemented in High-Level Synthesis (HLS) on programmable FPGAs. NeuroRing supports modular single- and multi-FPGA deployment and is compatible with existing SNN workflows through integration with the NEST simulator. We evaluate NeuroRing on the cortical microcircuit benchmark and a Sudoku constraint-satisfaction workload. Results show that NeuroRing preserves the key activity statistics of the NEST reference model, achieves faster-than-real-time execution of the full-scale cortical microcircuit with a real-time factor (RTF) of 0.83, exhibits meaningful strong and weak scaling, and provides competitive energy efficiency on two programmable FPGAs. These results position NeuroRing as a flexible and scalable platform for both neuroscience simulation and broader event-driven applications.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.