NeuroRing: Scaling Spiking Neural Networks via Multi-FPGA Bidirectional Ring Topologies and Stream-Dataflow Architectures
Muhammad Ihsan Al Hafiz, Artur Podobas
TLDR
NeuroRing is a scalable multi-FPGA SNN accelerator using a stream-dataflow and bidirectional ring topology for efficient, faster-than-real-time simulation.
Key contributions
- Introduces NeuroRing, a modular SNN accelerator on FPGAs using stream-dataflow and a bidirectional ring.
- Achieves faster-than-real-time execution (RTF 0.83) for full-scale cortical microcircuits.
- Demonstrates strong and weak scaling with competitive energy efficiency across multiple FPGAs.
- Integrates with NEST simulator, supporting flexible single- and multi-FPGA deployments.
Why it matters
Large-scale SNNs are challenging due to communication overhead. NeuroRing offers a flexible, high-performance FPGA-based solution, enabling faster neuroscience simulations and broader event-driven applications. This advances energy-efficient AI research.
Original Abstract
Spiking neural networks (SNNs) are a promising paradigm for energy-efficient event-driven computation, but large-scale SNN execution remains challenging because sparse spike communication and synchronization can dominate runtime. Existing solutions across CPU, GPU, ASIC, and FPGA platforms offer different trade-offs between programmability, efficiency, and scalability. To address this gap, we present NeuroRing, a modular and scalable SNN accelerator based on a stream-dataflow architecture and a bidirectional ring topology, implemented in High-Level Synthesis (HLS) on programmable FPGAs. NeuroRing supports modular single- and multi-FPGA deployment and is compatible with existing SNN workflows through integration with the NEST simulator. We evaluate NeuroRing on the cortical microcircuit benchmark and a Sudoku constraint-satisfaction workload. Results show that NeuroRing preserves the key activity statistics of the NEST reference model, achieves faster-than-real-time execution of the full-scale cortical microcircuit with a real-time factor (RTF) of 0.83, exhibits meaningful strong and weak scaling, and provides competitive energy efficiency on two programmable FPGAs. These results position NeuroRing as a flexible and scalable platform for both neuroscience simulation and broader event-driven applications.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.