EMGFlow: Robust and Efficient Surface Electromyography Synthesis via Flow Matching
Boxuan Jiang, Chenyun Dai, Can Han
TLDR
EMGFlow uses Flow Matching for robust and efficient synthetic sEMG data generation, outperforming GANs and diffusion models to address data scarcity.
Key contributions
- First to apply Flow Matching for conditional sEMG data generation.
- Outperforms GANs and diffusion models in sEMG synthesis fidelity and utility.
- Achieves superior quality-efficiency trade-offs through optimized generation dynamics.
- Addresses data scarcity and limited subject diversity in sEMG gesture recognition.
Why it matters
Deep learning for sEMG is limited by data scarcity. EMGFlow provides a novel, efficient, and high-quality method for synthetic sEMG data generation using Flow Matching. This significantly improves data augmentation and the utility of myoelectric control systems.
Original Abstract
Deep learning-based surface electromyography (sEMG) gesture recognition is frequently bottlenecked by data scarcity and limited subject diversity. While synthetic data generation via Generative Adversarial Networks (GANs) and diffusion models has emerged as a promising augmentation strategy, these approaches often face challenges regarding training stability or inference efficiency. To bridge this gap, we propose EMGFlow, a conditional sEMG generation framework. To the best of our knowledge, this is the first study to investigate the application of Flow Matching (FM) and continuous-time generative modeling in the sEMG domain. To validate EMGFlow across three benchmark sEMG datasets, we employ a unified evaluation protocol integrating feature-based fidelity, distributional geometry, and downstream utility. Extensive evaluations show that EMGFlow outperforms conventional augmentation and GAN baselines, and provides stronger standalone utility than the diffusion baselines considered here under the train-on-synthetic test-on-real (TSTR) protocol. Furthermore, by optimizing generation dynamics through advanced numerical solvers and targeted time sampling, EMGFlow achieves improved quality-efficiency trade-offs. Taken together, these results suggest that Flow Matching is a promising and efficient paradigm for addressing data bottlenecks in myoelectric control systems. Our code is available at: https://github.com/Open-EXG/EMGFlow.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.