Robust and Clinically Reliable EEG Biomarkers: A Cross Population Framework for Generalizable Parkinson's Disease Detection
Nicholas R. Rasmussen, Longwei Wang, Rodrigue Rizk, Md Rezwanul Akter Pallab, Samuel Stuwart + 3 more
TLDR
This paper introduces a population-aware framework for developing robust and generalizable EEG biomarkers to detect Parkinson's disease across diverse clinical cohorts.
Key contributions
- Proposes a population-aware evaluation framework for EEG biomarkers under distribution shift.
- Uses an n-gram expansion strategy for 75 cross-population train/test configurations.
- Employs nested cross-validation with channel selection for prospective biomarker ID.
- Achieves up to 94.1% accuracy on held-out cohorts, improving with training diversity.
Why it matters
Current EEG models often fail to generalize across diverse patient populations due to specific artifacts. This framework provides a principled approach to develop robust, clinically reliable biomarkers. It significantly improves Parkinson's disease detection in multi-site settings.
Original Abstract
Developing robust and clinically reliable EEG biomarkers requires evaluation frameworks that explicitly address cross population generalization in multi site settings such as Parkinsons disease (PD) detection. Models trained under i.i.d. assumptions often capture population specific artifacts rather than disease relevant neural structure, leading to poor generalization across clinical cohorts. EEG further amplifies this challenge due to low signal to noise ratio and heterogeneous acquisition conditions. We propose a population aware evaluation framework to assess the robustness and clinical reliability of EEG biomarkers under distribution shift. Using an n gram expansion strategy, we enumerate all cross population train test configurations across five independent cohorts, resulting in 75 directional evaluations. A nested cross validation design with integrated channel selection ensures prospective biomarker identification without population leakage. Results show that cross population transfer is asymmetric and that both accuracy and biomarker stability improve with increasing training population diversity, achieving up to 94.1% accuracy on held out cohorts. A theoretical analysis based on mixture risk optimization and hypothesis space contraction explains these trends, showing that multi population training promotes population robust representations. This work establishes a principled framework for learning robust, generalizable, and clinically reliable EEG biomarkers for multi site biomedical applications.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.