Benchmarking Heritability Estimation Strategies Across 86 Configurations and Their Downstream Effect on Polygenic Risk Score Performance
Muhammad Muneeb, David B. Ascher
TLDR
This study benchmarks 86 heritability estimation strategies, finding significant variability in estimates but surprisingly robust polygenic risk score performance.
Key contributions
- Benchmarked 86 heritability estimation configurations across 6 tool families and 10 UK Biobank phenotypes.
- Found heritability estimates varied widely (e.g., -0.862 to 2.735), with algorithm choice having the largest effect.
- Demonstrated that downstream polygenic risk score (PRS) performance is only weakly coupled to heritability magnitude.
- Concluded SNP heritability is a configuration-sensitive parameter, not a stable scalar, requiring full specification.
Why it matters
This paper highlights that SNP heritability is a configuration-dependent parameter, not a universal constant. It underscores the importance of reporting full estimation specifications and shows that polygenic risk score performance is robust to moderate variations in heritability inputs. This is crucial for reliable genetic research.
Original Abstract
Objective: SNP heritability estimates vary substantially across estimation strategies, yet the downstream consequences for polygenic risk score (PRS) construction remain poorly characterised. We systematically benchmarked heritability estimation configurations and assessed their propagation into downstream PRS performance. Methods: We benchmarked 86 heritability-estimation configurations spanning six tool families (GEMMA, GCTA, LDAK, DPR, LDSC, and SumHer) and ten method groups across 10 UK Biobank phenotypes, yielding 844 configuration-level estimates. Each estimate was propagated into GCTA-SBLUP and LDpred2-lassosum2 PRS frameworks and evaluated across five cross-validation folds using null, PRS-only, and full models. Eleven binary analytical contrasts were tested using Mann-Whitney U tests to identify drivers of heritability variability. Results: Heritability ranged from -0.862 to 2.735 (mean = 0.134, SD = 0.284), with 133 of 844 estimates (15.8%) being negative and concentrated in unconstrained estimation regimes. Ten of eleven analytical contrasts significantly affected heritability magnitude, with algorithm choice and GRM standardisation showing the largest effects. Despite this upstream variability, downstream PRS test performance was only weakly coupled to heritability magnitude: pooled Pearson correlations between h^2 and test AUC were r = -0.023 for GCTA-SBLUP and r = +0.014 for LDpred2-lassosum2, with both being non-significant. Conclusion: SNP heritability is best interpreted as a configuration-sensitive modelling parameter rather than a universally stable scalar input. Heritability estimates should always be reported alongside their full estimation specification, and downstream PRS performance is comparatively robust to moderate variation in the heritability input.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.