CHRep: Cross-modal Histology Representation and Post-hoc Calibration for Spatial Gene Expression Prediction
Changfan Wang, Xinran Wang, Donghai Liu, Fei Su, Lulu Sun + 2 more
TLDR
CHRep predicts spatial gene expression from H&E slides using a two-phase framework with topology-preserving representation and post-hoc calibration.
Key contributions
- Introduces CHRep, a two-phase framework for robust spatial gene expression prediction from H&E images.
- Learns structure-aware representations via correlation-aware regression, image-expression alignment, and spatial topology regularization.
- Employs a lightweight post-hoc calibration module for cross-slide robustness without fine-tuning the backbone model.
- Achieves significant improvements in gene-wise correlation (e.g., 39.5% PCC gain) under leave-one-slide-out evaluation.
Why it matters
This paper addresses the limitations of current spatial transcriptomics by offering a cost-effective, high-throughput alternative. CHRep's novel approach of combining topology-preserving learning with post-hoc calibration significantly improves prediction accuracy and robustness across different slides, making large-cohort studies more feasible.
Original Abstract
Spatial transcriptomics (ST) enables spatially resolved gene profiling but remains expensive and low-throughput, limiting large-cohort studies and routine clinical use. Predicting spatial gene expression from routine hematoxylin and eosin (H&E) slides is a promising alternative, yet under realistic leave-one-slide-out evaluation, existing models often suffer from slide-level appearance shifts and regression-driven over-smoothing that suppress biologically meaningful variation. CHRep is a two-phase framework for robust histology-to-expression prediction. In the training phase, CHRep learns a structure-aware representation by jointly optimizing correlation-aware regression, symmetric image-expression alignment, and coordinate-induced spatial topology regularization. In the inference phase, cross-slide robustness is improved without backbone fine-tuning through a lightweight calibration module trained on the training slides, which combines a non-parametric estimate from a training gallery with a magnitude-regularized correction module. Unlike prior embedding-alignment or retrieval-based transfer methods that rely on a single prediction route, CHRep couples topology-preserving representation learning with post-hoc calibration, enabling stable neighborhood retrieval and controlled bias correction under slide-level shifts. Across the three cohorts, CHRep consistently improves gene-wise correlation under leave-one-slide-out evaluation, with the largest gains observed on Alex+10x. Relative to HAGE, the Pearson correlation coefficient on all considered genes [PCC(ACG)] increases by 4.0% on cSCC and 9.8% on HER2+. Relative to mclSTExp, PCC(ACG) further improves by 39.5% on Alex+10x, together with 9.7% and 9.0% reductions in mean squared error (MSE) and mean absolute error (MAE), respectively.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.