ArXiv TLDR

PPI-Net connects molecular protein interactions to functional processes in disease

🐦 Tweet
2605.07838

Kyle Higgins, Guadalupe Gonzalez, Dennis Veselkov, Ivan Laponogov, Kirill Veselkov

q-bio.QMcs.AIcs.LG

TLDR

PPI-Net is a hierarchical GNN integrating protein interaction networks with pathway representations to model disease from molecular interactions to functional processes.

Key contributions

  • Introduces PPI-Net, a hierarchical graph neural network for disease modeling.
  • Integrates protein-protein interaction networks (STRING) with pathway hierarchies (Reactome).
  • Achieves over 90% balanced accuracy in predicting disease across ten cancer types.
  • Improves interpretability, recovering canonical oncogenic modules and revealing coherent programs.

Why it matters

This paper introduces PPI-Net, a novel approach that bridges the gap between molecular interactions and functional processes in disease. By integrating PPI networks with pathway hierarchies, it offers both high predictive accuracy and crucial mechanistic insights into complex diseases like cancer, advancing our understanding of disease propagation.

Original Abstract

Understanding how molecular alterations propagate across biological systems to drive disease remains a central challenge. Although high-throughput profiling enables comprehensive characterization of tumor states, most models neglect structured biological relationships or lack interpretability across scales. Here we present PPI-Net, a hierarchical graph neural network that integrates protein-protein interaction (PPI) networks with pathway-level representations to model disease from molecular interactions to functional processes. Patient-specific molecular profiles are embedded within a shared interaction network from STRING and propagated through a multi-layer Reactome hierarchy using graph attention, enabling aggregation of gene-level signals into higher-order biological programs. Across RNA-seq data from ten cancer types from The Cancer Genome Atlas, PPI-Net achieves robust predictive performance, with balanced accuracy exceeding 90% in multiple cohorts. Comparative analysis on RNA-Seq data from breast cancer demonstrated that PPI-Net's integration of the Reactome hierarchy improved balanced accuracy by 6.7% relative to a PPI-only model, while hierarchical multi-level supervision improved balanced accuracy by 12.3% relative to using only a single top-level prediction head. Applying a multi-omics approach using RNA-seq and methylation data improves model interpretation, recovering canonical oncogenic modules, including TP53-AKT signaling and stress response pathways, while revealing convergence onto coherent programs such as ion signaling and cellular responses to stimuli. These results demonstrate that integrating interaction networks with pathway hierarchies enables accurate prediction while providing mechanistic insight into cancer biology.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.