Hierarchical Mesh Transformers with Topology-Guided Pretraining for Morphometric Analysis of Brain Structures
Yujian Xiong, Mohammad Farazi, Yanxi Chen, Wenhui Zhu, Xuanzhao Dong + 6 more
TLDR
A hierarchical transformer with topology-guided pretraining enables robust morphometric analysis of brain structures across diverse mesh types and clinical features.
Key contributions
- Introduces a hierarchical transformer for heterogeneous brain mesh analysis (volumetric & surface).
- Uses spatially adaptive tree partitions for efficient multi-scale attention on arbitrary mesh topologies.
- Employs a feature projection module to integrate diverse per-vertex clinical morphometric descriptors.
- Self-supervised pretraining via masked reconstruction improves transferability for downstream tasks.
Why it matters
Existing neuroimaging methods struggle with diverse mesh topologies and integrating multiple morphometric features. This paper introduces a unified, efficient hierarchical transformer that overcomes these limitations, enabling robust analysis across various brain structures. It significantly advances disease detection and prediction, offering a powerful tool for clinical research.
Original Abstract
Representation learning on large-scale unstructured volumetric and surface meshes poses significant challenges in neuroimaging, especially when models must incorporate diverse vertex-level morphometric descriptors, such as cortical thickness, curvature, sulcal depth, and myelin content, which carry subtle disease-related signals. Current approaches either ignore these clinically informative features or support only a single mesh topology, restricting their use across imaging pipelines. We introduce a hierarchical transformer framework designed for heterogeneous mesh analysis that operates on spatially adaptive tree partitions constructed from simplicial complexes of arbitrary order. This design accommodates both volumetric and surface discretizations within a single architecture, enabling efficient multi-scale attention without topology-specific modifications. A feature projection module maps variable-length per-vertex clinical descriptors into the spatial hierarchy, separating geometric structure from feature dimensionality and allowing seamless integration of different neuroimaging feature sets. Self-supervised pretraining via masked reconstruction of both coordinates and morphometric channels on large unlabeled cohorts yields a transferable encoder backbone applicable to diverse downstream tasks and mesh modalities. We validate our approach on Alzheimer's disease classification and amyloid burden prediction using volumetric brain meshes from ADNI, as well as focal cortical dysplasia detection on cortical surface meshes from the MELD dataset, achieving state-of-the-art results across all benchmarks.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.