ArXiv TLDR

3D MRI Image Pretraining via Controllable 2D Slice Navigation Task

🐦 Tweet
2605.06487

Yu Wang, Qingchao Chen

cs.CVcs.AI

TLDR

This paper introduces a novel self-supervised pretraining method for 3D MRI by converting volumes into controllable 2D slice navigation sequences.

Key contributions

  • Introduces a self-supervised method for 3D MRI pretraining using controllable 2D slice navigation.
  • Converts 3D MRI volumes into dense video-action sequences by rendering slices with continuous controls.
  • Employs an action-conditioned objective with a tokenizer and a latent dynamics model.
  • Outperforms static-volume baselines in learning anatomical and spatial representations.

Why it matters

Existing MRI pretraining methods are limited by static views. This paper introduces a dynamic, interactive approach, simulating slice navigation to capture richer spatial and anatomical information. This could lead to more robust and generalizable models for various downstream MRI tasks.

Original Abstract

Self-supervised pretraining has become the mainstream approach for learning MRI representations from unlabeled scans. However, most existing objectives still treat each scan primarily as static aggregations of slices, patches or volumes. We ask whether there exists an intrinsic form of self-supervision signal that is different from reconstructing the masked patches, through transforming the 3D volumes into controllable 2D rendered sequences: by rendering slices at continuous positions, orientations, and scales, a 3D volume can be converted into dense video-action sequences whose controls are the action trajectories. We study this formulation with an action-conditioned pretraining objective, where a tokenizer encodes slice observations and a latent dynamics model predicts the evolution of latent features. Across representative anatomical and spatial downstream tasks, the proposed pretraining is evaluated against standard static-volume baselines, tokenizer-only pretraining, and dynamics variants without aligned actions. These results suggest that controllable MRI slice navigation provides a useful complementary pretraining interface for learning anatomical and spatial representations from large unlabeled MRI collections.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.