ArXiv TLDR

Generative Drifting for Conditional Medical Image Generation

🐦 Tweet
2604.19736

Zirong Li, Siyuan Mei, Weiwen Wu, Andreas Maier, Lina Gölz + 1 more

cs.CV

TLDR

GDM is a generative drifting framework that efficiently creates high-fidelity 3D medical images by balancing patient-specific details and distribution plausibility.

Key contributions

  • Introduces GDM, a generative drifting framework for efficient 3D medical image generation.
  • Uses an attractive-repulsive drift and multi-level feature bank for stable 3D volumetric learning.
  • Employs gradient coordination to balance patient-specific fidelity and distribution-level objectives.
  • Outperforms GANs, flow-matching, and SDEs in 3D medical image synthesis tasks like MRI-to-CT.

Why it matters

This paper introduces GDM, a novel generative drifting framework that addresses the critical challenge of balancing efficiency, fidelity, and plausibility in 3D medical image generation. By outperforming existing methods and offering one-step inference, GDM provides a practical and effective solution for clinically relevant imaging tasks.

Original Abstract

Conditional medical image generation plays an important role in many clinically relevant imaging tasks. However, existing methods still face a fundamental challenge in balancing inference efficiency, patient-specific fidelity, and distribution-level plausibility, particularly in high-dimensional 3D medical imaging. In this work, we propose GDM, a generative drifting framework that reformulates deterministic medical image prediction as a multi-objective learning problem to jointly promote distribution-level plausibility and patient-specific fidelity while retaining one-step inference. GDM extends drifting to 3D medical imaging through an attractive-repulsive drift that minimizes the discrepancy between the generator pushforward and the target distribution. To enable stable drifting-based learning in 3D volumetric data, GDM constructs a multi-level feature bank from a medical foundation encoder to support reliable affinity estimation and drifting field computation across complementary global, local, and spatial representations. In addition, a gradient coordination strategy in the shared output space improves optimization balance under competing distribution-level and fidelity-oriented objectives. We evaluate the proposed framework on two representative tasks, MRI-to-CT synthesis and sparse-view CT reconstruction. Experimental results show that GDM consistently outperforms a wide range of baselines, including GAN-based, flow-matching-based, and SDE-based generative models, as well as supervised regression methods, while improving the balance among anatomical fidelity, quantitative reliability, perceptual realism, and inference efficiency. These findings suggest that GDM provides a practical and effective framework for conditional 3D medical image generation.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.