ArXiv TLDR

PhyloSDF: Phylogenetically-Conditioned Neural Generation of 3D Skull Morphology via Residual Flow Matching

🐦 Tweet
2604.25371

Kaikwan Lau, Gary P. T. Choi

q-bio.QMcs.CV

TLDR

PhyloSDF generates realistic 3D biological shapes, like skulls, by integrating phylogenetic conditioning and a novel Residual Flow Matching.

Key contributions

  • DeepSDF auto-decoder uses a Phylogenetic Consistency Loss to align latent space with evolutionary distances (r=0.993).
  • Residual Conditional Flow Matching (Residual CFM) enables generation from as few as ~4 specimens per species.
  • Generates novel 3D skull meshes for Darwin's Finches, achieving 88-129% of real intra-species variation.
  • Outperforms diffusion models and standard flow matching in fidelity and morphometric Fréchet distance.

Why it matters

This paper addresses the challenge of generating biologically plausible 3D structures with scarce data, crucial for evolutionary biology. Its novel approach allows for accurate shape generation and phylogenetic extrapolation, even with limited samples. This could revolutionize studies of morphological evolution and ancestral reconstruction.

Original Abstract

Generating novel, biologically plausible three-dimensional morphological structures is a fundamental challenge in computational evolutionary biology, hampered by extreme data scarcity and the requirement that generated shapes respect phylogenetic relationships among species. In this work, we present PhyloSDF, a phylogenetically-conditioned neural generative model for 3D biological morphology that integrates two innovations: (1) a DeepSDF auto-decoder regularized by a novel Phylogenetic Consistency Loss that structures the latent space to correlate with evolutionary distances (Pearson $r=0.993$); (2) a Residual Conditional Flow Matching (Residual CFM) architecture that factorizes generation into analytic species-centroid lookup and learned residual prediction, enabling generation from as few as ~4 specimens per species. We evaluate PhyloSDF on 100 micro-CT-scanned skulls of Darwin's Finches and their relatives across 24 species. The model generates novel meshes achieving 88-129% of real intra-species variation at the code level, with all 180 generated meshes verified as non-memorized. Residual CFM surpasses denoising diffusion (which fails entirely at this scale), standard flow matching (which mode-collapses to 3-6% variation), and a Gaussian mixture baseline in both fidelity (Chamfer Distance 0.00181 vs. 0.00190) and morphometric Fréchet distance (10,641 vs. 13,322). Leave-one-species-out experiments across 18 species demonstrate phylogenetic extrapolation capability, and smooth latent interpolations produce biologically plausible ancestral skull reconstructions.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.