ArXiv TLDR

Amodal SAM: A Unified Amodal Segmentation Framework with Generalization

🐦 Tweet
2604.20748

Bo Zhang, Zhuotao Tian, Xin Tao, Songlin Tang, Jun Yu + 1 more

cs.CV

TLDR

Amodal SAM unifies amodal image and video segmentation by extending SAM's generalization, achieving SOTA performance and robust generalization.

Key contributions

  • Introduces a lightweight Spatial Completion Adapter for occluded region reconstruction.
  • Develops Target-Aware Occlusion Synthesis (TAOS) to generate diverse synthetic training data.
  • Proposes novel learning objectives for regional consistency and topological regularization.

Why it matters

This paper addresses the critical challenge of amodal segmentation generalization. By extending SAM's capabilities and introducing innovative components, Amodal SAM advances practical systems for real-world environments.

Original Abstract

Amodal segmentation is a challenging task that aims to predict the complete geometric shape of objects, including their occluded regions. Although existing methods primarily focus on amodal segmentation within the training domain, these approaches often lack the generalization capacity to extend effectively to novel object categories and unseen contexts. This paper introduces Amodal SAM, a unified framework that leverages SAM (Segment Anything Model) for both amodal image and amodal video segmentation. Amodal SAM preserves the powerful generalization ability of SAM while extending its inherent capabilities to the amodal segmentation task. The improvements lie in three aspects: (1) a lightweight Spatial Completion Adapter that enables occluded region reconstruction, (2) a Target-Aware Occlusion Synthesis (TAOS) pipeline that addresses the scarcity of amodal annotations by generating diverse synthetic training data, and (3) novel learning objectives that enforce regional consistency and topological regularization. Extensive experiments demonstrate that Amodal SAM achieves state-of-the-art performance on standard benchmarks, while simultaneously exhibiting robust generalization to novel scenarios. We anticipate that this research will advance the field toward practical amodal segmentation systems capable of operating effectively in unconstrained real-world environments.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.