CoCo-SAM3: Harnessing Concept Conflict in Open-Vocabulary Semantic Segmentation
Yanhui Chen, Baoyao Yang, Siqi Liu, Jingchao Wang
TLDR
CoCo-SAM3 enhances open-vocabulary semantic segmentation by resolving concept conflicts and unifying evidence scales for stable multi-class inference.
Key contributions
- Addresses overlapping masks and unstable competition in multi-class open-vocabulary segmentation.
- Mitigates intra-class drift by aligning and aggregating evidence from synonymous prompts.
- Introduces a unified, comparable scale for inter-class competition, enabling direct pixel-wise comparisons.
- Achieves consistent improvements across 8 benchmarks without requiring any additional training.
Why it matters
CoCo-SAM3 significantly improves open-vocabulary semantic segmentation. It resolves concept conflicts and unifies evidence scales, stabilizing multi-class inference and achieving consistent improvements without additional training.
Original Abstract
SAM3 advances open-vocabulary semantic segmentation by introducing a prompt-driven mask generation paradigm. However, in multi-class open-vocabulary scenarios, masks generated independently from different category prompts lack a unified and inter-class comparable evidence scale, often resulting in overlapping coverage and unstable competition. Moreover, synonymous expressions of the same concept tend to activate inconsistent semantic and spatial evidence, leading to intra-class drift that exacerbates inter-class conflicts and compromises overall inference stability. To address these issues, we propose CoCo-SAM3 (Concept-Conflict SAM3), which explicitly decouples inference into intra-class enhancement and inter-class competition. Our method first aligns and aggregates evidence from synonymous prompts to strengthen concept consistency. It then performs inter-class competition on a unified comparable scale, enabling direct pixel-wise comparisons among all candidate classes. This mechanism stabilizes multi-class inference and effectively mitigates inter-class conflicts. Without requiring any additional training, CoCo-SAM3 achieves consistent improvements across eight open-vocabulary semantic segmentation benchmarks.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.