Hyp2Former: Hierarchy-Aware Hyperbolic Embeddings for Open-Set Panoptic Segmentation
Yao Lu, Rohit Mohan, Florian Drews, Yakov Miron, Abhinav Valada
TLDR
Hyp2Former uses hierarchy-aware hyperbolic embeddings for Open-Set Panoptic Segmentation, improving unknown object detection without explicit training.
Key contributions
- Introduces Hyp2Former, an end-to-end framework for Open-Set Panoptic Segmentation (OPS).
- Learns hierarchical semantic similarities in hyperbolic space to detect unknowns without explicit modeling.
- Explicitly encodes known category hierarchies to create a structured embedding space.
- Achieves state-of-the-art OPS performance on MS COCO, Cityscapes, and Lost&Found datasets.
Why it matters
This paper addresses the critical challenge of recognizing unknown objects in safety-critical systems. By leveraging semantic hierarchies in hyperbolic space, it significantly improves Open-Set Panoptic Segmentation without needing explicit unknown object training, making it more robust and practical.
Original Abstract
Recognizing unknown objects is crucial for safety-critical applications such as autonomous driving and robotics. Open-Set Panoptic Segmentation (OPS) aims to segment known thing and stuff classes while identifying valid unknown objects as separate instances. Prior OPS approaches largely treat known categories as a flat label set, ignoring the semantic hierarchy that provides valuable structural priors for distinguishing unknown objects from in-distribution classes. In this work, we propose Hyp2Former, an end-to-end framework for OPS that does not require explicit modeling of unknowns during training, and instead learns hierarchical semantic similarities continuously in hyperbolic space. By explicitly encoding hierarchical relationships among known categories, the model learns a structured embedding space that captures multiple levels of semantic abstraction. As a result, unknown objects that cannot be confidently classified as known categories still remain in close proximity to higher-level concepts (e.g., an unknown animal remains closer to "animal" or "object" than to unrelated concepts such as "electronics" or "stuff") and can therefore be reliably detected, even if their fine-grained category was not represented during training. Empirical evaluations across multiple public datasets such as MS COCO, Cityscapes, and Lost&Found demonstrate that Hyp2Former outperforms existing methods on OPS, achieving the best balance between unknown object discovery and in-distribution robustness.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.