Uncertainty-Aware Pedestrian Attribute Recognition via Evidential Deep Learning
Zhuofan Lou, Shihang Zhang, Fangle Zhu, Shengjie Ye, Pingyu Wang
TLDR
UAPAR is the first EDL-based framework for pedestrian attribute recognition, improving robustness by quantifying prediction uncertainty on low-quality samples.
Key contributions
- Introduces UAPAR, the first EDL-based framework for uncertainty-aware pedestrian attribute recognition.
- Integrates Evidential Deep Learning (EDL) into a CLIP-based architecture to estimate prediction uncertainty.
- Employs a Region-Aware Evidence Reasoning module for fine-grained feature capture and epistemic uncertainty.
- Develops an uncertainty-guided dual-stage curriculum learning strategy to reduce label noise effects.
Why it matters
Conventional pedestrian attribute recognition methods often fail to assess prediction reliability on challenging samples. UAPAR enhances system robustness by providing crucial uncertainty estimates, making it more reliable for real-world applications. This is vital for critical scenarios like surveillance where prediction confidence matters.
Original Abstract
We propose UAPAR, an Uncertainty-Aware Pedestrian Attribute Recognition framework. To the best of our knowledge, this is the first EDL-based uncertainty-aware framework for pedestrian attribute recognition (PAR). Unlike conventional deterministic methods, which fail to assess prediction reliability on low-quality samples, UAPAR effectively identifies unreliable predictions and thus enhances system robustness in complex real-world scenarios. To achieve this, UAPAR incorporates Evidential Deep Learning (EDL) into a CLIP-based architecture. Specifically, a Region-Aware Evidence Reasoning module employs cross-attention and spatial prior masks to capture fine-grained local features, which are further processed by an evidence head to estimate attribute-wise epistemic uncertainty. To further enhance training robustness, we develop an uncertainty-guided dual-stage curriculum learning strategy to alleviate the adverse effects of severe label noise during training. Extensive experiments on the PA100K, PETA, RAPv1, and RAPv2 datasets demonstrate that UAPAR achieves competitive or superior performance. Furthermore, qualitative results confirm that the proposed framework generates uncertainty estimates that are predictive of challenging or erroneous samples.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.