Monte Carlo Stochastic Depth for Uncertainty Estimation in Deep Learning

April 14, 20262604.12719

Adam T. Müller, Tobias Rögelein, Nicolaj C. Stache

cs.LGstat.ML

TLDR

Monte Carlo Stochastic Depth (MCSD) is introduced for robust uncertainty estimation, with theoretical Bayesian links and strong empirical performance.

Key contributions

Provides theoretical insights connecting MCSD to principled approximate variational inference.
First comprehensive benchmark of MCSD on state-of-the-art object detectors (YOLO, RT-DETR).
Achieves competitive accuracy, improved calibration (ECE), and uncertainty ranking (AUARC) over MCD.
Establishes MCSD as a robust, computationally efficient, and theoretically-grounded UQ tool.

Why it matters

Reliable uncertainty quantification is crucial for deep learning in safety-critical systems. This paper establishes Monte Carlo Stochastic Depth (MCSD) as a theoretically-grounded and empirically-validated method. It offers a computationally efficient alternative, improving calibration for modern architectures.

Original Abstract

The deployment of deep neural networks in safety-critical systems necessitates reliable and efficient uncertainty quantification (UQ). A practical and widespread strategy for UQ is repurposing stochastic regularizers as scalable approximate Bayesian inference methods, such as Monte Carlo Dropout (MCD) and MC-DropBlock (MCDB). However, this paradigm remains under-explored for Stochastic Depth (SD), a regularizer integral to the residual-based backbones of most modern architectures. While prior work demonstrated its empirical promise for segmentation, a formal theoretical connection to Bayesian variational inference and a benchmark on complex, multi-task problems like object detection are missing. In this paper, we first provide theoretical insights connecting Monte Carlo Stochastic Depth (MCSD) to principled approximate variational inference. We then present the first comprehensive empirical benchmark of MCSD against MCD and MCDB on state-of-the-art detectors (YOLO, RT-DETR) using the COCO and COCO-O datasets. Our results position MCSD as a robust and computationally efficient method that achieves highly competitive predictive accuracy (mAP), notably yielding slight improvements in calibration (ECE) and uncertainty ranking (AUARC) compared to MCD. We thus establish MCSD as a theoretically-grounded and empirically-validated tool for efficient Bayesian approximation in modern deep learning.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers