Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation

April 13, 20262604.11775

Ricardo Coimbra Brioso, Giulio Sichili, Damiano Dei, Nicola Lambri, Pietro Mancosu + 2 more

cs.CVcs.AI

TLDR

This paper introduces an efficient KernelSHAP framework for 3D medical image segmentation, accelerating explanations for patch-based models.

Key contributions

Introduces an efficient KernelSHAP framework for 3D medical image segmentation.
Accelerates inference using patch logit caching, reducing computation by 15-30%.
Compares whole-organ, regular supervoxels, and organ-aware supervoxels for feature abstraction.
Organ-aware units provide more clinically interpretable explanations, especially for false positives.

Why it matters

Perturbation-based explainability like KernelSHAP is often too slow for 3D medical image segmentation. This work makes it efficient and practical, significantly reducing computation. It also explores feature abstractions to yield more clinically interpretable explanations, crucial for medical AI adoption.

Original Abstract

Perturbation-based explainability methods such as KernelSHAP provide model-agnostic attributions but are typically impractical for patch-based 3D medical image segmentation due to the large number of coalition evaluations and the high cost of sliding-window inference. We present an efficient KernelSHAP framework for volumetric CT segmentation that restricts computation to a user-defined region of interest and its receptive-field support, and accelerates inference via patch logit caching, reusing baseline predictions for unaffected patches while preserving nnU-Net's fusion scheme. To enable clinically meaningful attributions, we compare three automatically generated feature abstractions within the receptive-field crop: whole-organ units, regular FCC supervoxels, and hybrid organ-aware supervoxels, and we study multiple aggregation/value functions targeting stabilizing evidence (TP/Dice/Soft Dice) or false-positive behavior. Experiments on whole-body CT segmentations show that caching substantially reduces redundant computation (with computational savings ranging from 15% to 30%) and that faithfulness and interpretability exhibit clear trade-offs: regular supervoxels often maximize perturbation-based metrics but lack anatomical alignment, whereas organ-aware units yield more clinically interpretable explanations and are particularly effective for highlighting false-positive drivers under normalized metrics.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers