Expert-Guided Class-Conditional Goodness-of-Fit Scores for Interpretable Classification with Informative Missingness: An Application to Seismic Monitoring

April 16, 20262604.14809

Shahar Cohen, David M. Steinberg, Yael Radzyner, Yochai Ben Horin

stat.MLcs.LGstat.AP

TLDR

This paper introduces an expert-guided framework for interpretable classification with informative missingness, outperforming standard ML in small data settings.

Key contributions

Introduces an expert-guided framework for interpretable classification with informative missingness.
Uses expert knowledge to build class-conditional models and derive goodness-of-fit features.
These features quantify data agreement with expert models, aiding transparent decision-making.
Demonstrates superior performance over standard ML, especially with small training datasets.

Why it matters

This framework offers a novel approach to integrate expert knowledge and handle missing data in classification. Its interpretability and strong performance, even with small datasets, make it valuable for critical applications like seismic monitoring. It reduces expert workload and enhances trust in automated screening tools.

Original Abstract

We study a classification problem with three key challenges: pervasive informative missingness, the integration of partial prior expert knowledge into the learning process, and the need for interpretable decision rules. We propose a framework that encodes prior knowledge through an expert-guided class-conditional model for one or more classes, and use this model to construct a small set of interpretable goodness-of-fit features. The features quantify how well the observed data agree with the expert model, isolating the contributions of different aspects of the data, including both observed and missing components. These features are combined with a few transparent auxiliary summaries in a simple discriminative classifier, resulting in a decision rule that is easy to inspect and justify. We develop and apply the framework in the context of seismic monitoring used to assess compliance with the Comprehensive Nuclear-Test-Ban Treaty. We show that the method has strong potential as a transparent screening tool, reducing workload for expert analysts. A simulation designed to isolate the contribution of the proposed framework shows that this interpretable expert-guided method can even outperform strong standard machine-learning classifiers, particularly when training samples are small.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers