Expert-Guided Class-Conditional Goodness-of-Fit Scores for Interpretable Classification with Informative Missingness: An Application to Seismic Monitoring
Shahar Cohen, David M. Steinberg, Yael Radzyner, Yochai Ben Horin
TLDR
This paper introduces an expert-guided framework for interpretable classification with informative missingness, outperforming standard ML in small data settings.
Key contributions
- Introduces an expert-guided framework for interpretable classification with informative missingness.
- Uses expert knowledge to build class-conditional models and derive goodness-of-fit features.
- These features quantify data agreement with expert models, aiding transparent decision-making.
- Demonstrates superior performance over standard ML, especially with small training datasets.
Why it matters
This framework offers a novel approach to integrate expert knowledge and handle missing data in classification. Its interpretability and strong performance, even with small datasets, make it valuable for critical applications like seismic monitoring. It reduces expert workload and enhances trust in automated screening tools.
Original Abstract
We study a classification problem with three key challenges: pervasive informative missingness, the integration of partial prior expert knowledge into the learning process, and the need for interpretable decision rules. We propose a framework that encodes prior knowledge through an expert-guided class-conditional model for one or more classes, and use this model to construct a small set of interpretable goodness-of-fit features. The features quantify how well the observed data agree with the expert model, isolating the contributions of different aspects of the data, including both observed and missing components. These features are combined with a few transparent auxiliary summaries in a simple discriminative classifier, resulting in a decision rule that is easy to inspect and justify. We develop and apply the framework in the context of seismic monitoring used to assess compliance with the Comprehensive Nuclear-Test-Ban Treaty. We show that the method has strong potential as a transparent screening tool, reducing workload for expert analysts. A simulation designed to isolate the contribution of the proposed framework shows that this interpretable expert-guided method can even outperform strong standard machine-learning classifiers, particularly when training samples are small.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.