ArXiv TLDR

Shapley Value-Guided Adaptive Ensemble Learning for Explainable Financial Fraud Detection with U.S. Regulatory Compliance Validation

🐦 Tweet
2604.14231

Mohammad Nasir Uddin, Md Munna Aziz

cs.LGcs.AIcs.NE

TLDR

This paper introduces a SHAP-guided adaptive ensemble for explainable financial fraud detection, validated against U.S. regulatory compliance.

Key contributions

  • Evaluated explanation quality (faithfulness, stability); XGBoost+TreeExplainer achieved near-perfect stability.
  • Introduced SHAP-Guided Adaptive Ensemble (SGAE) for dynamic weighting, achieving highest AUC-ROC (0.9245 CV).
  • Performed full evaluation of LSTM, Transformer, and GNN-GraphSAGE on a 590k-transaction dataset.

Why it matters

AI fraud detection models often lack transparency, hindering regulatory compliance. This paper addresses this by providing explainable models and a novel ensemble method, improving both performance and auditability for financial institutions.

Original Abstract

Financial crime costs U.S. institutions over $32 billion each year. Although AI tools for fraud detection have become more advanced, their use in real-world systems still faces a major obstacle: many of these models operate as black boxes that cannot provide the transparent, auditable explanations required by regulations such as OCC Bulletin 2011-12 and Federal Reserve SR 11-7. This study makes three main contributions. First, it offers a thorough evaluation of explanation quality across faithfulness (sufficiency and comprehensiveness at k=5, 10, and 15) and stability (Kendall's W across 30 bootstrap samples). XGBoost paired with TreeExplainer achieves near-perfect stability (W=0.9912), while LSTM with DeepExplainer shows weak results (W=0.4962). Second, the paper introduces the SHAP-Guided Adaptive Ensemble (SGAE), which dynamically adjusts per-transaction ensemble weights based on SHAP attribution agreement, achieving the highest AUC-ROC among all tested models (0.8837 held-out; 0.9245 cross-validation). Third, a complete three-architecture evaluation of LSTM, Transformer, and GNN-GraphSAGE on the full 590,540-transaction IEEE-CIS dataset is provided, with GNN-GraphSAGE achieving AUC-ROC 0.9248 and F1=0.6013. All results are mapped directly to OCC, SR 11-7, and BSA-AML regulatory compliance requirements.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.