Deep-testing: the case of dependence detection
Gery Geenens, Pierre Lafaye de Micheaux, Ivan Muyun Zou
TLDR
Deep-testing uses neural networks to perform hypothesis testing, demonstrating superior power in detecting complex dependencies compared to traditional methods.
Key contributions
- Introduces 'deep-testing,' a novel procedure applying deep learning to classical hypothesis testing problems.
- Leverages deep neural networks' strong discriminating power to construct highly powerful test statistics.
- Applies the method to independence testing, a crucial problem in statistics.
- Outperforms nineteen competing methods in detecting complex dependence structures in large-scale simulations.
Why it matters
This paper introduces a novel, powerful approach to classical statistical inference, potentially revolutionizing hypothesis testing. By outperforming existing methods in dependence detection, it opens new avenues for applying deep learning beyond typical classification tasks.
Original Abstract
Deep learning methods have proved highly effective for classification and image recognition problems. In this paper, we ask whether this success can be transferred to hypothesis testing: if a neural network can distinguish, for example, an image of a handwritten digit from another, can it also distinguish an "image of a sample" (such as a scatter plot) generated under a given statistical model from one generated outside that model? Motivated by this idea, we propose a novel procedure called deep-testing, which approaches the classical inferential problem of hypothesis testing through deep learning. More specifically, the test statistic is a classification map learned by a deep neural network from simulated data satisfying the null and alternative hypotheses, leveraging its strong discriminating power to construct a highly powerful test. As a proof of concept, we apply deep-testing to the problem of independence testing, arguably one of the most important problems in statistics. In a large-scale simulation study, deep-testing achieves the highest overall power against nineteen competing methods across a broad range of complex dependence structures, confirming the viability of the proposed approach.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.