ArXiv TLDR

A novel hybrid approach for positive-valued DAG learning

🐦 Tweet
2604.08935

Yao Zhao

stat.MLcs.LG

TLDR

H-MRS is a novel algorithm for learning causal DAGs from positive-valued data by combining moment-based scoring with log-scale regression.

Key contributions

  • Introduces H-MRS for causal DAG learning from inherently positive-valued data.
  • Utilizes a novel moment-ratio criterion for effective causal ordering.
  • Integrates log-scale Ridge regression with raw-scale moment ratios and Elastic Net.
  • Demonstrates competitive performance and computational efficiency on synthetic data.

Why it matters

Causal discovery from positive-valued data, common in genomics and economics, is challenging due to multiplicative dynamics. H-MRS provides an efficient and accurate solution that naturally respects positivity constraints, offering a practical framework for these domains.

Original Abstract

Causal discovery from observational data remains a fundamental challenge in machine learning and statistics, particularly when variables represent inherently positive quantities such as gene expression levels, asset prices, company revenues, or population counts, which often follow multiplicative rather than additive dynamics. We propose the Hybrid Moment-Ratio Scoring (H-MRS) algorithm, a novel method for learning directed acyclic graphs (DAGs) from positive-valued data by combining moment-based scoring with log-scale regression. The key idea is that for positive-valued variables, the moment ratio $\frac{\mathbb{E}[X_j^2]}{\mathbb{E}[(\mathbb{E}[X_j \mid S])^2]}$ provides an effective criterion for causal ordering, where $S$ denotes candidate parent sets. H-MRS integrates log-scale Ridge regression for moment-ratio estimation with a greedy ordering procedure based on raw-scale moment ratios, followed by Elastic Net-based parent selection to recover the final DAG structure. Experiments on synthetic log-linear data demonstrate competitive precision and recall. The proposed method is computationally efficient and naturally respects positivity constraints, making it suitable for applications in genomics and economics. These results suggest that combining log-scale modeling with raw-scale moment ratios provides a practical framework for causal discovery in positive-valued domains.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.