Dynamic Treatment on Networks

May 7, 20262605.06564

Bengusu Nar, Jiguang Li, Veronika Ročková, Panos Toulis

stat.MLcs.LG

TLDR

Q-Ising is a new pipeline for dynamic treatment allocation on networks, combining Bayesian Ising models with offline reinforcement learning.

Key contributions

Proposes Q-Ising, a 3-stage pipeline for dynamic treatment on networks using Bayesian Ising models and offline RL.
Estimates network adoption dynamics and augments treatment histories with continuous posterior latent states.
Quantifies uncertainty over dynamic decisions, yielding ensemble policies with interpretable spillover estimates.
Demonstrates Q-Ising's superior performance over static benchmarks on real and synthetic networks.

Why it matters

Existing treatment strategies on networks are largely static, while dynamic frameworks ignore network structure. This paper bridges that gap, offering a robust, data-driven approach to optimize interventions in networked systems. This is crucial for fields like public health and social policy where spillovers are key.

Original Abstract

In networks, effective dynamic treatment allocation requires deciding both whom to treat and also when, so as to amplify policy impact through spillovers. An early intervention at a well-connected node can trigger cascades that change which nodes are worth targeting in the next period. Existing treatment strategies under network interference are largely static while dynamic treatment frameworks typically ignore network structure altogether. We integrate these perspectives and propose Q-Ising, a three-stage pipeline that (i) estimates network adoption dynamics via a Bayesian dynamic Ising model from a single observed panel, (ii) augments treatment adoption histories with continuous posterior latent states, and (iii) learns a dynamic policy via offline reinforcement learning. The Bayesian mechanism enables uncertainty quantification over dynamic decisions, yielding posterior ensemble policies with interpretable spillover estimates. We provide a finite-sample regret upper bound that decomposes into standard offline-RL uncertainty, network abstraction error, and first stage error in Ising state estimation. We apply our method to data from Indian village microfinance networks and synthetic stochastic block models under simulated heterogeneous susceptible-infected-susceptible (SIS) dynamics and demonstrate that adaptive targeting outperforms static centrality benchmarks.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers