Dynamic Treatment on Networks
Bengusu Nar, Jiguang Li, Veronika Ročková, Panos Toulis
TLDR
Q-Ising is a new pipeline for dynamic treatment allocation on networks, combining Bayesian Ising models with offline reinforcement learning.
Key contributions
- Proposes Q-Ising, a 3-stage pipeline for dynamic treatment on networks using Bayesian Ising models and offline RL.
- Estimates network adoption dynamics and augments treatment histories with continuous posterior latent states.
- Quantifies uncertainty over dynamic decisions, yielding ensemble policies with interpretable spillover estimates.
- Demonstrates Q-Ising's superior performance over static benchmarks on real and synthetic networks.
Why it matters
Existing treatment strategies on networks are largely static, while dynamic frameworks ignore network structure. This paper bridges that gap, offering a robust, data-driven approach to optimize interventions in networked systems. This is crucial for fields like public health and social policy where spillovers are key.
Original Abstract
In networks, effective dynamic treatment allocation requires deciding both whom to treat and also when, so as to amplify policy impact through spillovers. An early intervention at a well-connected node can trigger cascades that change which nodes are worth targeting in the next period. Existing treatment strategies under network interference are largely static while dynamic treatment frameworks typically ignore network structure altogether. We integrate these perspectives and propose Q-Ising, a three-stage pipeline that (i) estimates network adoption dynamics via a Bayesian dynamic Ising model from a single observed panel, (ii) augments treatment adoption histories with continuous posterior latent states, and (iii) learns a dynamic policy via offline reinforcement learning. The Bayesian mechanism enables uncertainty quantification over dynamic decisions, yielding posterior ensemble policies with interpretable spillover estimates. We provide a finite-sample regret upper bound that decomposes into standard offline-RL uncertainty, network abstraction error, and first stage error in Ising state estimation. We apply our method to data from Indian village microfinance networks and synthetic stochastic block models under simulated heterogeneous susceptible-infected-susceptible (SIS) dynamics and demonstrate that adaptive targeting outperforms static centrality benchmarks.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.