Scaling the Queue: Reinforcement Learning for Equitable Call Classification Capacity in NYC Municipal Complaint Systems

May 7, 20262605.06482

Irene Aldridge, Ellie Bae, Siddhesh Darak, Nicholas Donat, Akhil Fernando-Bell + 23 more

econ.EMcs.CY

TLDR

This paper introduces an equity-centered reinforcement learning framework to optimize complaint classification and routing in NYC's 311 system, addressing service disparities.

Key contributions

Develops an equity-centered RL framework for municipal complaint classification and routing.
RL agents intelligently route complaints to actions like escalate, batch, defer, or inspect now.
Formalizes each operational domain as an MDP, with equitable classification coverage as a primary reward.
SHAP analysis shows complaint recurrence and neighborhood stats predict violations better than raw volume.

Why it matters

This paper addresses a critical issue of inequitable service delivery in municipal complaint systems. By using an RL framework, it not only improves efficiency but also actively works to close historical equity gaps. The findings on predictive features offer valuable insights for more effective and fair resource allocation.

Original Abstract

Municipal 311 call centers and complaint intake systems face a structural mismatch between incoming volume and classification capacity. The staff and heuristics available to triage, route, and prioritize complaints cannot scale with demand. This bottleneck produces differential service quality that follows income and racial lines (\cite{liu2024sla}). We develop an equity-centered reinforcement learning (RL) framework that augments call classification capacity across six New York City Department of Buildings (DOB) operational domains: boiler safety, crane and derrick oversight, heat and hot water complaints, housing complaint triage, scaffold safety, and Natural Area District (SNAD) protection. Rather than replacing human classifiers, our agents act as intelligent intake routers: learning to assign incoming complaints to action categories: escalate, batch, defer, inspect now. The proposed technique is designed to maximize throughput, minimize misclassification cost, and actively narrow historical equity gaps in service delivery. We formalize each domain as a Markov Decision Process (MDP) in which equitable classification coverage is a first-class reward objective. Post-hoc SHAP attribution reveals that complaint recurrence and neighborhood-level statistics are stronger predictors of actionable violations than raw complaint volume. This finding has direct implications for complaint routing given the demographic correlates of those features.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers