Meituan Merchant Business Diagnosis via Policy-Guided Dual-Process User Simulation

April 16, 20262604.15190

Ziyang Chen, Renbing Chen, Daowei Li, Jinzhi Liao, Jiashen Sun + 2 more

cs.AIcs.CL

TLDR

This paper introduces PGHS, a dual-process user simulation framework that combines LLM reasoning and ML fitting to accurately evaluate merchant strategies.

Key contributions

Introduces Policy-Guided Hybrid Simulation (PGHS) for robust user behavior simulation.
Uses a dual-process framework with LLM-based reasoning and ML-based fitting branches.
Mines transferable decision policies as a shared alignment layer to prevent over-rationalization.
Achieves 8.80% simulation error on Meituan, outperforming baselines by over 40%.

Why it matters

This paper offers a scalable and trustworthy method for evaluating merchant strategies without expensive online experiments. PGHS addresses critical challenges in user simulation, providing a more accurate way to predict group-level behavior. This allows businesses to optimize their operations efficiently.

Original Abstract

Simulating group-level user behavior enables scalable counterfactual evaluation of merchant strategies without costly online experiments. However, building a trustworthy simulator faces two structural challenges. First, information incompleteness causes reasoning-based simulators to over-rationalize when unobserved factors such as offline context and implicit habits are missing. Second, mechanism duality requires capturing both interpretable preferences and implicit statistical regularities, which no single paradigm achieves alone. We propose Policy-Guided Hybrid Simulation (PGHS), a dual-process framework that mines transferable decision policies from behavioral trajectories and uses them as a shared alignment layer. This layer anchors an LLM-based reasoning branch that prevents over-rationalization and an ML-based fitting branch that absorbs implicit regularities. Group-level predictions from both branches are fused for complementary correction. We deploy PGHS on Meituan with 101 merchants and over 26,000 trajectories. PGHS achieves a group simulation error of 8.80%, improving over the best reasoning-based and fitting-based baselines by 45.8% and 40.9% respectively.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers