PyVRP$^+$: LLM-Driven Metacognitive Heuristic Evolution for Hybrid Genetic Search in Vehicle Routing Problems
Manuj Malik, Jianan Zhou, Shashank Reddy Chirra, Zhiguang Cao
TLDR
PyVRP$^+$ introduces Metacognitive Evolutionary Programming (MEP), an LLM-driven framework that strategically evolves VRP heuristics, improving solution quality and runtime.
Key contributions
- Introduces Metacognitive Evolutionary Programming (MEP) for LLM-driven heuristic evolution.
- MEP uses a Reason-Act-Reflect cycle, enabling LLMs to diagnose failures and form design hypotheses.
- Evolves novel heuristics for Hybrid Genetic Search (HGS) in VRP, outperforming baselines.
- Achieves up to 2.70% better solution quality and over 45% runtime reduction for VRPs.
Why it matters
This paper introduces a novel LLM-driven framework, MEP, that moves beyond reactive black-box optimization. By enabling LLMs to strategically reason and reflect on heuristic design, it significantly advances the automation of metaheuristic development. The resulting performance gains in VRPs demonstrate a powerful new paradigm for solving complex combinatorial optimization problems.
Original Abstract
Designing high-performing metaheuristics for NP-hard combinatorial optimization problems, such as the Vehicle Routing Problem (VRP), remains a significant challenge, often requiring extensive domain expertise and manual tuning. Recent advances have demonstrated the potential of large language models (LLMs) to automate this process through evolutionary search. However, existing methods are largely reactive, relying on immediate performance feedback to guide what are essentially black-box code mutations. Our work departs from this paradigm by introducing Metacognitive Evolutionary Programming (MEP), a framework that elevates the LLM to a strategic discovery agent. Instead of merely reacting to performance scores, MEP compels the LLM to engage in a structured Reason-Act-Reflect cycle, forcing it to explicitly diagnose failures, formulate design hypotheses, and implement solutions grounded in pre-supplied domain knowledge. By applying MEP to evolve core components of the state-of-the-art Hybrid Genetic Search (HGS) algorithm, we discover novel heuristics that significantly outperform the original baseline. By steering the LLM to reason strategically about the exploration-exploitation trade-off, our approach discovers more effective and efficient heuristics applicable across a wide spectrum of VRP variants. Our results show that MEP discovers heuristics that yield significant performance gains over the original HGS baseline, improving solution quality by up to 2.70\% and reducing runtime by over 45\% on challenging VRP variants.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.