pith. sign in

arxiv: 2607.00604 · v1 · pith:WG2VRDQVnew · submitted 2026-07-01 · 🧮 math.OC

Vehicle Routing Problem Meets Large Language Models: An Overview and Perspectives

Pith reviewed 2026-07-02 08:42 UTC · model grok-4.3

classification 🧮 math.OC
keywords vehicle routing problemlarge language modelsoptimizationheuristicsmulti-agent systemssurveylogisticsconstraints
0
0 comments X

The pith

Large language models serve as modelers, designers, or coordinators when applied to vehicle routing problems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This survey reviews applications of large language models to the vehicle routing problem, a central but NP-hard task in logistics that involves capacities, time windows, and dynamic requests. It organizes existing studies into three roles that LLMs fulfill. Modelers convert natural-language requirements into formal constraints and executable code. Designers produce heuristics, operators, or complete route plans. Coordinators handle tool calls, multi-agent setups, and links to other solvers. A reader would care because the categorization clarifies how flexible language interfaces can address the dual difficulties of modeling and solving complex routing tasks.

Core claim

The paper establishes that LLM-driven VRP research can be organized into three roles: modelers that translate natural-language requirements into constraints and modeling code, designers that generate heuristics, operators, or route plans, and coordinators that organize tool calls, multi-agent collaboration, and connections with neural solvers. The survey also covers standard benchmarks, operational datasets, LLM-oriented evaluation frameworks, and comparative experiments to support this organization.

What carries the argument

The three-role categorization (modeler, designer, coordinator) that classifies how LLMs contribute to VRP research and organizes the landscape of studies.

If this is right

  • Future work can systematically target gaps in the modeler role for handling intricate operational constraints.
  • Designers can focus on generating effective heuristics that compete with classical methods on NP-hard instances.
  • Coordinators enable tighter integration between LLMs and existing neural or exact solvers for practical use.
  • Standardized benchmarks and evaluation frameworks can compare performance across the three roles.
  • The structure provides a reference point for advancing intelligent decision-making in manufacturing and automation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same role division could be tested on other combinatorial problems such as scheduling or facility location.
  • Multi-role LLM systems might allow a single model to switch between translation, planning, and coordination within one routing session.
  • Real-world logistics datasets could be used to measure whether coordinator-style LLM setups reduce response time to dynamic requests.
  • Industrial users without optimization expertise might rely more on modeler LLMs to formulate problems directly from descriptions.

Load-bearing premise

The three-role categorization accurately and comprehensively captures the main contributions in LLM-driven VRP research without significant gaps or overlaps.

What would settle it

A review that finds multiple published LLM-VRP studies that cannot be placed into the modeler, designer, or coordinator roles would undermine the central organization.

Figures

Figures reproduced from arXiv: 2607.00604 by Chong Shen, Wanquan Liu, Xianchao Xiu, Yanjiao Zhu.

Figure 1
Figure 1. Figure 1: Timeline of LLM-driven VRP research. tion II introduces the basic concepts of VRP, its main variants, classical and learning-based solvers, and relevant background on LLMs. Sections III–V discuss LLMs from the roles of modeler, designer, and coordinator, covering automatic mod￾eling and code generation, automated heuristic design and end-to-end route reasoning, tool-chain orchestration, multi￾agent collabo… view at source ↗
Figure 2
Figure 2. Figure 2: Role classification of LLMs in VRP research. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Illustration of the modeler workflow. adapts general-purpose LLMs to optimization-code generation through instruction tuning on code-form problem descriptions and expert optimization programs. Multi-agent and self-improvement mechanisms further ex￾tend automatic modeling. Thind et al. [38] developed OptimAI with roles such as formulator, planner, coder, and code critic, allowing mathematical formalization,… view at source ↗
Figure 4
Figure 4. Figure 4: Illustration of the designer workflow. improve local components through reasoning, execution, and reflection. Xie et al. [55] similarly generated replaceable or complementary local heuristic components around an existing CVRP solver, reflecting a move from standalone heuristic generation to plug-in-style solver enhancement. Some work constrains the generation space more explicitly. Zhao et al. [56] propose… view at source ↗
Figure 5
Figure 5. Figure 5: Illustration of the coordinator workflow. [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Cost and time score comparison of LLM-generated heuristics across multiple VRP variants. [PITH_FULL_IMAGE:figures/full_fig_p013_6.png] view at source ↗
read the original abstract

The vehicle routing problem (VRP) is a central optimization problem in artificial intelligence, logistics automation, transportation scheduling, and industrial decision-making. VRP and its variants are NP-hard, and practical routing tasks often combine time windows, vehicle capacities, pickup-and-delivery relations, dynamic requests, and other operational constraints, making both modeling and solving difficult. Large language models (LLMs) provide a flexible interface for routing optimization by processing natural-language requirements, generating code, reasoning over constraints, and interacting with external tools. This survey reviews LLM-driven research on VRP, covering the basic definition, main variants, major solver families, and LLM concepts needed for this topic. Existing studies are organized into three roles: modelers translate natural-language requirements into constraints and modeling code; designers generate heuristics, operators, or route plans; and coordinators organize tool calls, multi-agent collaboration, and connections with neural solvers. The survey also reviews standard benchmarks, real or near-real operational datasets, LLM-oriented evaluation frameworks, and two comparative experiments. The goal is to clarify current progress in LLM-assisted routing optimization and provide a structured reference for intelligent decision-making, advanced manufacturing, and industrial automation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript is a survey on the intersection of the Vehicle Routing Problem (VRP) and large language models (LLMs). It covers VRP definitions and variants, major solver families, core LLM concepts, organizes existing studies into three roles (modelers that translate natural-language requirements into constraints and code; designers that generate heuristics, operators, or route plans; coordinators that organize tool calls, multi-agent collaboration, and neural-solver connections), reviews standard benchmarks, operational datasets, and LLM-oriented evaluation frameworks, and reports two comparative experiments.

Significance. If the three-role taxonomy can be made robust, the survey would supply a structured reference for LLM-assisted routing optimization in logistics, advanced manufacturing, and industrial automation. The coverage of benchmarks, datasets, and experiments is a concrete strength that could help researchers identify evaluation standards.

major comments (1)
  1. [Section organizing existing studies into three roles] The section that introduces and applies the three-role categorization (modelers, designers, coordinators) provides no explicit assignment criteria, decision tree, or overlap analysis. Because many LLM-VRP tasks (e.g., an LLM that emits PyVRP-style modeling code that also contains custom operators and external solver calls) simultaneously perform modeling, heuristic generation, and tool coordination, the claimed partitioning is vulnerable to non-exclusive or subjective placement of the same papers, undermining the central organizational claim.
minor comments (1)
  1. [Section describing the comparative experiments] The description of the two comparative experiments would benefit from an explicit statement of the evaluation metrics, baseline LLMs, and VRP instances used, so readers can assess reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our survey. We address the major comment below and will revise the manuscript to strengthen the presentation of the taxonomy.

read point-by-point responses
  1. Referee: The section that introduces and applies the three-role categorization (modelers, designers, coordinators) provides no explicit assignment criteria, decision tree, or overlap analysis. Because many LLM-VRP tasks (e.g., an LLM that emits PyVRP-style modeling code that also contains custom operators and external solver calls) simultaneously perform modeling, heuristic generation, and tool coordination, the claimed partitioning is vulnerable to non-exclusive or subjective placement of the same papers, undermining the central organizational claim.

    Authors: We appreciate the referee's observation. The three-role taxonomy is intended as a high-level conceptual framework to organize the literature according to the dominant function of the LLM in each study, rather than as a set of mutually exclusive partitions. We agree, however, that the absence of explicit assignment criteria and overlap analysis leaves the framework open to the subjectivity concerns raised. In the revised manuscript we will add a dedicated subsection that (i) states the primary classification criteria (the main intended contribution of the LLM as described by the authors of each paper), (ii) supplies a concise decision flowchart for assignment, and (iii) discusses representative overlap cases, including hybrid modeling-plus-design examples, with explicit placement rationales. These additions will make the organizational claim more robust without altering the surveyed content. revision: yes

Circularity Check

0 steps flagged

Survey paper exhibits no circularity; taxonomy is organizational, not derived.

full rationale

This is a literature survey with no equations, derivations, predictions, or fitted quantities. The three-role categorization (modelers, designers, coordinators) is presented as a review structure for organizing existing studies rather than a result obtained from any self-referential definition, self-citation chain, or input-to-output reduction. No load-bearing steps reduce to the paper's own inputs by construction, satisfying the default expectation for non-circular survey work.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a literature survey paper. No new free parameters, axioms, or invented entities are introduced; the contribution rests on summarizing and categorizing prior work in VRP and LLMs.

pith-pipeline@v0.9.1-grok · 5739 in / 983 out tokens · 37277 ms · 2026-07-02T08:42:53.802246+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

99 extracted references · 32 canonical work pages · 12 internal anchors

  1. [1]

    Machine learning for combinato- rial optimization: A methodological tour d’horizon,

    Y . Bengio, A. Lodi, and A. Prouvost, “Machine learning for combinato- rial optimization: A methodological tour d’horizon,”European Journal of Operational Research, vol. 290, no. 2, pp. 405–421, 2021

  2. [2]

    Language models are few-shot learners,

    T. Brown, B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal, A. Neelakantanet al., “Language models are few-shot learners,” in Advances in Neural Information Processing Systems (NeurIPS), vol. 33, 2020, pp. 1877–1901

  3. [3]

    Evaluating Large Language Models Trained on Code

    M. Chen, J. Tworek, H. Jun, Q. Yuan, H. P. d. O. Pintoet al., “Evaluating large language models trained on code,”arXiv preprint arXiv:2107.03374, 2021

  4. [4]

    A systematic survey on large language models for algorithm design,

    F. Liu, Y . Yao, P. Guo, Z. Yang, Z. Zhao, X. Lin, X. Tong, M. Yuan, Z. Lu, Z. Wanget al., “A systematic survey on large language models for algorithm design,”arXiv preprint arXiv:2410.14716, 2024

  5. [5]

    Large Language Models for Operations Research: A Comprehensive Survey

    X. Xiu, J. Li, J. Zhang, Y . Zheng, N. Wang, Y . Xue, T. Liu, J. Tang, and Y . Li, “Large language models for operations research: A comprehensive survey,”arXiv preprint arXiv:2605.20849, 2026

  6. [6]

    Large language models in opera- tions research: Methods, applications, and challenges,

    K. Wang, Z. Wan, Z. Jiao, S. Y . Chang, J. Ning, L. Zhang, J. Duan, Z. Guo, Q. Zhou, S. Yanget al., “Large language models in opera- tions research: Methods, applications, and challenges,”arXiv preprint arXiv:2509.18180, 2025

  7. [7]

    A structured review of large language models in metaheuristic optimisation,

    S. Ghanbarzadeh, P. Sumari, and R. Vinuesa, “A structured review of large language models in metaheuristic optimisation,”Machine Learning with Applications, vol. 22, p. 100740, 2025

  8. [8]

    Route optimization reimagined: Multi-modal large language models for next- generation vehicle routing,

    S. Y . Albalkhi, D. F. Alotaibi, T. Dimitriou, and I. Ahmad, “Route optimization reimagined: Multi-modal large language models for next- generation vehicle routing,”IEEE Access, 2026

  9. [9]

    The truck dispatching problem,

    G. B. Dantzig and J. H. Ramser, “The truck dispatching problem,” Management Science, vol. 6, no. 1, pp. 80–91, 1959

  10. [10]

    Stronger multi-commodity flow formulations of the Capacitated Vehicle Routing Problem,

    A. N. Letchford and J. J. Salazar-Gonz ´alez, “Stronger multi-commodity flow formulations of the Capacitated Vehicle Routing Problem,”Euro- pean Journal of Operational Research, vol. 244, no. 3, pp. 730–738, 2015

  11. [11]

    Fifty years of vehicle routing,

    G. Laporte, “Fifty years of vehicle routing,”Transportation Science, vol. 43, no. 4, pp. 408–416, 2009

  12. [12]

    A general heuristic for vehicle routing problems,

    D. Pisinger and S. Ropke, “A general heuristic for vehicle routing problems,”Computers & Operations Research, vol. 34, no. 8, pp. 2403– 2435, 2007

  13. [13]

    An adaptive large neighborhood search heuristic for the pickup and delivery problem with time windows,

    S. Ropke and D. Pisinger, “An adaptive large neighborhood search heuristic for the pickup and delivery problem with time windows,” Transportation Science, vol. 40, no. 4, pp. 455–472, 2006

  14. [14]

    A hybrid genetic algorithm with adaptive diversity management for a large class of vehicle routing problems with time windows,

    T. Vidal, T. G. Crainic, M. Gendreau, and C. Prins, “A hybrid genetic algorithm with adaptive diversity management for a large class of vehicle routing problems with time windows,”Computers & Operations Research, vol. 40, no. 1, pp. 475–489, 2013

  15. [15]

    Pointer networks,

    O. Vinyals, M. Fortunato, and N. Jaitly, “Pointer networks,” inAdvances in Neural Information Processing Systems (NeurIPS), vol. 28, 2015

  16. [16]

    Attention, learn to solve routing problems!

    W. Kool, H. van Hoof, and M. Welling, “Attention, learn to solve routing problems!” inInternational Conference on Learning Representations (ICLR), 2019

  17. [17]

    POMO: Policy optimization with multiple optima for reinforcement learning,

    Y .-D. Kwon, J. Choo, B. Kim, I. Yoon, Y . Gwon, and S. Min, “POMO: Policy optimization with multiple optima for reinforcement learning,” in Advances in Neural Information Processing Systems (NeurIPS), vol. 33, 2020

  18. [18]

    Sym-NCO: Leveraging symmetricity for neural combinatorial optimization,

    M. Kim, J. Park, and J. Park, “Sym-NCO: Leveraging symmetricity for neural combinatorial optimization,” inAdvances in Neural Information Processing Systems (NeurIPS), vol. 35, 2022

  19. [19]

    Neural combinatorial optimization with heavy decoder: Toward large scale generalization,

    F. Luo, X. Lin, F. Liu, Q. Zhang, and Z. Wang, “Neural combinatorial optimization with heavy decoder: Toward large scale generalization,” in Advances in Neural Information Processing Systems (NeurIPS), vol. 36, 2023. 15

  20. [20]

    RouteFinder: Towards foundation models for vehicle routing problems,

    F. Berto, C. Hua, N. G. Zepeda, A. Hottung, N. Wouda, L. Lan, J. Park, K. Tierney, and J. Park, “RouteFinder: Towards foundation models for vehicle routing problems,”Transactions on Machine Learning Research (TMLR), 2025

  21. [21]

    Exact combinatorial optimization with graph convolutional neural networks,

    M. Gasse, D. Chetelat, N. Ferroni, L. Charlin, and A. Lodi, “Exact combinatorial optimization with graph convolutional neural networks,” inAdvances in Neural Information Processing Systems (NeurIPS), vol. 32, 2019

  22. [22]

    Neural large neighborhood search for the capacitated vehicle routing problem,

    A. Hottung and K. Tierney, “Neural large neighborhood search for the capacitated vehicle routing problem,” inEuropean Conference on Artificial Intelligence (ECAI), 2020, pp. 443–450

  23. [23]

    Progress in neural NLP: Modeling, learning, and reasoning,

    M. Zhou, N. Duan, S. Liu, and H.-Y . Shum, “Progress in neural NLP: Modeling, learning, and reasoning,”Engineering, vol. 6, no. 3, pp. 275– 290, 2020

  24. [24]

    Attention is all you need,

    A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” inAdvances in Neural Information Processing Systems (NeurIPS), vol. 30, 2017

  25. [25]

    Recent advances in natural language processing via large pre-trained language models: A survey,

    B. Min, H. Ross, E. Sulem, A. P. B. Veyseh, A. Bhattacharjee, A. Cattan, S. Yoon, Y . Kwon, Y . Hsuet al., “Recent advances in natural language processing via large pre-trained language models: A survey,”ACM Computing Surveys, vol. 56, pp. 1–40, 2023

  26. [26]

    How abilities in large language models are affected by supervised fine-tuning data composition,

    G. Dong, H. Yuan, K. Luet al., “How abilities in large language models are affected by supervised fine-tuning data composition,”arXiv preprint arXiv:2310.05492, 2023

  27. [27]

    Training language models to follow instructions with human feedback,

    L. Ouyang, J. Wu, X. Jiang, D. Almeida, C. Wainwright, P. Mishkin, C. Zhang, S. Agarwal, K. Slama, A. Rayet al., “Training language models to follow instructions with human feedback,” inAdvances in Neural Information Processing Systems (NeurIPS), vol. 35, 2022, pp. 27 730–27 744

  28. [28]

    GPT-4 technical report,

    OpenAI, “GPT-4 technical report,” OpenAI, Tech. Rep., 2023

  29. [29]

    Qwen2.5 Technical Report

    A. Yang, B. Yang, B. Zhang, B. Hui, B. Zheng, B. Yu, C. Li, D. Liu, F. Huang, H. Weiet al., “Qwen2.5 technical report,”arXiv preprint arXiv:2412.15115, 2024

  30. [30]

    DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    DeepSeek-AI, “DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning,”arXiv preprint arXiv:2501.12948, 2025

  31. [31]

    LoRA: Low-Rank Adaptation of Large Language Models

    E. J. Hu, Y . Shen, P. Wallis, Z. Allen-Zhuet al., “LoRA: Low-rank adaptation of large language models,”arXiv preprint arXiv:2106.09685, 2022

  32. [32]

    OptiMUS: Optimization modeling using MIP solvers and large language models,

    A. AhmadiTeshnizi, W. Gao, and M. Udell, “OptiMUS: Optimization modeling using MIP solvers and large language models,”arXiv preprint arXiv:2310.06116, 2023

  33. [33]

    OptiMUS: Scalable opti- mization modeling with (MI)LP solvers and large language models,

    A. Ahmaditeshnizi, W. Gao, and M. Udell, “OptiMUS: Scalable opti- mization modeling with (MI)LP solvers and large language models,” in International Conference on Machine Learning (ICML), 2024

  34. [34]

    Solving general natural-language-description optimization problems with large language models,

    J. Zhang, W. Wang, S. Guo, L. Wang, F. Lin, C. Yang, and W. Yin, “Solving general natural-language-description optimization problems with large language models,” inthe 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 6: Industry Track), 2024, pp. 483–490

  35. [35]

    ORMind: A cognitive-inspired end-to-end reasoning framework for operations research,

    Z. Wang, B. Chen, Y . Huang, Q. Cao, M. He, J. Fan, and X. Liang, “ORMind: A cognitive-inspired end-to-end reasoning framework for operations research,” inthe 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025, pp. 104–131

  36. [36]

    ORLM: A customizable framework in training large models for automated optimization modeling,

    C. Huang, Z. Tang, S. Hu, R. Jiang, X. Zheng, D. Ge, B. Wang, and Z. Wang, “ORLM: A customizable framework in training large models for automated optimization modeling,”Operations Research, vol. 73, no. 6, pp. 2986–3009, 2025

  37. [37]

    LLaMoCo: Instruction tuning of large language models for optimization code generation,

    Z. Ma, H. Guo, J. Chen, G. Peng, Z. Cao, Y . Ma, and Y .-J. Gong, “LLaMoCo: Instruction tuning of large language models for optimization code generation,”arXiv preprint arXiv:2403.01131, 2024

  38. [38]

    Op- timAI: Optimizing optimization problems with AI,

    R. Thind, V . Vashishtha, J. Umenberger, and K. H. Johansson, “Op- timAI: Optimizing optimization problems with AI,”arXiv preprint arXiv:2504.16918, 2025

  39. [39]

    AlphaOPT: Formulating Optimization Programs with Self-Improving LLM Experience Library

    X. Kong, W. Ning, H. Wu, and Y . Zheng, “AlphaOPT: Formulating optimization programs with self-improving LLM experience library,” arXiv preprint arXiv:2510.18428, 2025

  40. [40]

    OptiTree: Hierarchical thoughts generation with tree search for LLM optimization modeling,

    H. Liu, J. Wang, Y . Cai, X. Han, Y . Kuang, and J. Hao, “OptiTree: Hierarchical thoughts generation with tree search for LLM optimization modeling,” inAdvances in Neural Information Processing Systems (NeurIPS), 2025

  41. [41]

    OR-R1: Automating modeling and solving of operations research optimization problem via test-time re- inforcement learning,

    Z. Ding, Z. Tan, J. Zhang, and T. Chen, “OR-R1: Automating modeling and solving of operations research optimization problem via test-time re- inforcement learning,” inthe AAAI Conference on Artificial Intelligence, 2026

  42. [42]

    To the Globe (TTG): Towards language-driven guaranteed travel planning,

    D. Ju, S. Jiang, A. Cohen, A. Foss, S. Mitts, A. Zharmagambetov, B. Amos, X. Li, J. T. Kao, M. Fazel-Zarandi, and Y . Tian, “To the Globe (TTG): Towards language-driven guaranteed travel planning,” inthe 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2024

  43. [43]

    DRoC: Elevating large language models for complex vehicle routing via decomposed retrieval of constraints,

    X. Jiang, Y . Wu, C. Zhang, and Y . Zhang, “DRoC: Elevating large language models for complex vehicle routing via decomposed retrieval of constraints,” inInternational Conference on Learning Representations (ICLR), 2025

  44. [44]

    ORThought: Benchmarking and automating logis- tics optimization modeling with structured LLM reasoning,

    Y . Yang, Y . Li, Z. Xiong, H. Tian, Q. Su, X. Wang, D. Wu, F. He, Y . Yin, X. Yeet al., “ORThought: Benchmarking and automating logis- tics optimization modeling with structured LLM reasoning,”Artificial Intelligence for Transportation, vol. 6, p. 100059, 2026

  45. [45]

    Algorithm evolution using large language model,

    F. Liu, X. Lin, Q. Zhang, K. C. Tan, and S. Kwong, “Algorithm evolution using large language model,”arXiv preprint arXiv:2311.15249, 2023

  46. [46]

    Mathematical discoveries from program search with large language models,

    B. Romera-Paredes, M. Barekatain, A. Novikov, M. Balog, M. P. Kumar, E. Dupontet al., “Mathematical discoveries from program search with large language models,”Nature, vol. 625, pp. 468–475, 2024

  47. [47]

    Evolution of heuristics: Towards efficient automatic algorithm design using large language model,

    F. Liu, X. Tong, M. Yuan, X. Lin, F. Luo, Z. Wang, Z. Lu, and Q. Zhang, “Evolution of heuristics: Towards efficient automatic algorithm design using large language model,” inInternational Conference on Machine Learning (ICML), 2024

  48. [48]

    ReEvo: Large language models as hyper-heuristics with reflective evolution,

    H. Ye, J. Wang, Z. Cao, F. Berto, C. Hua, H. Kim, J. Park, and G. Song, “ReEvo: Large language models as hyper-heuristics with reflective evolution,” inAdvances in Neural Information Processing Systems (NeurIPS), 2024

  49. [49]

    Monte carlo tree search for comprehensive exploration in LLM-based automatic heuristic design,

    Z. Zheng, Z. Xie, Z. Wang, and B. Hooi, “Monte carlo tree search for comprehensive exploration in LLM-based automatic heuristic design,” inInternational Conference on Machine Learning (ICML), 2025, pp. 78 338–78 373

  50. [50]

    HSEvo: Elevating automatic heuristic design with diversity-driven harmony search and genetic algo- rithm using LLMs,

    P. V . T. Dat, L. Doan, and H. T. T. Binh, “HSEvo: Elevating automatic heuristic design with diversity-driven harmony search and genetic algo- rithm using LLMs,” inAAAI Conference on Artificial Intelligence, 2025, pp. 26 931–26 938

  51. [51]

    Multi-objective evolution of heuristic using large language model,

    S. Yao, F. Liu, X. Linet al., “Multi-objective evolution of heuristic using large language model,” inAAAI Conference on Artificial Intelligence, 2025, pp. 27 144–27 152

  52. [52]

    Automatic algorithm design assisted by LLMs for solving vehicle routing problems,

    L. Ma, X. Hao, R. Yanget al., “Automatic algorithm design assisted by LLMs for solving vehicle routing problems,” inInternational Conference on Signal Processing (ICSP), 2024, pp. 247–252

  53. [53]

    VRPAgent: LLM-driven discovery of heuristic operators for vehicle routing problems,

    A. Hottung, F. Berto, C. Hua, H. Ye, H. Kim, J. Park, and K. Tierney, “VRPAgent: LLM-driven discovery of heuristic operators for vehicle routing problems,”arXiv preprint arXiv:2510.07073, 2025

  54. [54]

    PyVRP$^+$: LLM-Driven Metacognitive Heuristic Evolution for Hybrid Genetic Search in Vehicle Routing Problems

    M. Malik, J. Zhou, S. R. Chirra, and Z. Cao, “PyVRP+: LLM-driven metacognitive heuristic evolution for hybrid genetic search in vehicle routing problems,”arXiv preprint arXiv:2604.07872, 2026

  55. [55]

    Enhancing CVRP Solver through LLM-driven Automatic Heuristic Design

    Z. Xie, F. Liu, Z. Wang, and Q. Zhang, “Enhancing CVRP solver through LLM-driven automatic heuristic design,”arXiv preprint arXiv:2602.23092, 2026

  56. [56]

    G-LNS: Generative large neighbor- hood search for LLM-based automatic heuristic design,

    B. Zhao, H. Wang, and L. Zeng, “G-LNS: Generative large neighbor- hood search for LLM-based automatic heuristic design,”arXiv preprint arXiv:2602.08253, 2026

  57. [57]

    LLM-based automatic heuristic design for vehicle- drone collaborative routing problems,

    H. Shi and L. Zhen, “LLM-based automatic heuristic design for vehicle- drone collaborative routing problems,”Transportation Research Part E: Logistics and Transportation Review, vol. 209, p. 104760, 2026

  58. [58]

    LLM4AD: A platform for algorithm design with large language model,

    F. Liu, R. Zhang, Z. Xie, X. Tong, M. Yuan, Q. Zhang, and X. Lin, “LLM4AD: A platform for algorithm design with large language model,” arXiv preprint arXiv:2412.17287, 2024

  59. [59]

    HeurAgenix: Leveraging LLMs for solving complex combinatorial optimization challenges,

    X. Yang, L. Zhang, H. Qianet al., “HeurAgenix: Leveraging LLMs for solving complex combinatorial optimization challenges,”arXiv preprint arXiv:2506.15196, 2025

  60. [60]

    Gen- eralizable heuristic generation through LLMs with meta-optimization,

    Y . Shi, J. Zhou, W. Song, J. Bi, Y . Wu, Z. Cao, and J. Zhang, “Gen- eralizable heuristic generation through LLMs with meta-optimization,” inthe International Conference on Learning Representations (ICLR), 2026

  61. [61]

    CALM: Co- evolution of algorithms and language model for automatic heuristic design,

    Z. Huang, W. Wu, K. Wu, J. Wang, and W.-B. Lee, “CALM: Co- evolution of algorithms and language model for automatic heuristic design,”arXiv preprint arXiv:2505.12285, 2025

  62. [62]

    Refining hybrid genetic search for CVRP via reinforcement learning-finetuned LLM,

    R. Zhu, C. Zhang, and Z. Cao, “Refining hybrid genetic search for CVRP via reinforcement learning-finetuned LLM,” inthe International Conference on Learning Representations (ICLR), 2026

  63. [63]

    A large language model-enhanced Q- Learning for capacitated vehicle routing problem with time windows,

    L. Cao, M. Wang, and X. Xiong, “A large language model-enhanced Q- Learning for capacitated vehicle routing problem with time windows,” in 2025 IEEE 28th International Conference on Intelligent Transportation Systems (ITSC), 2025. 16

  64. [64]

    From words to routes: Applying large language models to vehicle routing,

    Z. Huang, G. Shi, and G. S. Sukhatme, “From words to routes: Applying large language models to vehicle routing,”arXiv preprint arXiv:2403.10795, 2024

  65. [65]

    ARS: Automatic routing solver with large language models,

    K. Li, F. Liu, Z. Wanget al., “ARS: Automatic routing solver with large language models,”arXiv preprint arXiv:2502.15359, 2025

  66. [66]

    Using large language models to solve the electric vehicle routing problem with advanced prompting techniques,

    U. Zafar and S. Bayhan, “Using large language models to solve the electric vehicle routing problem with advanced prompting techniques,” in2025 IEEE 19th International Conference on Compatibility, Power Electronics and Power Engineering (CPE-POWERENG), 2025

  67. [67]

    ReAct: Synergizing reasoning and acting in language models,

    S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. Narasimhan, and Y . Cao, “ReAct: Synergizing reasoning and acting in language models,” in International Conference on Learning Representations (ICLR), 2023

  68. [68]

    Toolformer: Language models can teach themselves to use tools,

    T. Schick, J. Dwivedi-Yu, R. Dess `ı, R. Raileanu, M. Lomeliet al., “Toolformer: Language models can teach themselves to use tools,” in Advances in Neural Information Processing Systems (NeurIPS), 2023

  69. [69]

    AutoGPT,

    T. B. Richards, “AutoGPT,” 2023, gitHub repository

  70. [70]

    LangChain: Building applications with LLMs through com- posability,

    H. Chase, “LangChain: Building applications with LLMs through com- posability,” 2022, gitHub repository

  71. [71]

    DRAGON: LLM-driven decomposition and reconstruction agents for large-scale combinatorial optimization,

    X. Chen, X. Zhang, X. Wu, C. Yang, Z. Liao, L. Li, S. Xia, F. Shi, Y . Yang, and W. Ma, “DRAGON: LLM-driven decomposition and reconstruction agents for large-scale combinatorial optimization,” inthe International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2026

  72. [72]

    Multi-agent large language models as evolutionary optimizers for scheduling optimization,

    Y . Wang, J. Wang, and Z. Chu, “Multi-agent large language models as evolutionary optimizers for scheduling optimization,”Computers & Industrial Engineering, vol. 206, p. 111197, 2025

  73. [73]

    HMACE: Heterogeneous Multi-Agent Collaborative Evolution for Combinatorial Optimization

    D. Yan, M. Xue, Z. Wang, Q. Hao, and W. Du, “HMACE: Heteroge- neous multi-agent collaborative evolution for combinatorial optimiza- tion,”arXiv preprint arXiv:2605.07214, 2026

  74. [74]

    CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

    A. Qu, H. Zheng, Z. Zhou, Y . Yan, Y . Tang, S. Y . Ong, F. Hong, K. Zhou, C. Jiang, M. Kong, J. Zhu, X. Jiang, S. Li, C. Wu, B. K. H. Low, J. Zhao, and P. P. Liang, “CORAL: Towards autonomous multi-agent evolution for open-ended discovery,”arXiv preprint arXiv:2604.01658, 2026

  75. [75]

    An agentic framework with LLMs for solving complex vehicle routing problems,

    N. Zhang, Z. Cao, J. Zhou, C. Zhang, and Y .-S. Ong, “An agentic framework with LLMs for solving complex vehicle routing problems,” inInternational Conference on Learning Representations (ICLR), 2026

  76. [76]

    Large language models powered neural solvers for generalized vehicle routing problems,

    C. D. Tran, Q. Nguyen-Tri, H. T. T. Binh, and H. Thanh-Tung, “Large language models powered neural solvers for generalized vehicle routing problems,” inICLR 2025 Workshop on Towards Agentic AI for Science, 2025

  77. [77]

    LLMAide: Language-assisted neural solver for vehicle routing problems,

    M. Malik, J. Zhou, Y . Jin, and Z. Cao, “LLMAide: Language-assisted neural solver for vehicle routing problems,” inthe International Confer- ence on Autonomous Agents and Multiagent Systems (AAMAS) Extended Abstracts, 2026

  78. [78]

    URS: A Unified Neural Routing Solver for Cross-Problem Zero-Shot Generalization

    C. Zhou, C. Yu, S. Yao, X. Lin, Z. Wang, Y . Zhou, and Q. Zhang, “URS: A unified neural routing solver for cross-problem zero-shot generalization,”arXiv preprint arXiv:2509.23413, 2025

  79. [79]

    A generalized neural solver based on LLM- guided heuristic evoluation framework for solving diverse variants of vehicle routing problems,

    M. Chi, W. Pang, X. Wu, P. Zhao, Y . Li, T. Wang, J. Qian, Y . Xiao, L. Wang, and Y . Zhou, “A generalized neural solver based on LLM- guided heuristic evoluation framework for solving diverse variants of vehicle routing problems,”Expert Systems with Applications, vol. 296, no. Part A, p. 128876, 2026

  80. [80]

    A universal framework for vehicle routing problems with large language models,

    R.-B. Zeng, M.-X. Yang, M.-L. Lei, L.-F. Niu, and Y .-H. Dai, “A universal framework for vehicle routing problems with large language models,”Journal of the Operations Research Society of China, 2026

Showing first 80 references.