Pith Number

pith:QNE6M7I3

pith:2026:QNE6M7I34RDIGWSDAJ4RA26EUA

not attested · not anchored · not stored · refs resolved

A Unified Knowledge Embedded Reinforcement Learning-based Framework for Generalized Capacitated Vehicle Routing Problems

Hao Hu, Liang Wang, Wen Wang, Xiangchen Wu, Xianping Tao

A framework embedding classical routing knowledge into RL achieves better solutions for diverse CVRP variants.

arxiv:2605.14416 v1 · 2026-05-14 · cs.AI


Add to your LaTeX paper

\usepackage{pith}
\pithnumber{QNE6M7I34RDIGWSDAJ4RA26EUA}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1 Bitcoin timestamp

2 Internet Archive

3 Author claim open

4 Citations open

5 Replications open

✓ Portable graph bundle live · download bundle · merged state

The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.
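As a rough illustration of why a deterministic merge lets any mirror reach the same state, the sketch below folds events over a canonical record in a fixed total order. The event fields (`timestamp`, `event_id`, `field`, `value`) and the last-writer-wins rule are assumptions made for this example; the actual Pith event schema and merge algorithm are not described on this page, and signature checking is omitted.

```python
from typing import Dict, Iterable, NamedTuple


class Event(NamedTuple):
    # All four fields are hypothetical; the real event schema is not shown here.
    timestamp: str  # ISO 8601 UTC string, so lexicographic order is time order
    event_id: str   # tie-breaker when two events share a timestamp
    field: str      # which part of the state the event updates
    value: str      # new value for that field


def merge_state(canonical: Dict[str, str], events: Iterable[Event]) -> Dict[str, str]:
    """Fold events into a copy of the canonical record in a fixed total order.

    Duplicates are dropped and events are sorted by their full tuple, so the
    result does not depend on the order in which a mirror received them.
    """
    state = dict(canonical)
    for ev in sorted(set(events)):
        state[ev.field] = ev.value  # assumed rule: the last writer wins
    return state
```

Because the sort key covers every field of the event, two mirrors holding the same set of events produce identical state even if they fetched the events in different orders.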

Claims

C1 · strongest claim

Extensive experiments show that this framework achieves superior solution quality compared with state-of-the-art learning-based methods, with a smaller gap to classical heuristics, demonstrating strong generalization across diverse CVRP variants.

C2 · weakest assumption

That the Route-First Cluster-Second decomposition plus dynamic programming guidance will reliably mitigate partial observability and produce generalizable improvements without introducing new biases or overfitting to the tested CVRP variants.

C3 · one line summary

A knowledge-embedded RL framework decomposes generalized CVRPs into route-first and cluster-second subproblems, using dynamic programming to guide the RL solver and a history-enhanced context module to handle partial observability, yielding better solutions than prior learning methods.
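For readers unfamiliar with route-first, cluster-second decomposition, here is a minimal sketch of the classical Split procedure: given a fixed giant tour over all customers, a dynamic program chooses the cheapest way to cut it into capacity-feasible routes. This is the textbook technique, not the paper's implementation; `split_tour`, the distance matrix layout, and the toy instance are assumptions for illustration only.

```python
from typing import List, Sequence, Tuple


def split_tour(
    tour: Sequence[int],
    demand: Sequence[float],
    dist: Sequence[Sequence[float]],
    capacity: float,
    depot: int = 0,
) -> Tuple[float, List[List[int]]]:
    """Cut a giant tour into consecutive routes of minimal total length.

    best[j] is the cheapest cost of serving the first j customers of the tour.
    """
    n = len(tour)
    inf = float("inf")
    best = [inf] * (n + 1)
    best[0] = 0.0
    pred = [0] * (n + 1)

    for i in range(n):
        if best[i] == inf:
            continue
        load = 0.0
        cost = 0.0  # depot -> tour[i] -> ... -> tour[j], without the return leg
        for j in range(i, n):
            c = tour[j]
            load += demand[c]
            if load > capacity:
                break  # extending the route further can only add load
            cost = dist[depot][c] if j == i else cost + dist[tour[j - 1]][c]
            total = best[i] + cost + dist[c][depot]
            if total < best[j + 1]:
                best[j + 1] = total
                pred[j + 1] = i

    if best[n] == inf:
        raise ValueError("a single customer exceeds the vehicle capacity")

    # Walk the predecessor links backwards to recover the routes.
    routes: List[List[int]] = []
    j = n
    while j > 0:
        i = pred[j]
        routes.append(list(tour[i:j]))
        j = i
    routes.reverse()
    return best[n], routes


# Toy instance (hypothetical): depot 0 at the origin, customers on a line.
pos = [0, 1, 2, -1, -2]
dist = [[abs(a - b) for b in pos] for a in pos]
demand = [0, 1, 1, 1, 1]
cost, routes = split_tour([1, 2, 3, 4], demand, dist, capacity=2)
print(cost, routes)  # 8.0 [[1, 2], [3, 4]]
```

In this setting the ordering step (here a fixed tour) is where a learned policy could plug in, while the DP handles the capacity-driven clustering exactly for that ordering.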

References

42 extracted · 42 resolved · 0 Pith anchors

[1] Routefinder: Towards foundation models for vehicle routing problems, 2024

[2] Learning to handle complex constraints for vehicle routing problems. Advances in Neural Information Processing Systems, 37:93479–93509, 2024

[3] Learning to perform local rewriting for combinatorial optimization. Advances in Neural Information Processing Systems, 32, 2019

[4] Select and optimize: Learning to solve large-scale TSP instances, 2023

[5] Learning 2-opt heuristics for the traveling salesman problem via deep reinforcement learning, 2020

Receipt and verification

First computed	2026-05-17T23:39:07.300744Z
Builder	pith-number-builder-2026-05-17-v1
Signature	Pith Ed25519 (`pith-v1-2026-05`) · public key
Schema	pith-number/v1.0

Canonical hash

8349e67d1be446835a430279106bc4a00e55747bd9d161c83f34c204a91946be

Aliases

arxiv: 2605.14416 · arxiv_version: 2605.14416v1 · doi: 10.48550/arxiv.2605.14416 · pith_short_12: QNE6M7I34RDI · pith_short_16: QNE6M7I34RDIGWSD · pith_short_8: QNE6M7I3
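The identifiers on this page appear to follow a simple pattern: the short aliases are prefixes of the 26-character Pith Number, and the number itself matches the unpadded RFC 4648 base32 encoding of the first 16 bytes of the canonical hash. The sketch below checks that against the values shown here. The derivation is inferred from these values rather than taken from a published specification, and the function names are hypothetical.

```python
import base64

# Values copied from this page.
CANONICAL_HASH = "8349e67d1be446835a430279106bc4a00e55747bd9d161c83f34c204a91946be"
PITH_NUMBER = "QNE6M7I34RDIGWSDAJ4RA26EUA"


def pith_from_hash(hex_digest: str) -> str:
    """Base32-encode the first 16 bytes of a SHA-256 hex digest, without padding.

    Assumption: this rule is inferred from the record on this page.
    """
    raw = bytes.fromhex(hex_digest)[:16]
    return base64.b32encode(raw).decode("ascii").rstrip("=")


def short_alias(pith_number: str, length: int) -> str:
    """Return a prefix alias such as pith_short_8 (assumed to be a plain prefix)."""
    return pith_number[:length]


print(pith_from_hash(CANONICAL_HASH) == PITH_NUMBER)  # True
print(short_alias(PITH_NUMBER, 8))                    # QNE6M7I3
```

If the pattern holds in general, anyone who has recomputed the canonical hash can rederive the Pith Number without contacting the resolver.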

Agent API

Resolver JSON · Graph JSON · Events JSON · Schema · Signing key

Verify this Pith Number yourself

curl -sH 'Accept: application/ld+json' https://pith.science/pith/QNE6M7I34RDIGWSDAJ4RA26EUA \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 8349e67d1be446835a430279106bc4a00e55747bd9d161c83f34c204a91946be

Canonical record JSON

{
  "metadata": {
    "abstract_canon_sha256": "f88afeb8bc5a0cdb2dba1101c43bc30c320a2c1924230ad1581bd593fb419220",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2026-05-14T06:05:22Z",
    "title_canon_sha256": "6d597269492c269738a83d7a698cb3dc078be69e23ea18195e50a3a9e7de21a8"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.14416",
    "kind": "arxiv",
    "version": 1
  }
}
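The same check can be run offline on the record shown above. This sketch mirrors the serialization used by the shell pipeline on this page (sorted keys, compact separators, UTF-8, no ASCII escaping); `canonical_sha256` is a hypothetical helper name, and whether the resulting digest equals the published canonical hash depends on the resolver returning exactly this record.

```python
import hashlib
import json

# Canonical record copied from this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "f88afeb8bc5a0cdb2dba1101c43bc30c320a2c1924230ad1581bd593fb419220",
        "cross_cats_sorted": [],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.AI",
        "submitted_at": "2026-05-14T06:05:22Z",
        "title_canon_sha256": "6d597269492c269738a83d7a698cb3dc078be69e23ea18195e50a3a9e7de21a8",
    },
    "schema_version": "1.0",
    "source": {"id": "2605.14416", "kind": "arxiv", "version": 1},
}


def canonical_sha256(obj) -> str:
    """Serialize with sorted keys and compact separators, then hash with SHA-256."""
    payload = json.dumps(
        obj, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


digest = canonical_sha256(record)
print(digest)  # compare by eye against the canonical hash listed on this page
```

Sorting keys before hashing is what makes the digest independent of the key order in which a mirror happens to store the JSON.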