pith. sign in

arxiv: 2604.16316 · v1 · submitted 2026-02-08 · 💻 cs.CY · cs.IR

CrossTraffic: An Open-Source Framework for Reproducible and Executable Transportation Analysis and Knowledge Management

Pith reviewed 2026-05-16 05:41 UTC · model grok-4.3

classification 💻 cs.CY cs.IR
keywords transportation engineeringknowledge graphlarge language modelsreproducibilityopen-source frameworkHighway Capacity Manualexecutable workflowsontology validation
0
0 comments X

The pith

CrossTraffic embeds transportation rules in a knowledge graph so large language models can run accurate, validated analyses via natural language.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents CrossTraffic as an open-source system that converts fragmented transportation methodologies from manuals into a single executable and verifiable computational core. It builds an ontology-driven knowledge graph that stores engineering rules, provenance, and validation logic, then uses this graph as a guardrail when large language models receive natural-language requests. Experiments show the constrained approach produces near-zero numerical error across models and perfectly flags invalid inputs, whereas context-only prompting does not. The result is a reproducible workflow that lets practitioners query complex procedures without losing methodological fidelity.

Core claim

CrossTraffic treats transportation methodologies as continuously deployable software infrastructure. An ontology-driven knowledge graph encodes rules and provenance from sources such as the Highway Capacity Manual and acts as a semantic validation layer. A conversational interface links large language models to this layer through structured tool calls, allowing natural-language access while blocking procedurally invalid analyses. Tests across multiple models report mean absolute error below 0.50 and perfect detection of invalid inputs (F1 approximately 1.0).

What carries the argument

The ontology-driven knowledge graph that encodes engineering rules, provenance, and validation logic from source manuals and serves as the semantic validation layer for LLM-driven analytical workflows.

If this is right

  • Analyses become reproducible across users and platforms because the same validated execution core is invoked every time.
  • Updates to manuals can be propagated once through the graph and immediately affect all connected tools and LLM interfaces.
  • New transportation models and additional manuals can be added modularly without rebuilding the core system.
  • Collaborative development is enabled because the executable core and validation rules are openly available rather than locked in proprietary software.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same pattern of graph-constrained execution could be applied to other domains that rely on dense technical manuals, such as structural engineering or environmental permitting.
  • Over time the framework might reduce dependence on proprietary traffic software by offering an open, extensible alternative that still meets regulatory standards.
  • Periodic audits of the knowledge graph against the latest manual editions would be a practical way to maintain long-term accuracy.

Load-bearing premise

The knowledge graph correctly and completely encodes every engineering rule, provenance detail, and validation check from the source manuals without omissions or encoding errors.

What would settle it

Run a set of standard Highway Capacity Manual calculations both by hand and through the framework after deliberately omitting one rule from the graph, then check whether numerical outputs diverge or invalid inputs go undetected.

Figures

Figures reproduced from arXiv: 2604.16316 by Bin Ran, Rei Tamaru.

Figure 1
Figure 1. Figure 1: Modular architecture of the CrossTraffic platform. The system is stratified into [PITH_FULL_IMAGE:figures/full_fig_p006_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: CrossTraffic deployed across multiple platforms. (a) Linux console. (b) Web [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The validation execution pipeline. The pipeline converts heterogeneous design [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: The KG Ontology. Rules (Yellow) act as the central connectors, linking [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Case study visualization on WIS 35/WIS 65 along the River Falls Bypass [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Impact of framework augmentation on computational error. Blue bars: base [PITH_FULL_IMAGE:figures/full_fig_p017_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Qualitative scoring heatmap across multiple agents with different augmentation [PITH_FULL_IMAGE:figures/full_fig_p018_7.png] view at source ↗
read the original abstract

Transportation engineering often relies on technical manuals and analytical tools for planning, design, and operations. However, the dissemination and management of these methodologies, such as those defined in the Highway Capacity Manual (HCM), remain fragmented. Computational procedures are often embedded within proprietary tools, updates are inconsistently propagated across platforms, and knowledge transfer is limited. These challenges hinder reproducibility, interoperability, and collaborative advancement in transportation analysis. This paper introduces CrossTraffic, an open-source framework that treats transportation methodologies and regulatory knowledge as continuously deployable and verifiable software infrastructure. CrossTraffic provides an executable computational core for transportation analysis with cross-platform access through standardized interfaces. An ontology-driven knowledge graph encodes engineering rules and provenance and serves as a semantic validation layer for analytical workflows. A conversational interface further connects large language models to this validated execution environment through structured tool invocation, enabling natural-language access while preventing procedurally invalid analyses. Experimental results show that knowledge-graph-constrained execution substantially improves numerical accuracy and methodological fidelity compared with context-only approaches, achieving near-zero numerical error (MAE<0.50) across multiple large language models and perfect detection of invalid analytical inputs in stress testing (F1~=~1.0). Its modular architecture supports the integration of additional transportation manuals and research models, providing a foundation for an open and collaborative transportation science ecosystem with a reproducible computational core. The system implementation is publicly available at https://github.com/crosstraffic.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The paper introduces CrossTraffic, an open-source framework for transportation analysis that uses an ontology-driven knowledge graph to encode engineering rules and provenance from sources like the Highway Capacity Manual. It provides executable computational cores, standardized interfaces, and a conversational LLM interface with structured tool invocation for validated analyses. Experiments claim that KG-constrained execution achieves MAE < 0.50 and F1 ≈ 1.0 for validity detection compared to context-only approaches.

Significance. If the results hold, this framework could significantly advance reproducibility and accessibility in transportation engineering by providing a verifiable, open-source platform that integrates knowledge management with computational execution, potentially reducing reliance on proprietary tools and improving knowledge transfer.

major comments (3)
  1. [Experimental results] Experimental results section: The headline claims of MAE<0.50 numerical accuracy and F1≈1.0 for invalid-input detection are presented as summary statistics with no description of the test cases, baselines, data splits, number of trials, or error-bar reporting, so the central performance claims cannot be verified from the manuscript.
  2. [Methods / Knowledge Graph] Knowledge-graph construction: The abstract and methods state that the ontology-driven KG 'encodes engineering rules and provenance' from the Highway Capacity Manual, but supply no account of the encoding process, expert review, automated consistency checks, or side-by-side validation against reference manual calculations; this omission directly undermines the claim that KG-constrained execution improves methodological fidelity.
  3. [Evaluation / Stress testing] Evaluation design: The stress-testing protocol for perfect detection of invalid analyses risks circularity because the test inputs may have been generated from the same KG that is used for validation; without an independently sourced test set, the F1≈1.0 result cannot be taken as evidence of general robustness.
minor comments (1)
  1. [Abstract] The phrase 'near-zero numerical error (MAE<0.50)' should be clarified as to whether the MAE is absolute or relative and what units are used for the transportation metrics.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive and detailed feedback. We address each major comment below and have revised the manuscript to strengthen the verifiability of our claims.

read point-by-point responses
  1. Referee: Experimental results section: The headline claims of MAE<0.50 numerical accuracy and F1≈1.0 for invalid-input detection are presented as summary statistics with no description of the test cases, baselines, data splits, number of trials, or error-bar reporting, so the central performance claims cannot be verified from the manuscript.

    Authors: We agree that the original Experimental Results section provided insufficient detail on the experimental protocol. In the revised manuscript we have expanded this section to describe the full set of test cases (including specific HCM-derived scenarios and synthetic invalid inputs), the context-only and other baselines, the train/test splits used, the number of independent trials, and error bars on all reported metrics. These additions enable direct verification of the MAE < 0.50 and F1 ≈ 1.0 figures. revision: yes

  2. Referee: Knowledge-graph construction: The abstract and methods state that the ontology-driven KG 'encodes engineering rules and provenance' from the Highway Capacity Manual, but supply no account of the encoding process, expert review, automated consistency checks, or side-by-side validation against reference manual calculations; this omission directly undermines the claim that KG-constrained execution improves methodological fidelity.

    Authors: We acknowledge that the manuscript omitted a detailed description of KG construction. We have added a new subsection in Methods that documents the encoding workflow, the mapping of specific HCM procedures into ontology classes and relations, the expert review process, the automated consistency checks applied, and quantitative side-by-side comparisons of KG-derived results against reference manual calculations. This material directly supports the fidelity claims. revision: yes

  3. Referee: Evaluation design: The stress-testing protocol for perfect detection of invalid analyses risks circularity because the test inputs may have been generated from the same KG that is used for validation; without an independently sourced test set, the F1≈1.0 result cannot be taken as evidence of general robustness.

    Authors: We have revised the Evaluation section to clarify that the invalid-input test cases were drawn from an independent collection of published transportation case studies and expert-authored scenarios that were never used during KG construction or rule encoding. We now report the provenance of this test set and the procedure used to ensure it is disjoint from the KG content, thereby removing the circularity concern while preserving the reported F1 score. revision: yes

Circularity Check

0 steps flagged

No significant circularity; framework presents independent experimental results

full rationale

The paper describes a software framework (CrossTraffic) that integrates an ontology-driven knowledge graph for validating transportation analysis workflows and constraining LLM tool calls. Reported metrics (MAE<0.50 numerical error and F1≈1.0 invalid-input detection) arise from direct empirical comparisons of KG-constrained execution versus context-only baselines across multiple LLMs, plus stress testing. No equations, fitted parameters, or derivations appear in the provided text that would reduce these outcomes to self-referential definitions or inputs by construction. The knowledge graph is positioned as an encoding of external manuals (e.g., HCM), and the evaluation is framed as testing fidelity against that encoding rather than deriving the metrics from the encoding itself. This matches the reader's assessment of minimal circularity risk and qualifies as self-contained against external benchmarks per the guidelines.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The framework rests on the assumption that source manuals can be faithfully translated into an ontology without loss of procedural validity; no numerical parameters are fitted to data in the reported results.

axioms (1)
  • domain assumption Transportation engineering rules from manuals such as the Highway Capacity Manual can be encoded completely and correctly into an ontology that supports semantic validation of workflows.
    The knowledge graph is presented as the authoritative validation layer; any incompleteness in this encoding would invalidate the fidelity claims.
invented entities (1)
  • CrossTraffic knowledge graph no independent evidence
    purpose: Encodes engineering rules and provenance to serve as semantic validation layer for analytical workflows
    Newly constructed component introduced by the paper; no independent evidence outside the framework itself is supplied.

pith-pipeline@v0.9.0 · 5556 in / 1414 out tokens · 32064 ms · 2026-05-16T05:41:40.313525+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages · 1 internal anchor

  1. [1]

    L. Oman, A. J. Wilson, F. D. Harrison, Implementing transportation knowledge networks, Research Report 20-75, National Academies of Sciences, Engineering, and Medicine, 2009

  2. [2]

    Kleinsteuber, T

    E. Kleinsteuber, T. Al Mustafa, F. Zander, B. König-Ries, S. Babalou, Managing provenance data in knowledge graph management platforms, Datenbank Spektrum 24 (2024) 43–52

  3. [3]

    C. W. Jenks, C. Hedges, A. C. Lemer, S. Moore, E. P. Delaney, S. E. Hitchcock, A guide to agency-wide knowledge management for state departments of transportation, NCHRP Report 813, National Academies of Sciences, Engineering, and Medicine, 2015

  4. [4]

    F. D. Harrison, C. Brown, K. Admas, T. Hall, Knowledge management at state departments of transportation research roadmap, Research Report 1134, National Academies of Sciences, Engineering, and Medicine, 2025

  5. [5]

    X. Bai, S. He, Y. Li, X. Yabo, Z. Xin, D. Wenli, L. Jian-Rong, Con- struction of a knowledge graph for framework material enabled by large language models and its application, npj Computational Materials 11 (2025). 21

  6. [6]

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    S. Bubeck, V. Chandrasekaran, R. Eldan, J. Gehrke, E. Horvitz, E. Ka- mar, P. Lee, Y. T. Lee, Y. Li, S. Lundberg, et al., Sparks of artificial general intelligence: Early experiments with gpt-4, arXiv preprint arXiv:2303.12712 (2023)

  7. [7]

    Pangaribuan, A

    L. Pangaribuan, A. Satrya, The role of knowledge management, trans- formational leadership, and organizational commitment on employee performance: empirical study in public sector, Journal of Theory and Applied Management 17 (2024) 355–371

  8. [8]

    Bolisani, C

    E. Bolisani, C. Bratianu, The elusive definition of knowledge, volume None ofKnowledge Management and Organizational Learning, none ed., Springer, 2018

  9. [9]

    F. Olan, O. E. Arakpogun, J. Suklan, F. Nakpodia, N. Damij, U. Jayaw- ickrama, Artificial intelligence and knowledge sharing: Contributing factors to organizational performance, Journal of Business Research 145 (2022) 605–615

  10. [10]

    M. H. Jarrahi, D. Askay, A. Eshraghi, P. Smith, Artificial intelligence and knowledge management: A partnership between human and ai, Business Horizons 66 (2023) 87–99

  11. [11]

    Weerakkody, M

    V. Weerakkody, M. Janssen, R. El-Haddadeh, The resurgence of business process re-engineering in public sector transformation efforts: Exploring the systemic challenges and unintended consequences, Inf Syst E-Bus Manage 19 (2021) 993–1014

  12. [12]

    Higgins, The dcc curation lifecycle model, International Journal of Digital Curation 3 (2008) 134–140

    S. Higgins, The dcc curation lifecycle model, International Journal of Digital Curation 3 (2008) 134–140

  13. [13]

    Irani, R

    Z. Irani, R. M. Abril, V. Weerakkody, A. Omar, U. Sivarajah, The impact of legacy systems on digital transformation in european public administration: Lesson learned from a multi case analysis, Government Information Quarterly 40 (2023) 101784

  14. [14]

    Chowdhury, K

    M. Chowdhury, K. Dey, A. Apon, Data analytics for intelligent trans- portation systems, Elsevier, 2024

  15. [15]

    Kušić, R

    K. Kušić, R. Schumann, E. Ivanjko, A digital twin in transportation: Real-time synergy of traffic data streams and simulation for virtualiz- ing motorway dynamics, Advanced Engineering Informatics 55 (2023) 101858. 22

  16. [16]

    Goumopoulos, Smart city middleware: A survey and a conceptual framework, IEEE Access 12 (2024) 4015–4047

    C. Goumopoulos, Smart city middleware: A survey and a conceptual framework, IEEE Access 12 (2024) 4015–4047

  17. [17]

    Irfan, M

    I. Irfan, M. S. U. K. Sumbal, F. Khurshid, F. Chan, Toward a resilient supply chain model: critical role of knowledge management and dynamic capabilities, Industrial management & data systems 122 (2022) 1153– 1182

  18. [18]

    E. F. Z. Santana, A. P. Chaves, M. A. Gerosa, F. Kon, D. S. Milojicic, Software platforms for smart cities: Concepts, requirements, challenges, and a unified reference architecture, ACM Comput. Surv. 50 (2017)

  19. [19]

    Riehl, K

    K. Riehl, K. A., M. A. M., Revisiting reproducibility in transportation simulation studies, European Transport Research Review 17 (2025)

  20. [20]

    M. K., M. F., Ontologies for transportation research: A survey, Trans- portation Research Part C: Emerging Technologies 89 (2018) 53–82

  21. [21]

    Zhang, Z

    Q. Zhang, Z. Ma, P. e. a. Zhang, Mobility knowledge graph: Review and its application in public transport, Transportation (2025) 1119–1145

  22. [22]

    Lewis, W

    P.Lewis, E.Perez, A.Piktus, F.Petroni, V.Karpukhin, N.Goyal, H.Küt- tler, M. Lewis, W. Yih, T. Rocktäschel, et al., Retrieval-augmented generation for knowledge-intensive nlp tasks, Advances in Neural Infor- mation Processing Systems 33 (2020) 9459–9474

  23. [23]

    Tupayachi, H

    J. Tupayachi, H. Xu, O. A. Omitaomu, M. C. Camur, A. Sharmin, X. Li, Towards next-generation urban decision support systems through ai-powered construction of scientific ontology using large language mod- els—a case in optimizing intermodal freight transportation, Smart Cities 7 (2024) 2392–2421

  24. [24]

    S. Ye, Q. Wu, P. Fan, Q. Fan, A survey on semantic communications in internet of vehicles, Entropy 27 (2025)

  25. [25]

    Y. Lu, B. Yao, H. Gu, J. Huang, Z. J. Wang, Y. Li, J. Gesi, Q. He, T. J. Li, D. Wang, Uxagent: An llm agent-based usability testing framework for web design, in: Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, CHI EA ’25, Association for Computing Machinery, New York, NY, USA, 2025

  26. [26]

    Masri, H

    S. Masri, H. I. Ashqar, M. Elhenawy, Large language models (llms) as traffic control systems at urban intersections: A new paradigm, Vehicles 7 (2025). 23

  27. [27]

    H. Yang, M. Siew, C. Joe-Wong, An llm-based digital twin for optimizing human-in-the loop systems, in: 2024 IEEE International Workshop on Foundation Models for Cyber-Physical Systems & Internet of Things (FMSys), 2024, pp. 26–31

  28. [28]

    C. Cui, Y. Ma, X. Cao, W. Ye, Y. Zhou, K. Liang, J. Chen, J. Lu, Z. Yang, K. Liao, T. Gao, E. Li, K. Tang, Z. Cao, T. Zhou, A. Liu, X. Yan, S. Mei, J. Cao, Z. Wang, C. Zheng, A survey on multimodal large language models for autonomous driving, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2024, pp. 958–979

  29. [29]

    T. Nie, J. Sun, W. Ma, Exploring the roles of large language models in reshaping transportation systems: A survey, framework, and roadmap, Artificial Intelligence for Transportation 1 (2025) 100003

  30. [30]

    T. R. Gruber, A translation approach to portable ontology specifications, Knowledge Acquisition 5 (1993) 199–220

  31. [31]

    Studer, V

    R. Studer, V. Benjamins, D. Fensel, Knowledge engineering: Principles and methods, Data & Knowledge Engineering 25 (1998) 161–197

  32. [32]

    Hogan, E

    A. Hogan, E. Blomqvist, M. Cochez, C. d’Amato, G. D. Melo, C. Gutier- rez, S. Kirrane, J. E. L. Gayo, R. Navigli, S. Neumaier, et al., Knowledge graphs, ACM Computing Surveys (Csur) 54 (2021) 1–37

  33. [33]

    M. Mora, F. Wang, J. M. Gómez, G. Phillips-Wren, Development methodologies for ontology-based knowledge management systems: A review, Expert Systems 39 (2022) e12851

  34. [34]

    McBride, The resource description framework (RDF) and its vocab- ulary description language RDFS, Springer Berlin Heidelberg, Berlin, Heidelberg, 2004, pp

    B. McBride, The resource description framework (RDF) and its vocab- ulary description language RDFS, Springer Berlin Heidelberg, Berlin, Heidelberg, 2004, pp. 51–65

  35. [35]

    Janowicz, P

    K. Janowicz, P. Hitzler, W. Li, D. Rehberger, M. Schildhauer, R. Zhu, C. Shimizu, C. Fisher, L. Cai, G. Mai, et al., Know, know where, knowwheregraph: A densely connected, cross-domain knowledge graph and geo-enrichment service stack for applications in environmental intel- ligence, AI Magazine 43 (2022) 30–39

  36. [36]

    Fernandez, R

    S. Fernandez, R. Hadfi, T. Ito, I. Marsa-Maestre, J. R. Velasco, Ontology- based architecture for intelligent transportation systems using a traffic sensor network, Sensors 16 (2016) 1287. 24

  37. [37]

    B. Gan, D. Zhang, Z. Huang, F. Zheng, R. Zhu, W. Zhang, Ontology- driven knowledge graph for decision-making in resilience enhancement of underground structures: Framework and application, Tunnelling and Underground Space Technology 163 (2025) 106739

  38. [38]

    Moradi, A

    M. Moradi, A. Aghaie, M. Hosseini, Knowledge-collector agents: Apply- ing intelligent agents in marketing decisions with knowledge management approach, Knowledge-Based Systems 52 (2013) 181–193

  39. [39]

    N. D. Matsakis, F. S. K., The rust language, Proceedings of the 2014 ACM SIGAda annual conference on High integrity language technology (2014)

  40. [40]

    URL:https://www.w3.org/TR/ wasm-core-2/

    WebAssembly Project and Contributors, WebAssembly core specifica- tion, Technical Report, W3C, 2022. URL:https://www.w3.org/TR/ wasm-core-2/

  41. [41]

    PyO3 Project and Contributors, PyO3, https://github.com/PyO3/ pyo3, 2017–2025

  42. [42]

    P. A. Lopez, M. Behrisch, L. Bieker-Walz, J. Erdmann, Y. Flötteröd, R. Hilbrich, L. Lücken, J. Rummel, P. Wagner, E. Wiessner, Microscopic traffic simulation using sumo, in: 2018 21st International Conference on Intelligent Transportation Systems (ITSC), 2018, pp. 2575–2582

  43. [43]

    Shankar, J

    S. Shankar, J. Zamfirescu-Pereira, B. Hartmann, A. Parameswaran, I. Arawjo, Who validates the validators? aligning llm-assisted evaluation of llm outputs with human preferences, in: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, UIST ’24, Association for Computing Machinery, New York, NY, USA, 2024

  44. [44]

    Polak, D

    M. Polak, D. Morgan, Extracting accurate materials data from research papers with conversational language models and prompt engineering, Nat Commun 15 (2024)

  45. [45]

    Huang, Y

    H. Huang, Y. Tang, X. Shi, X. Mao, Dependency-uware neural topic model, Information Processing & Management 61 (2024) 103530

  46. [46]

    A comprehensive survey of retrieval- augmented generation (rag): Evolution, current landscape and future directions.arXiv preprint arXiv:2410.12837, 2024

    S. Gupta, R. Ranjan, S. N. Singh, A comprehensive survey of retrieval- augmented generation (rag): Evolution, current landscape and future directions, arXiv preprint arXiv:2410.12837 (2024). 25

  47. [47]

    J. Wood, I. Schalkwyk, Reproducibility in transportation research: Importance, best practices, and dealing with protected and sensitive data, Journal of Transportation Technologies 15 (2025) 179–202. 26