End-to-end differentiable network traffic simulation with dynamic route choice

Toru Seo

arxiv: 2604.11380 · v3 · submitted 2026-04-13 · 📡 eess.SY · cs.SY

End-to-end differentiable network traffic simulation with dynamic route choice

Toru Seo This is my paper

Pith reviewed 2026-05-10 15:55 UTC · model grok-4.3

classification 📡 eess.SY cs.SY

keywords differentiable simulationtraffic flow modelautomatic differentiationdynamic user optimumcongestion tolllink transmission modelgradient optimization

0 comments

The pith

An end-to-end differentiable traffic simulator using automatic differentiation enables efficient optimization of dynamic congestion tolls on large urban networks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper develops a network traffic simulator that is fully differentiable, allowing automatic computation of gradients with respect to parameters like tolls. It achieves this by combining the Link Transmission Model, whose min and max operations have subgradients, with a Dynamic User Optimum route choice model that produces continuous diverge ratios. Sympathetic readers would care because this removes the barrier of deriving complex gradients by hand or using slow numerical methods, opening the door to optimizing real-world traffic systems with thousands of variables.

Core claim

The paper introduces an end-to-end differentiable simulator based on the Link Transmission Model and Dynamic User Optimum route choice, where piecewise-linear operations admit subgradients and diverge ratios are continuous, thus supporting automatic differentiation for solving large optimization problems such as dynamic toll setting on the Chicago-Sketch network with 2500 links and 15000 variables in 40 minutes.

What carries the argument

Automatic differentiation applied to the Link Transmission Model's piecewise-linear min/max operations and the continuous diverge ratios derived from the Dynamic User Optimum model.

Load-bearing premise

Subgradients of the piecewise-linear min/max operations and the continuous diverge ratios from the DUO model provide sufficiently accurate gradients for stable convergence in gradient-based optimization.

What would settle it

A failure of the proposed simulator to produce a high-quality toll optimization solution on the Chicago-Sketch dataset within 3000 iterations would falsify the claim of its practical effectiveness.

Figures

Figures reproduced from arXiv: 2604.11380 by Toru Seo.

**Figure 1.** Figure 1: The simulation framework combining LTM traffic flow model and DUO route choice. 3.1 Parameters and state variables, and what the differentiable simulator means A road network is represented as a directed graph G = (N ,L), where N denotes a set of nodes and L denotes a set of directed links. Each link l ∈ L connects an upstream node to a downstream node. For each node ν ∈ N , we define L in ν and L out ν as… view at source ↗

**Figure 2.** Figure 2: Triangular FD. The LTM uses the cumulative vehicle counts (i.e., the number of vehicles that passed the location from a certain reference time to the current time) at the upstream and downstream ends of each link as its state variables. They are denoted as Nl,U (t) for upstream and Nl,D(t) for downstream, respectively, of link l at time t. The LTM is derived from Newell’s simplified kinematic wave (KW) the… view at source ↗

**Figure 3.** Figure 3: Mechanism of LTM. Top: time–space diagram on link l. Bottom: its cumulative curve plot. Link l determines its demand Dl and supply Sl considering its traffic state. Then, the node model determines inflow f in l and outflow f out l considering demand and supply of connected links. where tp denotes the exit time from the p-th link. The total travel time is tP − t0. The path P itself can be determined by a ti… view at source ↗

**Figure 4.** Figure 4: Vehicle trajectories and cumulative counts (adapted from Seo (2023)). (e.g., Dl(t) is denoted as Dl) [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Node with multiple incoming and outgoing links. Let Bν = [blo] denote the turning fraction matrix, where blo is the fraction of outflow from inlink l directed to outlink o, and let αl denote the merge priority of inlink l. The value of Bν is obtained from the route choice model. The INM initializes inflow allocations qˆl = 0 for all l and outflow allocations qˆo = 0 for all o, then iterates the following s… view at source ↗

**Figure 6.** Figure 6: Conceptual illustration of logit-DUO. In summary, the DUO and logit-DUO models preserve end-to-end differentiability: the gradient flows from the objective through the per-destination cumulative counts, through the diverge ratio computation, through the node models, and back to the FD and demand parameters. Furthermore, the logit-DUO typically has a non-zero gradient with respect to link or path cost. The … view at source ↗

**Figure 7.** Figure 7: Merge network (a) t = 300 s (b) t = 800 s [PITH_FULL_IMAGE:figures/full_fig_p019_7.png] view at source ↗

**Figure 8.** Figure 8: Network speed (normalized by free-flow speed). The width represents the density at the segment. First, we define the total travel time (TTT) as an objective function: TTT = X l∈L T XS−1 t=0 max{Nl,U (t) − Nl,D(t), 0} · ∆t + X ν∈Norig T XS−1 t=0 rν(t) · ∆t, (29) where Norig denotes the set of origin nodes. The first term accounts for vehicles on links, and the second term accounts for vehicles in vertical q… view at source ↗

**Figure 9.** Figure 9: Time-space diagrams of density (normalized by jam density). Note that these values are obtained by numerically differentiating N for visualization purposes, and thus some numerical noises exist, especially near the link borders. With respect to the FD parameters, the following values are obtained: ∂TTT ∂u1 = −1278.282, ∂TTT ∂u2 = −616.873, ∂TTT ∂u3 = −2024.685. (31) All values are negative, meaning that in… view at source ↗

**Figure 10.** Figure 10: Two-routes network (a) Fast route. The bottleneck is at 1000 m location. (b) Slow route [PITH_FULL_IMAGE:figures/full_fig_p022_10.png] view at source ↗

**Figure 11.** Figure 11: Time–space diagrams of normalized density in the two-route network. We compute partial derivatives with respect to the bottleneck capacity q ∗ BN. The results are summarized in [PITH_FULL_IMAGE:figures/full_fig_p022_11.png] view at source ↗

**Figure 12.** Figure 12: Simulated average link delay in the Chicago-Sketch data scenario without pricing. “Delay” is the ratio of the excess of the average link travel time over the free-flow travel time. 24 [PITH_FULL_IMAGE:figures/full_fig_p024_12.png] view at source ↗

**Figure 13.** Figure 13: shows the average network state in the best pricing case. By comparing with the no-toll case ( [PITH_FULL_IMAGE:figures/full_fig_p025_13.png] view at source ↗

**Figure 14.** Figure 14: Convergence of the objective function, TTT, and gradient [PITH_FULL_IMAGE:figures/full_fig_p026_14.png] view at source ↗

**Figure 15.** Figure 15: Average link toll in network. 4.2.4 Comparison with SPSA In order to quantify the advantage of the proposed AD-based gradient computation, we solved the same congestion pricing problem using SPSA (Spall, 1998), a conventional derivative-free optimization method in the simulation and DTA literature (Balakrishna et al., 2007; Lu et al., 2015). SPSA estimates the gradient by evaluating the objective function… view at source ↗

**Figure 16.** Figure 16: Time-series of toll and traffic states. where the step size ak = a/(A + k) α and perturbation magnitude ck = c/kγ follow the standard decay schedule recommended by Spall (1998) with A = 100, α = 0.602, γ = 0.101, and the initial parameters c = 30 and a = 0.0001 are calibrated to achieve the best performance as much as possible. The objective function for SPSA is the same as that for AD, Eq. (36). In order… view at source ↗

**Figure 17.** Figure 17: Macroscopic Fundamental Diagram. set to 17 000, which took 8532 sec [PITH_FULL_IMAGE:figures/full_fig_p029_17.png] view at source ↗

**Figure 18.** Figure 18: Comparison between AD and SPSA. software on GitHub (https://github.com/toruseo/UNsim) and PyPI (pip install unsim). All code that reproduces the presented results is also published in the same GitHub repository. For future research, the following directions are worth considering. First, there is room for improvement in the model and algorithms to facilitate numerical applications. During this study’s nume… view at source ↗

read the original abstract

Optimization using network traffic models requires computing gradients of objective functions with respect to model parameters. However, derivation of such gradients has often been considered difficult or impractical due to their complexity and size. Conventional approaches rely on numerical differentiation or derivative-free methods that do not scale well with the parameter dimension, or on adjoint methods that require manual derivation for each specific model. This study proposes a novel end-to-end differentiable network traffic flow simulator based on automatic differentiation (AD), employing the Link Transmission Model (LTM) and a Dynamic User Optimum (DUO) route choice model. The LTM operates on continuous aggregate state variables through piecewise-linear min/max operations, which admit subgradients almost everywhere and thus require no smooth relaxation for AD. The DUO is also suitable for AD: although the shortest path search is itself discrete, the resulting diverge ratios at each node are continuous functions of per-destination vehicle counts and are thus differentiable. In order to demonstrate the capability of the proposed model, we solved a dynamic congestion toll optimization problem on the Chicago-Sketch dataset with approximately 2500 links, 1 million vehicles, a 3-hour duration, and 15000 decision variables. The proposed model successfully derived a high-quality solution in 3000 iterations, taking about 40 minutes. The simulator, implemented in Python and JAX, is released as open-source software named UNsim (https://github.com/toruseo/UNsim).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper delivers a JAX implementation of an end-to-end differentiable LTM+DUO simulator, open-sourced, and demonstrates it on a 15k-variable toll optimization over the Chicago-Sketch network.

read the letter

The core advance is an open-source JAX simulator that runs automatic differentiation straight through the Link Transmission Model's piecewise-linear min/max operations and the continuous diverge ratios from the Dynamic User Optimum route choice model, without any smoothing. They used it to optimize dynamic congestion tolls on a network with roughly 2500 links and a million vehicles, reaching a high-quality solution in 3000 iterations that took about 40 minutes. That scale is the practical payoff: it shows the approach can handle problems large enough to matter for real planning work.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes an end-to-end differentiable traffic simulator combining the Link Transmission Model (LTM) with piecewise-linear min/max operations and a Dynamic User Optimum (DUO) route-choice model whose diverge ratios are continuous in vehicle counts. Implemented in JAX, the simulator is used to solve a dynamic congestion-toll optimization problem on the Chicago-Sketch network (~2500 links, 1 million vehicles, 3-hour horizon, 15 000 decision variables), reporting a high-quality solution after 3000 iterations in roughly 40 minutes. The code is released as open-source UNsim.

Significance. If the automatically differentiated subgradients prove accurate and stable, the work would enable gradient-based optimization of traffic models at scales that previously required derivative-free or manually derived adjoint methods, representing a practical advance for large-scale transportation network design. The open-source release strengthens reproducibility and potential follow-on use.

major comments (2)

[Abstract] Abstract: the central claim that LTM min/max operations 'admit subgradients almost everywhere and thus require no smooth relaxation' and that DUO diverge ratios 'are continuous functions of per-destination vehicle counts and are thus differentiable' is load-bearing for the reported 3000-iteration, 15 000-variable convergence. No verification is supplied that JAX's automatic subgradient selection at the non-differentiable loci matches finite-difference gradients or avoids zero-gradient directions and instability when back-propagated through the full network and route-choice layers.
[Results (Chicago-Sketch toll optimization)] Chicago-Sketch experiment: the statement that a 'high-quality solution' was obtained provides no diagnostics (gradient-norm histories, comparison against a non-differentiable baseline, or sensitivity to subgradient choice) that would confirm the AD gradients, rather than algorithmic heuristics, drove reliable convergence on the 15 000-variable instance.

minor comments (2)

[Abstract] The abstract and methods should explicitly state the precise form of the toll-optimization objective and any regularization terms applied to the 15 000 decision variables.
[Figures] Figure captions and axis labels in the results section would benefit from clearer indication of which curves correspond to the differentiable simulator versus any reference methods.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The two major comments correctly identify that the manuscript currently lacks explicit numerical verification of the automatic-differentiation subgradients and supporting convergence diagnostics. We address both points below and will incorporate the requested material in the revised manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that LTM min/max operations 'admit subgradients almost everywhere and thus require no smooth relaxation' and that DUO diverge ratios 'are continuous functions of per-destination vehicle counts and are thus differentiable' is load-bearing for the reported 3000-iteration, 15 000-variable convergence. No verification is supplied that JAX's automatic subgradient selection at the non-differentiable loci matches finite-difference gradients or avoids zero-gradient directions and instability when back-propagated through the full network and route-choice layers.

Authors: We agree that direct numerical verification of the subgradients is necessary to support the central claims. In the revised manuscript we will add a dedicated verification subsection (likely in Section 3 or 4) that compares JAX-computed subgradients against central finite-difference approximations on small synthetic networks (both for isolated LTM min/max operations and for the full LTM+DUO pipeline). We will also report the frequency of non-differentiable points encountered during the Chicago-Sketch run and any safeguards (e.g., subgradient selection rules) used by JAX. These additions will be limited to a few pages and will not alter the core algorithmic contribution. revision: yes
Referee: [Results (Chicago-Sketch toll optimization)] Chicago-Sketch experiment: the statement that a 'high-quality solution' was obtained provides no diagnostics (gradient-norm histories, comparison against a non-differentiable baseline, or sensitivity to subgradient choice) that would confirm the AD gradients, rather than algorithmic heuristics, drove reliable convergence on the 15 000-variable instance.

Authors: We acknowledge that the current presentation of the Chicago-Sketch results is insufficient to isolate the contribution of the AD gradients. In revision we will augment the results section with (i) a plot of gradient-norm history over the 3000 iterations, (ii) a brief comparison of final objective values obtained with the differentiable simulator versus a derivative-free baseline (e.g., a simple random-search or Nelder-Mead run on a reduced problem), and (iii) a short sensitivity test repeating the optimization with different JAX subgradient modes or with added smoothing. These diagnostics will be presented concisely and will strengthen the claim that the reported convergence is attributable to the end-to-end differentiability. revision: yes

Circularity Check

0 steps flagged

Differentiability and optimization results follow from model properties and empirical demonstration without circular reduction

full rationale

The paper asserts that LTM piecewise-linear min/max operations admit subgradients almost everywhere (allowing direct AD) and that DUO diverge ratios are continuous functions of per-destination vehicle counts (hence differentiable). These are presented as intrinsic mathematical properties of the selected models rather than results derived from fitted parameters, self-referential definitions, or prior self-citations. The central empirical claim—successful convergence of gradient-based toll optimization on the external Chicago-Sketch network after 3000 iterations—is a reported outcome of running the simulator, not a quantity forced by construction or reduced to the inputs. No load-bearing step in the derivation chain (as described in the abstract) equates a prediction to its own fitted or defined inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The work relies on existing traffic models plus the mathematical fact that piecewise-linear min/max admit subgradients and that DUO diverge ratios are continuous in vehicle counts. No new free parameters or invented entities are introduced.

axioms (2)

domain assumption LTM operates on continuous aggregate state variables through piecewise-linear min/max operations, which admit subgradients almost everywhere and thus require no smooth relaxation for AD.
Stated directly in the abstract as the reason AD works without modification.
domain assumption The DUO shortest-path search yields diverge ratios at each node that are continuous functions of per-destination vehicle counts and are thus differentiable.
Stated in the abstract as the property that makes the route-choice component compatible with AD.

pith-pipeline@v0.9.0 · 5552 in / 1414 out tokens · 38723 ms · 2026-05-10T15:55:37.329530+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

65 extracted references · 65 canonical work pages

[1]

Towards differentiable agent-based simulation

Andelfinger, P., 2023. Towards differentiable agent-based simulation. ACM Transactions on Modeling and Computer Simulation 33 (1--2), 1--26

work page 2023
[2]

N., 2007

Balakrishna, R., Ben-Akiva, M., Koutsopoulos, H. N., 2007. Offline calibration of dynamic traffic assignment: Simultaneous demand-and-supply estimation. Transportation Research Record 2003, 50--58

work page 2007
[3]

G., Pearlmutter, B

Baydin, A. G., Pearlmutter, B. A., Radul, A. A., Siskind, J. M., 2018. Automatic differentiation in machine learning: a survey. Journal of Machine Learning Research 18 (153), 1--43

work page 2018
[4]

On a routing problem

Bellman, R., 1958. On a routing problem. Quarterly of applied mathematics 16 (1), 87--90

work page 1958
[5]

Discrete choice methods and their applications to short term travel decisions

Ben-Akiva, M., Bierlaire, M., 1999. Discrete choice methods and their applications to short term travel decisions. In: Hall, R. (Ed.), Handbook of Transportation Science. Springer, pp. 5--33

work page 1999
[6]

J., Leary, C., Maclaurin, D., Necula, G., Paszke, A., VanderPlas, J., Wanderman-Milne, S., Zhang, Q., 2018

Bradbury, J., Frostig, R., Hawkins, P., Johnson, M. J., Leary, C., Maclaurin, D., Necula, G., Paszke, A., VanderPlas, J., Wanderman-Milne, S., Zhang, Q., 2018. JAX : composable transformations of Python+NumPy programs. http://github.com/jax-ml/jax

work page 2018
[7]

Chen, R. T. Q., Rubanova, Y., Bettencourt, J., Duvenaud, D., 2018. Neural ordinary differential equations. In: Advances in Neural Information Processing Systems (NeurIPS)

work page 2018
[8]

A simulation-based optimization algorithm for dynamic large-scale urban transportation problems

Chong, L., Osorio, C., 2018. A simulation-based optimization algorithm for dynamic large-scale urban transportation problems. Transportation Science 52 (3), 637--656

work page 2018
[9]

F., 1994

Daganzo, C. F., 1994. The cell transmission model: A dynamic representation of highway traffic consistent with the hydrodynamic theory. Transportation Research Part B: Methodological 28 (4), 269--287

work page 1994
[10]

F., 1995

Daganzo, C. F., 1995. The cell transmission model, part II : Network traffic. Transportation Research Part B: Methodological 29 (2), 79--93

work page 1995
[11]

L., 2024

Dantsuji, T., Ngoduy, D., Pu, Z., Lee, S., Vu, H. L., 2024. A hybrid neural network for real-time OD demand calibration under disruptions. arXiv preprint arXiv:2408.06659

work page arXiv 2024
[12]

K., 2025

Du, K., Lee, E., Ma, Q., Su, Z., Zhang, S., Lo, H. K., 2025. Modeling metro passenger routing choices with a fully differentiable end-to-end simulation-based optimization ( SBO ) approach. Transportation Science 59 (4), 802--822

work page 2025
[13]

o tter \

Fl \"o tter \"o d, G., Rohde, J., 2011. Operational macroscopic modeling of complex urban road intersections. Transportation Research Part B: Methodological 45 (6), 903--922

work page 2011
[14]

R., 1956

Ford, L. R., 1956. Network flow theory. RAND Corporation Paper, Santa Monica, 1956

work page 1956
[15]

L., Bernstein, D., Smith, T

Friesz, T. L., Bernstein, D., Smith, T. E., Tobin, R. L., Wie, B.-W., 1993. A variational inequality formulation of the dynamic network user equilibrium problem. Operations Research 41 (1), 179--191

work page 1993
[16]

Managing network congestion with link-based incentives: A surrogate-based optimization approach

Fu, Q., Wu, J., Wu, X., Sun, J., Tian, Y., 2024. Managing network congestion with link-based incentives: A surrogate-based optimization approach. Transportation Research Part A: Policy and Practice 182, 104033

work page 2024
[17]

F., 2007

Geroliminis, N., Daganzo, C. F., 2007. Macroscopic modeling of traffic in cities. In: Transportation Research Board 86th Annual Meeting

work page 2007
[18]

Hysteresis phenomena of a macroscopic fundamental diagram in freeway networks

Geroliminis, N., Sun, J., 2011 a . Hysteresis phenomena of a macroscopic fundamental diagram in freeway networks. Transportation Research Part A: Policy and Practice 45 (9), 966--979

work page 2011
[19]

Properties of a well-defined macroscopic fundamental diagram for urban traffic

Geroliminis, N., Sun, J., 2011 b . Properties of a well-defined macroscopic fundamental diagram for urban traffic. Transportation Research Part B: Methodological 45 (3), 605--617

work page 2011
[20]

Discrete adjoint gradient computation for multiclass traffic flow models on road networks

Goatin, P., Klar, A., Mezquita-Nieto, C., 2026. Discrete adjoint gradient computation for multiclass traffic flow models on road networks. arXiv preprint arXiv:2604.00670

work page arXiv 2026
[21]

E., 1989

Goldberg, D. E., 1989. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley

work page 1989
[22]

Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation, 2nd Edition

Griewank, A., Walther, A., 2008. Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation, 2nd Edition. SIAM

work page 2008
[23]

DiffTaichi : Differentiable programming for physical simulation

Hu, Y., Anderson, L., Li, T.-M., Sun, Q., Carr, N., Ragan-Kelley, J., Durand, F., 2020. DiffTaichi : Differentiable programming for physical simulation. In: International Conference on Learning Representations (ICLR)

work page 2020
[24]

Properties of dynamic user equilibrium solution: existence, uniqueness, stability, and robust solution methodology

Iryo, T., 2013. Properties of dynamic user equilibrium solution: existence, uniqueness, stability, and robust solution methodology. Transportmetrica B: Transport Dynamics 1 (1), 52--67

work page 2013
[25]

P., Ba, J., 2015

Kingma, D. P., Ba, J., 2015. Adam: A method for stochastic optimization. In: International Conference on Learning Representations (ICLR)

work page 2015
[26]

Dynamic user optimal assignment with physical queues for a many-to-many OD pattern

Kuwahara, M., Akamatsu, T., 2001. Dynamic user optimal assignment with physical queues for a many-to-many OD pattern. Transportation Research Part B: Methodological 35 (5), 461--479

work page 2001
[27]

P., 1996

Lebacque, J. P., 1996. The G odunov scheme and what it means for first order traffic flow models. In: Lesort, J. B. (Ed.), Proceedings of the 13th International Symposium on Transportation and Traffic Theory. Elsevier, pp. 647--677

work page 1996
[28]

Traffic assignment as a differentiable program

Li, J., Nie, M., 2026. Traffic assignment as a differentiable program. Available at SSRN

work page 2026
[29]

J., Whitham, G

Lighthill, M. J., Whitham, G. B., 1955. On kinematic waves. II . a theory of traffic flow on long crowded roads. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences 229 (1178), 317--345

work page 1955
[30]

K., 2023

Liu, Z., Yin, Y., Bai, F., Grimm, D. K., 2023. End-to-end learning of user equilibrium with implicit neural networks. Transportation Research Part C: Emerging Technologies 150, 104085

work page 2023
[31]

An enhanced SPSA algorithm for the calibration of dynamic traffic assignment models

Lu, L., Xu, Y., Antoniou, C., Ben-Akiva, M., 2015. An enhanced SPSA algorithm for the calibration of dynamic traffic assignment models. Transportation Research Part C: Emerging Technologies 51, 149--166

work page 2015
[32]

Incorporating graph neural network into route choice model

Ma, Y., Seo, T., 2025. Incorporating graph neural network into route choice model. arXiv preprint arXiv:2503.02315

work page arXiv 2025
[33]

S., Williams, J

Mahmassani, H. S., Williams, J. C., Herman, R., 1984. Investigation of network-level traffic flow relationships: some simulation results. Transportation Research Record 971, 121--130

work page 1984
[34]

F., Rothery, R., 1971

Makigami, Y., Newell, G. F., Rothery, R., 1971. Three-dimensional representation of traffic flow. Transportation Science 5 (3), 302--313

work page 1971
[35]

Ultra-fast traffic nowcasting and control via differentiable agent-based simulation

Makinoshima, F., Yamaguchi, Y., Segawa, E., Niinuma, K., Qian, S., 2026. Ultra-fast traffic nowcasting and control via differentiable agent-based simulation. arXiv preprint arXiv: 2603.25068

work page arXiv 2026
[36]

E., 2010

Nair, V., Hinton, G. E., 2010. Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th international conference on machine learning. pp. 807--814

work page 2010
[37]

F., 1993 a

Newell, G. F., 1993 a . A simplified theory of kinematic waves in highway traffic, part I : General theory. Transportation Research Part B: Methodological 27 (4), 281--287

work page 1993
[38]

F., 1993 b

Newell, G. F., 1993 b . A simplified theory of kinematic waves in highway traffic, part II : Queueing at freeway bottlenecks. Transportation Research Part B: Methodological 27 (4), 289--303

work page 1993
[39]

F., 1993 c

Newell, G. F., 1993 c . A simplified theory of kinematic waves in highway traffic, part III : Multi-destination flows. Transportation Research Part B: Methodological 27 (4), 305--313

work page 1993
[40]

A computationally efficient simulation-based optimization algorithm for large-scale urban transportation problems

Osorio, C., Chong, L., 2015. A computationally efficient simulation-based optimization algorithm for large-scale urban transportation problems. Transportation Science 49 (3), 623--636

work page 2015
[41]

Dynamic network loading: A stochastic differentiable model that derives link state distributions

Osorio, C., Flotterod, G., Bierlaire, M., 2011. Dynamic network loading: A stochastic differentiable model that derives link state distributions. Transportation Research Part B: Methodological 45 (9), 1410--1423

work page 2011
[42]

Traffic simulation with METANET

Papageorgiou, M., Papamichail, I., Messmer, A., Wang, Y., 2010. Traffic simulation with METANET . In: Fundamentals of traffic simulation. Springer, pp. 399--430

work page 2010
[43]

S., Boltyanskii, V

Pontryagin, L. S., Boltyanskii, V. G., Gamkrelidze, R. V., Mishchenko, E. F., 1962. The mathematical theory of optimal processes. Interscience Publishers, New York

work page 1962
[44]

Second order macroscopic traffic flow model validation using automatic differentiation with resilient backpropagation and particle swarm optimisation algorithms

Poole, A., Kotsialos, A., 2016. Second order macroscopic traffic flow model validation using automatic differentiation with resilient backpropagation and particle swarm optimisation algorithms. Transportation Research Part C: Emerging Technologies 71, 356--381

work page 2016
[45]

The Python Language Reference

Python Software Foundation , 2022. The Python Language Reference

work page 2022
[46]

E., LeBlanc, L

Ran, B., Boyce, D. E., LeBlanc, L. J., 1993. A new class of instantaneous dynamic user-optimal traffic assignment models. Operations Research 41 (1), 192--202

work page 1993
[47]

L., Krichene, W., Goatin, P., Bayen, A

Reilly, J., Samaranayake, S., Delle Monache, M. L., Krichene, W., Goatin, P., Bayen, A. M., 2015. Adjoint-based optimization on a network of discretized scalar conservation laws with applications to coordinated ramp metering. Journal of optimization theory and applications 167 (2), 733--760

work page 2015
[48]

I., 1956

Richards, P. I., 1956. Shock waves on the highway. Operations Research 4 (1), 42--51

work page 1956
[49]

Macroscopic Traffic Flow Simulation: Fundamental Mathematical Theory and Python Implementation

Seo, T., 2023. Macroscopic Traffic Flow Simulation: Fundamental Mathematical Theory and Python Implementation. Corona Publishing Co., Ltd., (in Japanese)

work page 2023
[50]

UXsim : lightweight mesoscopic traffic flow simulator in pure Python

Seo, T., 2025. UXsim : lightweight mesoscopic traffic flow simulator in pure Python . Journal of Open Source Software 10 (106), 7617

work page 2025
[51]

A physics-informed deep learning paradigm for traffic state and fundamental diagram estimation

Shi, R., Mo, Z., Huang, K., Di, X., Du, Q., 2022. A physics-informed deep learning paradigm for traffic state and fundamental diagram estimation. IEEE Transactions on Intelligent Transportation Systems 23 (8), 11688--11698

work page 2022
[52]

Enhancing discrete choice models with representation learning

Sifringer, B., Lurkin, V., Alahi, A., 2020. Enhancing discrete choice models with representation learning. Transportation Research Part B: Methodological 140, 236--261

work page 2020
[53]

Smits, E.-S., Bliemer, M. C. J., Pel, A. J., van Arem, B., 2015. A family of macroscopic node models. Transportation Research Part B: Methodological 74, 20--39

work page 2015
[54]

C., 2022

Son, S., Qiao, Y., Sewall, J., Lin, M. C., 2022. Differentiable hybrid traffic simulation. ACM Transactions on Graphics 41 (6), 1--14

work page 2022
[55]

C., 2025

Son, S., Zheng, L., Clipp, B., Greenwell, C., Philip, S., Lin, M. C., 2025. Gradient-based trajectory optimization with parallelized differentiable traffic simulation. In: 2025 IEEE International Conference on Robotics and Automation. pp. 14497--14504

work page 2025
[56]

L., del Castillo, E., 2017

Song, W., Han, K., Wang, Y., Friesz, T. L., del Castillo, E., 2017. Statistical metamodeling of dynamic network loading. Transportation Research Part B: Methodological

work page 2017
[57]

C., 1998

Spall, J. C., 1998. Implementation of the simultaneous perturbation algorithm for stochastic optimization. IEEE Transactions on Aerospace and Electronic Systems 34 (3), 817--823

work page 1998
[58]

C., 2002

Spall, J. C., 2002. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation. IEEE transactions on automatic control 37 (3), 332--341

work page 2002
[59]

Y., Lo, H

Szeto, W. Y., Lo, H. K., 2006. Dynamic traffic assignment: properties and extensions. Transportmetrica 2 (1), 31--52

work page 2006
[60]

Tamp \`e re, C. M. J., Corthout, R., Cattrysse, D., Immers, L. H., 2011. A generic class of first order node models for dynamic macroscopic simulation of traffic flows. Transportation Research Part B: Methodological 45 (1), 289--309

work page 2011
[61]

Transportation networks for researchAccessed 2021-10-01

Transportation Networks for Research Core Team , 2021. Transportation networks for researchAccessed 2021-10-01

work page 2021
[62]

A short note on the link transmission model

Wada, K., Jin, W., 2017. A short note on the link transmission model. Working Paper on ResearchGate

work page 2017
[63]

G., 1952

Wardrop, J. G., 1952. Some theoretical aspects of road traffic research. Proceedings of the Institution of Civil Engineers 1 (3), 325--362

work page 1952
[64]

The link transmission model for dynamic network loading

Yperman, I., 2007. The link transmission model for dynamic network loading. Ph.D. thesis, Katholieke Universiteit Leuven

work page 2007
[65]

M., Immers, B., 2006

Yperman, I., Logghe, S., Tampere, C. M., Immers, B., 2006. The multicommodity link transmission model for dynamic network loading. In: Transportation Research Board 85th Annual Meeting

work page 2006

[1] [1]

Towards differentiable agent-based simulation

Andelfinger, P., 2023. Towards differentiable agent-based simulation. ACM Transactions on Modeling and Computer Simulation 33 (1--2), 1--26

work page 2023

[2] [2]

N., 2007

Balakrishna, R., Ben-Akiva, M., Koutsopoulos, H. N., 2007. Offline calibration of dynamic traffic assignment: Simultaneous demand-and-supply estimation. Transportation Research Record 2003, 50--58

work page 2007

[3] [3]

G., Pearlmutter, B

Baydin, A. G., Pearlmutter, B. A., Radul, A. A., Siskind, J. M., 2018. Automatic differentiation in machine learning: a survey. Journal of Machine Learning Research 18 (153), 1--43

work page 2018

[4] [4]

On a routing problem

Bellman, R., 1958. On a routing problem. Quarterly of applied mathematics 16 (1), 87--90

work page 1958

[5] [5]

Discrete choice methods and their applications to short term travel decisions

Ben-Akiva, M., Bierlaire, M., 1999. Discrete choice methods and their applications to short term travel decisions. In: Hall, R. (Ed.), Handbook of Transportation Science. Springer, pp. 5--33

work page 1999

[6] [6]

J., Leary, C., Maclaurin, D., Necula, G., Paszke, A., VanderPlas, J., Wanderman-Milne, S., Zhang, Q., 2018

Bradbury, J., Frostig, R., Hawkins, P., Johnson, M. J., Leary, C., Maclaurin, D., Necula, G., Paszke, A., VanderPlas, J., Wanderman-Milne, S., Zhang, Q., 2018. JAX : composable transformations of Python+NumPy programs. http://github.com/jax-ml/jax

work page 2018

[7] [7]

Chen, R. T. Q., Rubanova, Y., Bettencourt, J., Duvenaud, D., 2018. Neural ordinary differential equations. In: Advances in Neural Information Processing Systems (NeurIPS)

work page 2018

[8] [8]

A simulation-based optimization algorithm for dynamic large-scale urban transportation problems

Chong, L., Osorio, C., 2018. A simulation-based optimization algorithm for dynamic large-scale urban transportation problems. Transportation Science 52 (3), 637--656

work page 2018

[9] [9]

F., 1994

Daganzo, C. F., 1994. The cell transmission model: A dynamic representation of highway traffic consistent with the hydrodynamic theory. Transportation Research Part B: Methodological 28 (4), 269--287

work page 1994

[10] [10]

F., 1995

Daganzo, C. F., 1995. The cell transmission model, part II : Network traffic. Transportation Research Part B: Methodological 29 (2), 79--93

work page 1995

[11] [11]

L., 2024

Dantsuji, T., Ngoduy, D., Pu, Z., Lee, S., Vu, H. L., 2024. A hybrid neural network for real-time OD demand calibration under disruptions. arXiv preprint arXiv:2408.06659

work page arXiv 2024

[12] [12]

K., 2025

Du, K., Lee, E., Ma, Q., Su, Z., Zhang, S., Lo, H. K., 2025. Modeling metro passenger routing choices with a fully differentiable end-to-end simulation-based optimization ( SBO ) approach. Transportation Science 59 (4), 802--822

work page 2025

[13] [13]

o tter \

Fl \"o tter \"o d, G., Rohde, J., 2011. Operational macroscopic modeling of complex urban road intersections. Transportation Research Part B: Methodological 45 (6), 903--922

work page 2011

[14] [14]

R., 1956

Ford, L. R., 1956. Network flow theory. RAND Corporation Paper, Santa Monica, 1956

work page 1956

[15] [15]

L., Bernstein, D., Smith, T

Friesz, T. L., Bernstein, D., Smith, T. E., Tobin, R. L., Wie, B.-W., 1993. A variational inequality formulation of the dynamic network user equilibrium problem. Operations Research 41 (1), 179--191

work page 1993

[16] [16]

Managing network congestion with link-based incentives: A surrogate-based optimization approach

Fu, Q., Wu, J., Wu, X., Sun, J., Tian, Y., 2024. Managing network congestion with link-based incentives: A surrogate-based optimization approach. Transportation Research Part A: Policy and Practice 182, 104033

work page 2024

[17] [17]

F., 2007

Geroliminis, N., Daganzo, C. F., 2007. Macroscopic modeling of traffic in cities. In: Transportation Research Board 86th Annual Meeting

work page 2007

[18] [18]

Hysteresis phenomena of a macroscopic fundamental diagram in freeway networks

Geroliminis, N., Sun, J., 2011 a . Hysteresis phenomena of a macroscopic fundamental diagram in freeway networks. Transportation Research Part A: Policy and Practice 45 (9), 966--979

work page 2011

[19] [19]

Properties of a well-defined macroscopic fundamental diagram for urban traffic

Geroliminis, N., Sun, J., 2011 b . Properties of a well-defined macroscopic fundamental diagram for urban traffic. Transportation Research Part B: Methodological 45 (3), 605--617

work page 2011

[20] [20]

Discrete adjoint gradient computation for multiclass traffic flow models on road networks

Goatin, P., Klar, A., Mezquita-Nieto, C., 2026. Discrete adjoint gradient computation for multiclass traffic flow models on road networks. arXiv preprint arXiv:2604.00670

work page arXiv 2026

[21] [21]

E., 1989

Goldberg, D. E., 1989. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley

work page 1989

[22] [22]

Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation, 2nd Edition

Griewank, A., Walther, A., 2008. Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation, 2nd Edition. SIAM

work page 2008

[23] [23]

DiffTaichi : Differentiable programming for physical simulation

Hu, Y., Anderson, L., Li, T.-M., Sun, Q., Carr, N., Ragan-Kelley, J., Durand, F., 2020. DiffTaichi : Differentiable programming for physical simulation. In: International Conference on Learning Representations (ICLR)

work page 2020

[24] [24]

Properties of dynamic user equilibrium solution: existence, uniqueness, stability, and robust solution methodology

Iryo, T., 2013. Properties of dynamic user equilibrium solution: existence, uniqueness, stability, and robust solution methodology. Transportmetrica B: Transport Dynamics 1 (1), 52--67

work page 2013

[25] [25]

P., Ba, J., 2015

Kingma, D. P., Ba, J., 2015. Adam: A method for stochastic optimization. In: International Conference on Learning Representations (ICLR)

work page 2015

[26] [26]

Dynamic user optimal assignment with physical queues for a many-to-many OD pattern

Kuwahara, M., Akamatsu, T., 2001. Dynamic user optimal assignment with physical queues for a many-to-many OD pattern. Transportation Research Part B: Methodological 35 (5), 461--479

work page 2001

[27] [27]

P., 1996

Lebacque, J. P., 1996. The G odunov scheme and what it means for first order traffic flow models. In: Lesort, J. B. (Ed.), Proceedings of the 13th International Symposium on Transportation and Traffic Theory. Elsevier, pp. 647--677

work page 1996

[28] [28]

Traffic assignment as a differentiable program

Li, J., Nie, M., 2026. Traffic assignment as a differentiable program. Available at SSRN

work page 2026

[29] [29]

J., Whitham, G

Lighthill, M. J., Whitham, G. B., 1955. On kinematic waves. II . a theory of traffic flow on long crowded roads. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences 229 (1178), 317--345

work page 1955

[30] [30]

K., 2023

Liu, Z., Yin, Y., Bai, F., Grimm, D. K., 2023. End-to-end learning of user equilibrium with implicit neural networks. Transportation Research Part C: Emerging Technologies 150, 104085

work page 2023

[31] [31]

An enhanced SPSA algorithm for the calibration of dynamic traffic assignment models

Lu, L., Xu, Y., Antoniou, C., Ben-Akiva, M., 2015. An enhanced SPSA algorithm for the calibration of dynamic traffic assignment models. Transportation Research Part C: Emerging Technologies 51, 149--166

work page 2015

[32] [32]

Incorporating graph neural network into route choice model

Ma, Y., Seo, T., 2025. Incorporating graph neural network into route choice model. arXiv preprint arXiv:2503.02315

work page arXiv 2025

[33] [33]

S., Williams, J

Mahmassani, H. S., Williams, J. C., Herman, R., 1984. Investigation of network-level traffic flow relationships: some simulation results. Transportation Research Record 971, 121--130

work page 1984

[34] [34]

F., Rothery, R., 1971

Makigami, Y., Newell, G. F., Rothery, R., 1971. Three-dimensional representation of traffic flow. Transportation Science 5 (3), 302--313

work page 1971

[35] [35]

Ultra-fast traffic nowcasting and control via differentiable agent-based simulation

Makinoshima, F., Yamaguchi, Y., Segawa, E., Niinuma, K., Qian, S., 2026. Ultra-fast traffic nowcasting and control via differentiable agent-based simulation. arXiv preprint arXiv: 2603.25068

work page arXiv 2026

[36] [36]

E., 2010

Nair, V., Hinton, G. E., 2010. Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th international conference on machine learning. pp. 807--814

work page 2010

[37] [37]

F., 1993 a

Newell, G. F., 1993 a . A simplified theory of kinematic waves in highway traffic, part I : General theory. Transportation Research Part B: Methodological 27 (4), 281--287

work page 1993

[38] [38]

F., 1993 b

Newell, G. F., 1993 b . A simplified theory of kinematic waves in highway traffic, part II : Queueing at freeway bottlenecks. Transportation Research Part B: Methodological 27 (4), 289--303

work page 1993

[39] [39]

F., 1993 c

Newell, G. F., 1993 c . A simplified theory of kinematic waves in highway traffic, part III : Multi-destination flows. Transportation Research Part B: Methodological 27 (4), 305--313

work page 1993

[40] [40]

A computationally efficient simulation-based optimization algorithm for large-scale urban transportation problems

Osorio, C., Chong, L., 2015. A computationally efficient simulation-based optimization algorithm for large-scale urban transportation problems. Transportation Science 49 (3), 623--636

work page 2015

[41] [41]

Dynamic network loading: A stochastic differentiable model that derives link state distributions

Osorio, C., Flotterod, G., Bierlaire, M., 2011. Dynamic network loading: A stochastic differentiable model that derives link state distributions. Transportation Research Part B: Methodological 45 (9), 1410--1423

work page 2011

[42] [42]

Traffic simulation with METANET

Papageorgiou, M., Papamichail, I., Messmer, A., Wang, Y., 2010. Traffic simulation with METANET . In: Fundamentals of traffic simulation. Springer, pp. 399--430

work page 2010

[43] [43]

S., Boltyanskii, V

Pontryagin, L. S., Boltyanskii, V. G., Gamkrelidze, R. V., Mishchenko, E. F., 1962. The mathematical theory of optimal processes. Interscience Publishers, New York

work page 1962

[44] [44]

Second order macroscopic traffic flow model validation using automatic differentiation with resilient backpropagation and particle swarm optimisation algorithms

Poole, A., Kotsialos, A., 2016. Second order macroscopic traffic flow model validation using automatic differentiation with resilient backpropagation and particle swarm optimisation algorithms. Transportation Research Part C: Emerging Technologies 71, 356--381

work page 2016

[45] [45]

The Python Language Reference

Python Software Foundation , 2022. The Python Language Reference

work page 2022

[46] [46]

E., LeBlanc, L

Ran, B., Boyce, D. E., LeBlanc, L. J., 1993. A new class of instantaneous dynamic user-optimal traffic assignment models. Operations Research 41 (1), 192--202

work page 1993

[47] [47]

L., Krichene, W., Goatin, P., Bayen, A

Reilly, J., Samaranayake, S., Delle Monache, M. L., Krichene, W., Goatin, P., Bayen, A. M., 2015. Adjoint-based optimization on a network of discretized scalar conservation laws with applications to coordinated ramp metering. Journal of optimization theory and applications 167 (2), 733--760

work page 2015

[48] [48]

I., 1956

Richards, P. I., 1956. Shock waves on the highway. Operations Research 4 (1), 42--51

work page 1956

[49] [49]

Macroscopic Traffic Flow Simulation: Fundamental Mathematical Theory and Python Implementation

Seo, T., 2023. Macroscopic Traffic Flow Simulation: Fundamental Mathematical Theory and Python Implementation. Corona Publishing Co., Ltd., (in Japanese)

work page 2023

[50] [50]

UXsim : lightweight mesoscopic traffic flow simulator in pure Python

Seo, T., 2025. UXsim : lightweight mesoscopic traffic flow simulator in pure Python . Journal of Open Source Software 10 (106), 7617

work page 2025

[51] [51]

A physics-informed deep learning paradigm for traffic state and fundamental diagram estimation

Shi, R., Mo, Z., Huang, K., Di, X., Du, Q., 2022. A physics-informed deep learning paradigm for traffic state and fundamental diagram estimation. IEEE Transactions on Intelligent Transportation Systems 23 (8), 11688--11698

work page 2022

[52] [52]

Enhancing discrete choice models with representation learning

Sifringer, B., Lurkin, V., Alahi, A., 2020. Enhancing discrete choice models with representation learning. Transportation Research Part B: Methodological 140, 236--261

work page 2020

[53] [53]

Smits, E.-S., Bliemer, M. C. J., Pel, A. J., van Arem, B., 2015. A family of macroscopic node models. Transportation Research Part B: Methodological 74, 20--39

work page 2015

[54] [54]

C., 2022

Son, S., Qiao, Y., Sewall, J., Lin, M. C., 2022. Differentiable hybrid traffic simulation. ACM Transactions on Graphics 41 (6), 1--14

work page 2022

[55] [55]

C., 2025

Son, S., Zheng, L., Clipp, B., Greenwell, C., Philip, S., Lin, M. C., 2025. Gradient-based trajectory optimization with parallelized differentiable traffic simulation. In: 2025 IEEE International Conference on Robotics and Automation. pp. 14497--14504

work page 2025

[56] [56]

L., del Castillo, E., 2017

Song, W., Han, K., Wang, Y., Friesz, T. L., del Castillo, E., 2017. Statistical metamodeling of dynamic network loading. Transportation Research Part B: Methodological

work page 2017

[57] [57]

C., 1998

Spall, J. C., 1998. Implementation of the simultaneous perturbation algorithm for stochastic optimization. IEEE Transactions on Aerospace and Electronic Systems 34 (3), 817--823

work page 1998

[58] [58]

C., 2002

Spall, J. C., 2002. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation. IEEE transactions on automatic control 37 (3), 332--341

work page 2002

[59] [59]

Y., Lo, H

Szeto, W. Y., Lo, H. K., 2006. Dynamic traffic assignment: properties and extensions. Transportmetrica 2 (1), 31--52

work page 2006

[60] [60]

Tamp \`e re, C. M. J., Corthout, R., Cattrysse, D., Immers, L. H., 2011. A generic class of first order node models for dynamic macroscopic simulation of traffic flows. Transportation Research Part B: Methodological 45 (1), 289--309

work page 2011

[61] [61]

Transportation networks for researchAccessed 2021-10-01

Transportation Networks for Research Core Team , 2021. Transportation networks for researchAccessed 2021-10-01

work page 2021

[62] [62]

A short note on the link transmission model

Wada, K., Jin, W., 2017. A short note on the link transmission model. Working Paper on ResearchGate

work page 2017

[63] [63]

G., 1952

Wardrop, J. G., 1952. Some theoretical aspects of road traffic research. Proceedings of the Institution of Civil Engineers 1 (3), 325--362

work page 1952

[64] [64]

The link transmission model for dynamic network loading

Yperman, I., 2007. The link transmission model for dynamic network loading. Ph.D. thesis, Katholieke Universiteit Leuven

work page 2007

[65] [65]

M., Immers, B., 2006

Yperman, I., Logghe, S., Tampere, C. M., Immers, B., 2006. The multicommodity link transmission model for dynamic network loading. In: Transportation Research Board 85th Annual Meeting

work page 2006