Toward Cooperative Driving in Mixed Traffic: An Adaptive Potential Game-Based Approach with Field Test Verification
Pith reviewed 2026-05-10 00:52 UTC · model grok-4.3
The pith
An adaptive potential game framework lets autonomous vehicles cooperate with human drivers by aligning individual goals with system safety and efficiency while refining preference estimates from observed behavior.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The adaptive potential game (APG) establishes a system utility function from the general form of individual utilities and their monotonic relationship to system objectives. It applies the Shapley value to compute each vehicle's marginal utility and refines HDV preference estimates in real time by comparing observed behavior to APG-predicted actions, yielding measurable improvements in mixed-traffic safety and efficiency.
What carries the argument
Adaptive potential game (APG) that uses monotonic individual-to-system utility mapping, Shapley-value marginal contribution calculation, and online comparison of observed HDV actions to estimated actions for preference refinement.
If this is right
- Individual vehicle objectives and global traffic safety and efficiency can be optimized at the same time.
- Shapley-value weighting lets each vehicle's contribution to cooperation be quantified and used in decision-making.
- Continuous preference refinement raises the rate at which cooperative maneuvers succeed in mixed fleets.
- The same structure supports real-time deployment, as shown by successful field tests on public roads.
Where Pith is reading between the lines
- The framework could be tested in denser urban intersections or on highways with frequent lane changes to check whether the monotonicity assumption still holds under higher interaction density.
- Integration with existing perception modules would allow the preference estimator to run on raw sensor data rather than requiring perfect state information.
- If the preference update rule proves stable across different driver populations, the method could be transferred to regions with distinct driving cultures without full retraining.
Load-bearing premise
A monotonic relationship holds between each vehicle's individual utility and overall system objectives, and observed human driver behavior can be compared reliably to the model's estimated actions to refine preferences.
What would settle it
A controlled field test in which the APG method produces no measurable gain in collision-avoidance rates or average travel time compared with a non-adaptive game baseline when human preferences are updated from observed actions.
Figures
read the original abstract
Connected autonomous vehicles (CAVs), which represent a significant advancement in autonomous driving technology, have the potential to greatly increase traffic safety and efficiency through cooperative decision-making. However, existing methods often overlook the individual needs and heterogeneity of cooperative participants, making it difficult to transfer them to environments where they coexist with human-driven vehicles (HDVs).To address this challenge, this paper proposes an adaptive potential game (APG) cooperative driving framework. First, the system utility function is established on the basis of a general form of individual utility and its monotonic relationship, allowing for the simultaneous optimization of both individual and system objectives. Second, the Shapley value is introduced to compute each vehicle's marginal utility within the system, allowing its varying impact to be quantified. Finally, the HDV preference estimation is dynamically refined by continuously comparing the observed HDV behavior with the APG's estimated actions, leading to improvements in overall system safety and efficiency. Ablation studies demonstrate that adaptively updating Shapley values and HDV preference estimation significantly improve cooperation success rates in mixed traffic. Comparative experiments further highlight the APG's advantages in terms of safety and efficiency over other cooperative methods. Moreover, the applicability of the approach to real-world scenarios was validated through field tests.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes an adaptive potential game (APG) framework for cooperative decision-making among connected autonomous vehicles (CAVs) and human-driven vehicles (HDVs) in mixed traffic. It defines a system utility function from individual utilities via a claimed monotonic relationship to enable joint optimization, incorporates Shapley values to quantify each vehicle's marginal contribution, and dynamically refines HDV preference estimates online by comparing observed trajectories against actions computed by the current APG model. Ablation studies, comparative simulations, and field tests are presented to claim improvements in safety and efficiency metrics over baselines.
Significance. If the central claims hold, the work offers a practical extension of potential games to heterogeneous mixed-traffic settings, with explicit handling of HDV variability through online adaptation. The inclusion of field-test verification and ablation results demonstrating benefits from adaptive Shapley-value and preference updates are concrete strengths that distinguish it from purely simulation-based game-theoretic approaches. Successful validation could inform deployable CAV coordination algorithms that respect individual vehicle objectives while improving system-level outcomes.
major comments (2)
- [§4.2] §4.2 (HDV Preference Refinement): The online update compares observed HDV trajectories to actions selected by the current APG model and treats the latter as reference for preference correction. This step is load-bearing for the adaptivity claim, yet no convergence proof, Lyapunov-style stability argument, or sensitivity analysis under realistic sensor noise or initial preference mismatch is supplied; the risk of error reinforcement noted in the skeptic's analysis therefore remains unaddressed.
- [§3.1] §3.1 (System Utility Construction): The assertion that a monotonic mapping between individual utilities and system utility permits simultaneous optimization is central to the joint-optimization guarantee, but the manuscript provides neither an explicit functional form for the monotonicity nor a proof that the Shapley-value allocation preserves the claimed monotonicity under the heterogeneous utility functions used for HDVs.
minor comments (2)
- [Table 2, Figure 5] Table 2 and Figure 5: error bars or standard deviations are omitted from the reported safety and efficiency metrics, making it difficult to assess statistical significance of the claimed improvements over baselines.
- [§3.3] Notation: the symbol for the adaptive preference vector is introduced without an explicit update equation; readers must infer the precise recursion from surrounding prose.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which help clarify the theoretical and practical aspects of our adaptive potential game framework. We address each major comment below and outline the revisions planned for the next version of the manuscript.
read point-by-point responses
-
Referee: [§4.2] §4.2 (HDV Preference Refinement): The online update compares observed HDV trajectories to actions selected by the current APG model and treats the latter as reference for preference correction. This step is load-bearing for the adaptivity claim, yet no convergence proof, Lyapunov-style stability argument, or sensitivity analysis under realistic sensor noise or initial preference mismatch is supplied; the risk of error reinforcement noted in the skeptic's analysis therefore remains unaddressed.
Authors: We acknowledge that the manuscript lacks a formal convergence proof or Lyapunov-style stability analysis for the HDV preference refinement. The update rule is presented as a practical online mechanism whose benefits are shown empirically via ablation studies, comparative simulations, and field tests. To directly address robustness concerns, including sensor noise and potential error reinforcement, we will add a dedicated sensitivity analysis section with new simulation results under noisy observations and varying initial preference mismatches. This empirical strengthening will be included in the revision, while noting that a complete theoretical guarantee remains an open direction for future work. revision: partial
-
Referee: [§3.1] §3.1 (System Utility Construction): The assertion that a monotonic mapping between individual utilities and system utility permits simultaneous optimization is central to the joint-optimization guarantee, but the manuscript provides neither an explicit functional form for the monotonicity nor a proof that the Shapley-value allocation preserves the claimed monotonicity under the heterogeneous utility functions used for HDVs.
Authors: Section 3.1 defines the system utility via a general monotonic relationship to the aggregate of individual utilities, enabling the claimed joint optimization. We agree that an explicit functional form and a demonstration that Shapley-value marginal contributions preserve monotonicity for heterogeneous HDV utilities would improve rigor. In the revised manuscript we will specify the exact transformation employed (a scaled sum that maintains monotonicity) and include a short proof sketch showing that the Shapley allocation inherits the monotonic property under the utility heterogeneity considered. revision: yes
Circularity Check
No significant circularity in the APG derivation chain
full rationale
The paper constructs the system utility from a general individual utility via an explicit monotonic relationship, which is the standard definition enabling potential-game equivalence rather than a self-referential loop. Shapley values are invoked as an external cooperative-game tool to quantify marginal contributions. The HDV preference update compares observed trajectories against current APG actions in an online refinement loop; this is an iterative estimator, not a definitional reduction where the 'prediction' is forced to equal the fitted input by construction. Ablation studies, comparative experiments, and field-test validation supply independent empirical checks outside the derivation. No quoted equation or step reduces the claimed output to the input tautologically.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption A monotonic relationship exists between individual utility and system utility that allows simultaneous optimization of both.
invented entities (1)
-
Adaptive Potential Game (APG) framework
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Limitations and improvements of the intelligent driver model (idm)
Albeaik, S., Bayen, A., Chiri, M.T., Gong, X., Hayat, A., Kardous, N., Keimer, A., Mc- Quade, S.T., Piccoli, B., You, Y., 2022. Limitations and improvements of the intelligent driver model (idm). SIAM Journal on Applied Dynamical Systems 21, 1862–1892
work page 2022
-
[2]
Arfvidsson, K.M., Jiang, F.J., Johansson, K.H., Mårtensson, J., 2024. Towards safe autonomous intersection management: Temporal logic-based safety filters for vehicle coordination, in: 2024IEEE27thInternationalConferenceonIntelligentTransportation Systems (ITSC), IEEE. pp. 895–901
work page 2024
-
[3]
Azimi, R., Bhatia, G., Rajkumar, R.R., Mudalige, P., 2014. Stip: Spatio-temporal inter- section protocols for autonomous vehicles, in: 2014 ACM/IEEE international conference on cyber-physical systems (ICCPS), IEEE. pp. 1–12
work page 2014
-
[4]
Camerer, C.F., Fehr, E., 2006. When does" economic man" dominate social behavior? science 311, 47–52
work page 2006
-
[5]
Chandra, R., Manocha, D., 2022. Gameplan: Game-theoretic multi-agent planning with human drivers at intersections, roundabouts, and merging. IEEE Robotics and Automation Letters 7, 2676–2683
work page 2022
-
[6]
Chen, C., Xu, Q., Cai, M., Wang, J., Wang, J., Li, K., 2022. Conflict-free cooperation method for connected and automated vehicles at unsignalized intersections: Graph- based modeling and optimality analysis. IEEE Transactions on Intelligent Transporta- tion Systems 23, 21897–21914
work page 2022
-
[7]
Chen, T., Wang, M., Gong, S., Zhou, Y., Ran, B., 2021. Connected and automated vehicle distributed control for on-ramp merging scenario: A virtual rotation approach. Transportation Research Part C: Emerging Technologies 133, 103451
work page 2021
-
[8]
Game theory in internet of things: A survey
Chi, C., Wang, Y., Tong, X., Siddula, M., Cai, Z., 2021. Game theory in internet of things: A survey. IEEE Internet of Things Journal 9, 12125–12146
work page 2021
-
[9]
Cooper, P., 1984. Experience with traffic conflicts in canada with emphasis on “post encroachment time” techniques, in: International calibration study of traffic conflict techniques. Springer, pp. 75–96. 26
work page 1984
-
[10]
Introduction to the k-means clustering algorithm based on the elbow method
Cui, M., et al., 2020. Introduction to the k-means clustering algorithm based on the elbow method. Accounting, Auditing and Finance 1, 5–8
work page 2020
-
[11]
A vehicle-infrastructure multi-layer co- operative decision-making framework
Cui, Y., Fang, S., Hang, P., Sun, J., 2025. A vehicle-infrastructure multi-layer co- operative decision-making framework. URL:https://arxiv.org/abs/2503.16552, arXiv:2503.16552
-
[12]
Fang, S., Hang, P., Wei, C., Xing, Y., Sun, J., 2024. Cooperative driving of connected autonomous vehicles in heterogeneous mixed traffic: A game theoretic approach. IEEE Transactions on Intelligent Vehicles , 1–15doi:10.1109/TIV.2024.3399694
-
[13]
Fang, S., Liu, J., Ding, M., Cui, Y., Lv, C., Hang, P., Sun, J., 2025. Towards interactive and learnable cooperative driving automation: a large language model-driven decision- making framework. IEEE Transactions on Vehicular Technology
work page 2025
-
[14]
Garg, M., Bouroche, M., 2023. Can connected autonomous vehicles improve mixed traf- fic safety without compromising efficiency in realistic scenarios? IEEE Transactions on Intelligent Transportation Systems 24, 6674–6689. doi:10.1109/TITS.2023.3238889
-
[15]
Hang, P., Huang, C., Hu, Z., Xing, Y., Lv, C., 2021a. Decision making of connected automated vehicles at an unsignalized roundabout considering personalized driving be- haviours. IEEE Transactions on Vehicular Technology 70, 4051–4064
-
[16]
Hang, P., Lv, C., Huang, C., Xing, Y., Hu, Z., 2021b. Cooperative decision making of connected automated vehicles at multi-lane merging zone: A coalitional game approach. IEEE Transactions on Intelligent Transportation Systems 23, 3829–3841
-
[17]
Trajectory optimization of connected and autonomous vehi- cles at a multilane freeway merging area
Hu, X., Sun, J., 2019. Trajectory optimization of connected and autonomous vehi- cles at a multilane freeway merging area. Transportation Research Part C: Emerging Technologies 101, 111–125
work page 2019
-
[18]
Hu, Z., Huang, J., Yang, D., Zhong, Z., 2021. Constraint-tree-driven modeling and distributed robust control for multi-vehicle cooperation at unsignalized intersections. Transportation Research Part C: Emerging Technologies 131, 103353
work page 2021
-
[19]
Huang, H., Liu, J., Shi, G., Zhao, S., Li, B., Wang, J., 2024. Adaptive decision-making for autonomous vehicles: A learning-enhanced game-theoretic approach in interactive environments. arXiv preprint arXiv:2402.11467
-
[20]
Dynamic driving and routing games for au- tonomous vehicles on networks: A mean field game approach
Huang, K., Chen, X., Di, X., Du, Q., 2021. Dynamic driving and routing games for au- tonomous vehicles on networks: A mean field game approach. Transportation Research Part C: Emerging Technologies 128, 103189
work page 2021
-
[21]
Kang, J., Zhang, J., Yang, H., Ye, D., Hossain, M.S., 2024. When metaverses meet vehicle road cooperation: Multi-agent drl-based stackelberg game for vehicular twins migration. IEEE Internet of Things Journal . 27
work page 2024
-
[22]
Klimke, M., Völz, B., Buchholz, M., 2023. Automatic intersection management in mixed traffic using reinforcement learning and graph neural networks. arXiv preprint arXiv:2301.12717
-
[23]
Kurzer, K., Bitzer, M., Zöllner, J.M., 2022. Learning reward models for cooperative trajectory planning with inverse reinforcement learning and monte carlo tree search, in: 2022 IEEE Intelligent Vehicles Symposium (IV), IEEE. pp. 22–28
work page 2022
-
[24]
Li, L., Zhao, W., Wang, C., 2024. Cooperative merging strategy considering stochastic driving style at on-ramps: A bayesian game approach. Automotive Innovation 7, 312– 334
work page 2024
-
[25]
Liu, J., Hang, P., Na, X., Huang, C., Sun, J., 2024. Cooperative decision-making for cavs at unsignalized intersections: A marl approach with attention and hierarchical game priors. IEEE Transactions on Intelligent Transportation Systems
work page 2024
-
[26]
Potential game-based decision-making for autonomous driving
Liu, M., Kolmanovsky, I., Tseng, H.E., Huang, S., Filev, D., Girard, A., 2023. Potential game-based decision-making for autonomous driving. IEEE Transactions on Intelligent Transportation Systems , 1–14doi:10.1109/TITS.2023.3264665
-
[27]
Liu, Q., Liu, W., Jiang, R., Han, X., 2025. On the morning commute problem with mixed autonomous and human-driven traffic under stochastic bottleneck capac- ity. Transportation Research Part B: Methodological 195, 103203
work page 2025
-
[28]
Beijing autonomous vehicle road test report 2022.http://www.mzone
MZONE, 2022. Beijing autonomous vehicle road test report 2022.http://www.mzone. site/index.php/index/index/cid/2/sid/21.html
work page 2022
-
[29]
Peesapati, L.N., Hunter, M.P., Rodgers, M.O., 2018. Can post encroachment time substitute intersection characteristics in crash prediction models? Journal of safety research 66, 205–211
work page 2018
-
[30]
Social behavior for autonomous vehicles
Schwarting, W., Pierson, A., Alonso-Mora, J., Karaman, S., Rus, D., 2019. Social behavior for autonomous vehicles. Proceedings of the National Academy of Sciences 116, 24972–24978
work page 2019
- [31]
-
[32]
Shi, H., Zhou, Y., Wu, K., Wang, X., Lin, Y., Ran, B., 2021. Connected automated ve- hicle cooperative control with a deep reinforcement learning approach in a mixed traffic environment. Transportation Research Part C: Emerging Technologies 133, 103421
work page 2021
-
[33]
Influenceofconnectedandautonomousvehicles on traffic flow stability and throughput
Talebpour, A., Mahmassani, H.S., 2016. Influenceofconnectedandautonomousvehicles on traffic flow stability and throughput. Transportation research part C: emerging technologies 71, 143–163
work page 2016
-
[34]
Tian, R., Li, N., Kolmanovsky, I., Yildiz, Y., Girard, A.R., 2020. Game-theoretic modeling of traffic in unsignalized intersection network for autonomous vehicle control verification and validation. IEEE Transactions on Intelligent Transportation Systems . 28
work page 2020
-
[35]
Umargono, E., Suseno, J.E., Gunawan, S., 2019. K-means clustering optimization using the elbow method and early centroid determination based-on mean and median, in: Proceedings of the International Conferences on Information System and Technology, SCITEPRESS—Science and Technology Publications Setubal, Portugal. pp. 234–240
work page 2019
-
[36]
Wang, Y., Li, J., Ke, T., Ke, Z., Jiang, J., Xu, S., Wang, J., 2024. A homogeneous multi-vehicle cooperative group decision-making method in complicated mixed traffic scenarios. Transportation Research Part C: Emerging Technologies 167, 104833
work page 2024
-
[37]
Winter, E., 2002. The shapley value. Handbook of game theory with economic appli- cations 3, 2025–2054
work page 2002
-
[38]
Influence of cavs platooning on intersection capacity under mixed traffic
Wu, Y., Wang, D.Z., Zhu, F., 2022. Influence of cavs platooning on intersection capacity under mixed traffic. Physica A: Statistical Mechanics and its Applications 593, 126989
work page 2022
-
[39]
Distributed conflict-free cooperation for multiple connected vehicles at unsignalized intersections
Xu, B., Li, S.E., Bian, Y., Li, S., Ban, X.J., Wang, J., Li, K., 2018. Distributed conflict-free cooperation for multiple connected vehicles at unsignalized intersections. Transportation Research Part C: Emerging Technologies 93, 322–334
work page 2018
-
[40]
Cooperative driving at unsignalized intersections using tree search
Xu, H., Zhang, Y., Li, L., Li, W., 2019. Cooperative driving at unsignalized intersections using tree search. IEEE Transactions on Intelligent Transportation Systems 21, 4563– 4571
work page 2019
-
[41]
Yan, Z., Zheng, H., Wu, C., 2024. Multi-agent path finding for cooperative autonomous driving, in: 2024 IEEE International Conference on Robotics and Automation (ICRA), IEEE. pp. 12361–12367
work page 2024
-
[42]
Yu, C., Sun, W., Liu, H.X., Yang, X., 2019. Managingconnectedandautomatedvehicles at isolated intersections: From reservation-to optimization-based methods. Transporta- tion research part B: methodological 122, 416–435
work page 2019
-
[43]
Zhang, Y., Hao, R., Zhang, T., Chang, X., Xie, Z., Zhang, Q., 2021. A trajectory optimization-based intersection coordination framework for cooperative autonomous ve- hicles. IEEE Transactions on Intelligent Transportation Systems 23, 14674–14688
work page 2021
-
[44]
Zheng, S.T., Jiang, R., Zhang, H., Tian, J., Yan, R., Jia, B., Gao, Z., 2025. Oscillation growth in mixed traffic of human-driven and automated vehicles in both experimental study and simulation. Transportation Science
work page 2025
-
[45]
Zhou, D., Hang, P., Sun, J., 2024. Reasoning graph-based reinforcement learning to cooperate mixed connected and autonomous traffic at unsignalized intersections. Transportation Research Part C: Emerging Technologies 167, 104807. URL:https: //www.sciencedirect.com/science/article/pii/S0968090X24003280, doi:https: //doi.org/10.1016/j.trc.2024.104807
-
[46]
Zhou, D., Ma, Z., Zhao, X., Sun, J., 2022. Reasoning graph: A situation-aware frame- work for cooperating unprotected turns under mixed connected and autonomous traffic environments. Transportation Research Part C: Emerging Technologies 143. 29
work page 2022
-
[47]
Zhou, H., Jiang, K., He, S., Min, G., Wu, J., 2023. Distributed deep multi-agent rein- forcement learning for cooperative edge caching in internet-of-vehicles. IEEE Transac- tions on Wireless Communications 22, 9595–9609. doi:10.1109/TWC.2023.3272348
-
[48]
Zimmermann, M., Schopf, D., Lütteken, N., Liu, Z., Storost, K., Baumann, M., Happee, R., Bengler, K.J., 2018. Carrot and stick: A game-theoretic approach to motivate coop- erative driving through social interaction. Transportation Research Part C: Emerging Technologies 88, 159–175. 30
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.