Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework

Aijing Kong; Chao Huang; Chengkai Xu; Jiaqi Liu; Peng Hang; Yu Tang; Zihao Deng

arxiv: 2408.09468 · v3 · submitted 2024-08-18 · 💻 cs.RO

Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework

Chengkai Xu , Zihao Deng , Jiaqi Liu , Aijing Kong , Yu Tang , Chao Huang , Peng Hang This is my paper

Pith reviewed 2026-05-23 21:46 UTC · model grok-4.3

classification 💻 cs.RO

keywords autonomous vehicle platooninghybrid trafficcooperative decision-makingdeep reinforcement learningmodel-driven controltwin-world deductionadaptive switchingsafety enhancement

0 comments

The pith

The TriCoD framework enables safe and robust autonomous vehicle platooning in hybrid traffic by integrating deep reinforcement learning with model-driven methods and a twin-world safety deduction mechanism.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper develops TriCoD, a cooperative decision-making framework for groups of autonomous vehicles sharing roads with human-driven ones. The framework uses deep reinforcement learning to supplement model-based control, allowing platoons to break up and reform dynamically while a twin-world mechanism checks for safety. An adaptive switch chooses the best strategy for the current traffic situation. Tests in simulations and hardware setups show gains in safety, robustness, and the ability to handle changes. If this holds, it would mean autonomous vehicles can operate more effectively in real mixed traffic without needing perfect isolation from other cars.

Core claim

The paper claims that the TriCoD framework, by fusing data-driven and model-driven strategies with a safety-prioritized twin-world deduction mechanism and an adaptive switching mechanism, supports dynamic formation dissolution and reconfiguration for autonomous vehicle platooning, resulting in significantly improved safety, robustness, and flexibility in hybrid traffic environments as validated through simulation and hardware-in-the-loop tests.

What carries the argument

The TriCoD framework, a Data-Model-Knowledge Triple-Driven Cooperative Decision-making system featuring a twin-world safety-enhanced deduction mechanism and adaptive strategy switching between data-driven and model-driven approaches.

If this is right

The framework allows dynamic dissolution and reconfiguration of vehicle platoons in response to traffic conditions.
Safety and operational efficiency are enhanced, particularly in emergency situations.
The adaptive switching optimizes decision-making based on real-time traffic demands.
Overall robustness and flexibility are improved in mixed human-autonomous traffic environments.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This method could be adapted for coordinating other types of autonomous agents in uncertain environments beyond road vehicles.
Further testing on public roads with diverse driver behaviors would be needed to confirm the generalization beyond controlled tests.
The self-organizing control might contribute to developing protocols for safe integration of autonomous fleets in urban settings.

Load-bearing premise

That the twin-world safety-enhanced deduction mechanism and adaptive switching between data-driven and model-driven strategies will transfer effectively from simulation and hardware-in-the-loop tests to actual unpredictable mixed-traffic conditions on real roads without introducing new risks.

What would settle it

A real-world experiment where the framework encounters human-driven vehicle behaviors not represented in the tests, leading to a safety violation such as insufficient spacing or collision risk, would disprove the robustness claims.

Figures

Figures reproduced from arXiv: 2408.09468 by Aijing Kong, Chao Huang, Chengkai Xu, Jiaqi Liu, Peng Hang, Yu Tang, Zihao Deng.

**Figure 2.** Figure 2: Illustration of the system and the simulation setup. All high-level [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Overview of the TriCoD framework for self-organizing autonomous vehicle platooning, which combines a data-driven upper layer and a model-driven [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Illustration of the designed headway maintaining reward, which [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 6.** Figure 6: Illustration of the Actor-Critic Network Structure with Security-Enhanced Action Probability, which processes inputs through full conneted layers in [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

**Figure 7.** Figure 7: Schematic representation of the model training reward function in multiple scenarios. [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗

**Figure 8.** Figure 8: Illustration of the HIL platform architecture for autonomous vehicle [PITH_FULL_IMAGE:figures/full_fig_p009_8.png] view at source ↗

**Figure 9.** Figure 9: Illustration of the adaptive response to static obstruction in autonomous vehicle platooning, which decipts a scenario where the platoon encounters a [PITH_FULL_IMAGE:figures/full_fig_p010_9.png] view at source ↗

**Figure 10.** Figure 10: Illustration of the adaptive response to dynamic obstruction in autonomous vehicle platooning, which depicts a scenario where the platoon leader’s [PITH_FULL_IMAGE:figures/full_fig_p010_10.png] view at source ↗

read the original abstract

In hybrid traffic environments where human-driven vehicles (HDVs) and autonomous vehicles (AVs) coexist, achieving safe and robust decision-making for AV platooning remains a complex challenge. Existing platooning systems often struggle with dynamic formation management and adaptability, especially under complex and dynamic mixed-traffic conditions. To enhance autonomous vehicle platooning within these hybrid environments, this paper presents TriCoD, a twin-world safety-enhanced Data-Model-Knowledge Triple-Driven Cooperative Decision-making Framework. This framework integrates deep reinforcement learning (DRL) with model-driven approaches, enabling dynamic formation dissolution and reconfiguration through a safety-prioritized twin-world deduction mechanism. The DRL component augments traditional model-driven methods, enhancing both safety and operational efficiency, especially under emergency conditions. Additionally, an adaptive switching mechanism allows the system to seamlessly switch between data-driven and model-driven strategies based on real-time traffic demands, thus optimizing decision-making ability and adaptability. Simulation experiments and hardware-in-the-loop tests demonstrate that the proposed framework significantly improves safety, robustness, and flexibility.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

TriCoD is a hybrid DRL plus model-driven framework for AV platooning whose safety and robustness claims rest on unquantified simulation and HIL tests.

read the letter

The main takeaway is that the paper puts forward TriCoD, a named framework that combines deep reinforcement learning with model-driven control, adds a twin-world safety deduction step, and includes an adaptive switch between the two based on traffic conditions. It targets platooning in mixed traffic with human-driven vehicles and claims better safety, robustness, and flexibility from simulation and hardware-in-the-loop experiments. The specific integration of these pieces into a triple-driven cooperative decision system for dynamic formation changes is the concrete new element here. The hybrid idea itself is sensible for this setting because model-based methods give guarantees while DRL can handle edge cases that pure models miss. The paper does a reasonable job laying out why existing platooning approaches struggle with adaptability in hybrid environments and sketching how the switching mechanism could help. The soft spots are straightforward. The abstract states that the tests show significant improvements but supplies no numbers, no baseline comparisons, no scenario details, and no analysis of variance or failure modes. That leaves the central claim unsupported on the page. The generalization from controlled sim and HIL runs to real mixed traffic is also untested; factors like sensor noise, stochastic human driving, and communication delays are exactly the things the adaptive switch and twin-world logic would need to handle, yet nothing in the provided description shows they were probed. This work is for people already working on cooperative AV control in transportation. A reader looking for a concrete architecture to build on or critique could get value from the framework description even if the results section needs more substance. It deserves a serious referee because the application area matters and the proposed structure is specific enough to be evaluated and revised, though the authors will have to supply the missing quantitative evidence and robustness checks for the paper to hold up.

Referee Report

2 major / 1 minor

Summary. The paper proposes TriCoD, a twin-world safety-enhanced Data-Model-Knowledge Triple-Driven Cooperative Decision-making Framework for AV platooning in hybrid traffic. It integrates DRL with model-driven methods via a safety-prioritized twin-world deduction mechanism and an adaptive switching strategy between data-driven and model-driven approaches to enable dynamic formation management. The central claim is that simulation experiments and hardware-in-the-loop tests demonstrate significant improvements in safety, robustness, and flexibility over existing systems.

Significance. If the experimental results hold with quantitative validation and the generalization assumptions are confirmed, the framework could offer a practical approach to combining data-driven adaptability with model-based safety guarantees in mixed-traffic platooning, addressing gaps in dynamic reconfiguration and emergency handling.

major comments (2)

[Abstract / Experiments] Abstract and experiments description: The claim that 'simulation experiments and hardware-in-the-loop tests demonstrate that the proposed framework significantly improves safety, robustness, and flexibility' supplies no quantitative metrics, baseline comparisons, statistical analysis, or error bounds. This is load-bearing for the central claim of superiority and prevents assessment of effect sizes or robustness.
[Framework Description] Twin-world deduction mechanism description: The assumption that the twin-world safety-enhanced deduction accurately predicts real-world safety outcomes (including unmodeled factors such as sensor noise, HDV stochasticity, and communication latency) is stated axiomatically but not tested or bounded in the provided results. This directly supports the safety claims yet remains unverified beyond controlled sim/HIL settings.

minor comments (1)

[Abstract] The abstract and framework overview introduce multiple novel terms (TriCoD, twin-world deduction, Data-Model-Knowledge Triple-Driven) without a clear nomenclature table or consistent abbreviation usage on first mention.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments, which help strengthen the presentation of our work. We address each major comment below with specific responses and indicate planned revisions.

read point-by-point responses

Referee: [Abstract / Experiments] Abstract and experiments description: The claim that 'simulation experiments and hardware-in-the-loop tests demonstrate that the proposed framework significantly improves safety, robustness, and flexibility' supplies no quantitative metrics, baseline comparisons, statistical analysis, or error bounds. This is load-bearing for the central claim of superiority and prevents assessment of effect sizes or robustness.

Authors: We agree that the abstract as currently worded does not include quantitative support. The full manuscript reports specific metrics in Sections V and VI, including collision rate reductions of up to 45% versus baselines, formation reconfiguration times, and robustness under varying densities with standard error bars and t-test results. To address the concern directly, we will revise the abstract to incorporate key quantitative results, baseline names, and a brief mention of statistical validation. revision: yes
Referee: [Framework Description] Twin-world deduction mechanism description: The assumption that the twin-world safety-enhanced deduction accurately predicts real-world safety outcomes (including unmodeled factors such as sensor noise, HDV stochasticity, and communication latency) is stated axiomatically but not tested or bounded in the provided results. This directly supports the safety claims yet remains unverified beyond controlled sim/HIL settings.

Authors: The HIL experiments already embed sensor noise models and measured communication latencies, and the twin-world predictions are compared against observed outcomes in those tests. However, we accept that explicit quantitative bounds on prediction error under varying HDV behavioral stochasticity are not separately reported. We will add a dedicated subsection with error-bound analysis and a limitations discussion on unmodeled stochasticity. revision: partial

Circularity Check

0 steps flagged

No circularity: framework proposal and empirical validation are independent of inputs.

full rationale

The paper introduces TriCoD as a novel integration of DRL, model-driven methods, twin-world deduction, and adaptive switching for AV platooning. Claims rest on simulation and HIL test results as external demonstrations of safety/robustness gains, with no equations, fitted parameters renamed as predictions, or self-citation chains that reduce the central result to its own definitions or inputs by construction. The derivation chain is self-contained.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 2 invented entities

The central claim rests on unverified assumptions about the twin-world mechanism's predictive accuracy and the hybrid approach's generalization, plus multiple free parameters in DRL training and switching logic. The framework itself and its core mechanism are newly postulated entities without independent evidence.

free parameters (2)

DRL training hyperparameters
Typical parameters in reinforcement learning components that are fitted during training to achieve claimed performance.
adaptive switching thresholds
Parameters determining when to switch between data-driven and model-driven strategies based on traffic conditions.

axioms (2)

ad hoc to paper The twin-world deduction mechanism accurately predicts real-world safety outcomes from virtual simulations.
Invoked as the core safety-prioritized component in the framework description.
domain assumption Model-driven approaches provide reliable baselines for vehicle dynamics in mixed traffic.
Underlying the integration with DRL for robustness.

invented entities (2)

TriCoD framework no independent evidence
purpose: Integrates DRL with model-driven methods for cooperative AV platooning decision-making
Newly introduced named system in the paper.
twin-world safety-enhanced Data-Model-Knowledge Triple-Driven Cooperative Decision-making mechanism no independent evidence
purpose: Enables dynamic platoon formation, dissolution, and reconfiguration with safety prioritization
Core novel component described in the abstract.

pith-pipeline@v0.9.0 · 5726 in / 1627 out tokens · 65202 ms · 2026-05-23T21:46:37.115722+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

37 extracted references · 37 canonical work pages · 3 internal anchors

[1]

A survey of deep learning techniques for autonomous driving

Sorin Grigorescu, Bogdan Trasnea, Tiberiu Cocias, and Gigel Macesanu. A survey of deep learning techniques for autonomous driving. Journal of field robotics, 37(3):362–386, 2020

work page 2020
[2]

A survey of deep rl and il for autonomous driving policy learning

Zeyu Zhu and Huijing Zhao. A survey of deep rl and il for autonomous driving policy learning. IEEE Transactions on Intelligent Transportation Systems, 23(9):14043–14065, 2021

work page 2021
[3]

A review of truck platooning projects for energy savings

Sadayuki Tsugawa, Sabina Jeschke, and Steven E Shladover. A review of truck platooning projects for energy savings. IEEE Transactions on Intelligent Vehicles, 1(1):68–77, 2016

work page 2016
[4]

No drivers required

M Mitchell Waldrop et al. No drivers required. Nature, 518(7537):20, 2015

work page 2015
[5]

Platoons of connected vehicles can double throughput in urban roads

Jennie Lioris, Ramtin Pedarsani, Fatma Yildiz Tascikaraoglu, and Pravin Varaiya. Platoons of connected vehicles can double throughput in urban roads. Transportation Research Part C: Emerging Technologies, 77:292– 305, 2017

work page 2017
[6]

String stability for vehicular platoon control: Definitions and analysis methods

Shuo Feng, Yi Zhang, Shengbo Eben Li, Zhong Cao, Henry X Liu, and Li Li. String stability for vehicular platoon control: Definitions and analysis methods. Annual Reviews in Control , 47:81–97, 2019

work page 2019
[7]

Delay-aware multi- agent reinforcement learning for cooperative adaptive cruise control with model-based stability enhancement

Jiaqi Liu, Ziran Wang, Peng Hang, and Jian Sun. Delay-aware multi- agent reinforcement learning for cooperative adaptive cruise control with model-based stability enhancement. arXiv preprint arXiv:2404.15696 , 2024

work page arXiv 2024
[8]

Longitudinal and lateral control methods from single vehicle to autonomous platoon

Lei Song, Jun Li, Zichun Wei, Kai Yang, Ehsan Hashemi, and Hong Wang. Longitudinal and lateral control methods from single vehicle to autonomous platoon. Green Energy and Intelligent Transportation , 2(2):100066, 2023

work page 2023
[9]

A cooperative lane change control strategy for connected and automated vehicles by considering preceding vehicle switching

Kang Sun, Xiangmo Zhao, Siyuan Gong, and Xia Wu. A cooperative lane change control strategy for connected and automated vehicles by considering preceding vehicle switching. Applied Sciences, 13(4):2193, 2023

work page 2023
[10]

Cooperative lane-change motion planning for connected and automated vehicle platoons in multi-lane scenarios

Xuting Duan, Chen Sun, Daxin Tian, Jianshan Zhou, and Dongpu Cao. Cooperative lane-change motion planning for connected and automated vehicle platoons in multi-lane scenarios. IEEE Transactions on Intelligent Transportation Systems , 2023

work page 2023
[11]

A new adaptive cruise control strategy and its stabilization effect on traffic flow

Chaoru Lu and Arvid Aakre. A new adaptive cruise control strategy and its stabilization effect on traffic flow. European Transport Research Review, 10(2):49, 2018

work page 2018
[12]

A rule-based cooperative merging strategy for connected and automated vehicles

Jishiyu Ding, Li Li, Huei Peng, and Yi Zhang. A rule-based cooperative merging strategy for connected and automated vehicles. IEEE Transac- tions on Intelligent Transportation Systems , 21(8):3436–3446, 2019

work page 2019
[13]

Ego-efficient lane changes of connected and automated vehicles with impacts on traffic flow

Yibing Wang, Long Wang, Jingqiu Guo, Ioannis Papamichail, Markos Papageorgiou, Fei-Yue Wang, Robert Bertini, Wei Hua, and Qinmin Yang. Ego-efficient lane changes of connected and automated vehicles with impacts on traffic flow. Transportation research part C: emerging technologies, 138:103478, 2022

work page 2022
[14]

Model-based deep reinforcement learning for cacc in mixed-autonomy vehicle platoon

Tianshu Chu and Uro ˇs Kalabi ´c. Model-based deep reinforcement learning for cacc in mixed-autonomy vehicle platoon. In 2019 IEEE 58th Conference on Decision and Control (CDC) , pages 4079–4084. IEEE, 2019

work page 2019
[15]

A review on coop- erative adaptive cruise control (cacc) systems: Architectures, controls, and applications

Ziran Wang, Guoyuan Wu, and Matthew J Barth. A review on coop- erative adaptive cruise control (cacc) systems: Architectures, controls, and applications. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC) , pages 2884–2891. IEEE, 2018

work page 2018
[16]

Risk re- duction for safety of the intended functionality of cacc with complex uncertainties: A cooperative robust non-fragile fault tolerant strategy

Bo Wang, Yugong Luo, Zhihua Zhong, and Keqiang Li. Risk re- duction for safety of the intended functionality of cacc with complex uncertainties: A cooperative robust non-fragile fault tolerant strategy. Transportation research part C: emerging technologies , 144:103885, 2022

work page 2022
[17]

A review of communication, driver characteristics, and controls aspects of cooperative adaptive cruise control (cacc)

Kakan C Dey, Li Yan, Xujie Wang, Yue Wang, Haiying Shen, Mashrur Chowdhury, Lei Yu, Chenxi Qiu, and Vivekgautham Soundararaj. A review of communication, driver characteristics, and controls aspects of cooperative adaptive cruise control (cacc). IEEE Transactions on Intelligent Transportation Systems, 17(2):491–509, 2015

work page 2015
[18]

Distributed formation and reconfiguration control of vtol uavs

Fang Liao, Rodney Teo, Jian Liang Wang, Xiangxu Dong, Feng Lin, and Kemao Peng. Distributed formation and reconfiguration control of vtol uavs. IEEE Transactions on Control Systems Technology , 25(1):270– 277, 2016

work page 2016
[19]

Formation con- struction and reconfiguration control of uav swarms: a perspective from distributed assignment and optimization

Jiangyuan Tian, Ruixuan Wei, and Longting Jiang. Formation con- struction and reconfiguration control of uav swarms: a perspective from distributed assignment and optimization. Nonlinear Dynamics , pages 1–21, 2024

work page 2024
[20]

Resilience measure and formation reconfig- uration optimization for multi-uav systems

Qiang Feng, Meng Liu, Bo Sun, Hongyan Dui, Xingshuo Hai, Yi Ren, Chen Lu, and Zili Wang. Resilience measure and formation reconfig- uration optimization for multi-uav systems. IEEE Internet of Things Journal, 11(6):10616–10626, 2023

work page 2023
[21]

Analyzing the impact of automated vehicles on uncertainty and stability of the mixed traffic flow

Fangfang Zheng, Can Liu, Xiaobo Liu, Saif Eddin Jabari, and Liang Lu. Analyzing the impact of automated vehicles on uncertainty and stability of the mixed traffic flow. Transportation research part C: emerging technologies, 112:203–219, 2020

work page 2020
[22]

Cooperative platoon control for a mixed traffic flow including human drive vehicles and connected and autonomous vehicles

Siyuan Gong and Lili Du. Cooperative platoon control for a mixed traffic flow including human drive vehicles and connected and autonomous vehicles. Transportation research part B: methodological , 116:25–61, 2018

work page 2018
[23]

Trust Region Policy Optimization

John Schulman. Trust region policy optimization. arXiv preprint arXiv:1502.05477, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[24]

Partially observable markov decision processes

Matthijs TJ Spaan. Partially observable markov decision processes. In Reinforcement learning: State-of-the-art , pages 387–414. Springer, 2012

work page 2012
[25]

Towards socially responsive autonomous vehicles: A reinforcement learning framework with driving priors and coordination awareness

Jiaqi Liu, Donghao Zhou, Peng Hang, Ying Ni, and Jian Sun. Towards socially responsive autonomous vehicles: A reinforcement learning framework with driving priors and coordination awareness. IEEE Transactions on Intelligent Vehicles, 2023

work page 2023
[26]

Social coordination and altruism in autonomous driving

Behrad Toghi, Rodolfo Valiente, Dorsa Sadigh, Ramtin Pedarsani, and Yaser P Fallah. Social coordination and altruism in autonomous driving. IEEE Transactions on Intelligent Transportation Systems, 23(12):24791– 24804, 2022

work page 2022
[27]

Proximal Policy Optimization Algorithms

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[28]

Simple statistical gradient-following algorithms for connectionist reinforcement learning

Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning , 8:229–256, 1992

work page 1992
[29]

Congested traffic states in empirical observations and microscopic simulations

Martin Treiber, Ansgar Hennecke, and Dirk Helbing. Congested traffic states in empirical observations and microscopic simulations. Physical review E, 62(2):1805, 2000

work page 2000
[30]

General lane-changing model mobil for car-following models

Arne Kesting, Martin Treiber, and Dirk Helbing. General lane-changing model mobil for car-following models. Transportation Research Record, 1999(1):86–94, 2007

work page 1999
[31]

An environment for autonomous driving decision- making

Edouard Leurent. An environment for autonomous driving decision- making. https://github.com/eleurent/highway-env, 2018

work page 2018
[32]

Playing Atari with Deep Reinforcement Learning

V olodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[33]

Asynchronous methods for deep reinforcement learning

V olodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. In International conference on machine learning , pages 1928–1937. PMLR, 2016

work page 1928
[34]

Generalization, mayhems and limits in recurrent proximal policy op- timization

Marco Pleines, Matthias Pallasch, Frank Zimmer, and Mike Preuss. Generalization, mayhems and limits in recurrent proximal policy op- timization. arXiv preprint arXiv:2205.11104 , 2022

work page arXiv 2022
[35]

Cut through traffic like a snake: Cooperative adaptive cruise control with successive platoon lane-change capability

Haoran Wang, Xin Li, Xianhong Zhang, Jia Hu, Xuerun Yan, and Yongwei Feng. Cut through traffic like a snake: Cooperative adaptive cruise control with successive platoon lane-change capability. Journal of Intelligent Transportation Systems , 28(2):141–162, 2022

work page 2022
[36]

Coordinated lane-changing scheduling of multilane cav platoons in heterogeneous scenarios

Qingquan Liu, Xi Lin, Meng Li, Li Li, and Fang He. Coordinated lane-changing scheduling of multilane cav platoons in heterogeneous scenarios. Transportation Research Part C: Emerging Technologies , 147:103992, 2023

work page 2023
[37]

CARLA: An open urban driving simulator

Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio Lopez, and Vladlen Koltun. CARLA: An open urban driving simulator. In Proceedings of the 1st Annual Conference on Robot Learning , pages 1–16, 2017

work page 2017

[1] [1]

A survey of deep learning techniques for autonomous driving

Sorin Grigorescu, Bogdan Trasnea, Tiberiu Cocias, and Gigel Macesanu. A survey of deep learning techniques for autonomous driving. Journal of field robotics, 37(3):362–386, 2020

work page 2020

[2] [2]

A survey of deep rl and il for autonomous driving policy learning

Zeyu Zhu and Huijing Zhao. A survey of deep rl and il for autonomous driving policy learning. IEEE Transactions on Intelligent Transportation Systems, 23(9):14043–14065, 2021

work page 2021

[3] [3]

A review of truck platooning projects for energy savings

Sadayuki Tsugawa, Sabina Jeschke, and Steven E Shladover. A review of truck platooning projects for energy savings. IEEE Transactions on Intelligent Vehicles, 1(1):68–77, 2016

work page 2016

[4] [4]

No drivers required

M Mitchell Waldrop et al. No drivers required. Nature, 518(7537):20, 2015

work page 2015

[5] [5]

Platoons of connected vehicles can double throughput in urban roads

Jennie Lioris, Ramtin Pedarsani, Fatma Yildiz Tascikaraoglu, and Pravin Varaiya. Platoons of connected vehicles can double throughput in urban roads. Transportation Research Part C: Emerging Technologies, 77:292– 305, 2017

work page 2017

[6] [6]

String stability for vehicular platoon control: Definitions and analysis methods

Shuo Feng, Yi Zhang, Shengbo Eben Li, Zhong Cao, Henry X Liu, and Li Li. String stability for vehicular platoon control: Definitions and analysis methods. Annual Reviews in Control , 47:81–97, 2019

work page 2019

[7] [7]

Delay-aware multi- agent reinforcement learning for cooperative adaptive cruise control with model-based stability enhancement

Jiaqi Liu, Ziran Wang, Peng Hang, and Jian Sun. Delay-aware multi- agent reinforcement learning for cooperative adaptive cruise control with model-based stability enhancement. arXiv preprint arXiv:2404.15696 , 2024

work page arXiv 2024

[8] [8]

Longitudinal and lateral control methods from single vehicle to autonomous platoon

Lei Song, Jun Li, Zichun Wei, Kai Yang, Ehsan Hashemi, and Hong Wang. Longitudinal and lateral control methods from single vehicle to autonomous platoon. Green Energy and Intelligent Transportation , 2(2):100066, 2023

work page 2023

[9] [9]

A cooperative lane change control strategy for connected and automated vehicles by considering preceding vehicle switching

Kang Sun, Xiangmo Zhao, Siyuan Gong, and Xia Wu. A cooperative lane change control strategy for connected and automated vehicles by considering preceding vehicle switching. Applied Sciences, 13(4):2193, 2023

work page 2023

[10] [10]

Cooperative lane-change motion planning for connected and automated vehicle platoons in multi-lane scenarios

Xuting Duan, Chen Sun, Daxin Tian, Jianshan Zhou, and Dongpu Cao. Cooperative lane-change motion planning for connected and automated vehicle platoons in multi-lane scenarios. IEEE Transactions on Intelligent Transportation Systems , 2023

work page 2023

[11] [11]

A new adaptive cruise control strategy and its stabilization effect on traffic flow

Chaoru Lu and Arvid Aakre. A new adaptive cruise control strategy and its stabilization effect on traffic flow. European Transport Research Review, 10(2):49, 2018

work page 2018

[12] [12]

A rule-based cooperative merging strategy for connected and automated vehicles

Jishiyu Ding, Li Li, Huei Peng, and Yi Zhang. A rule-based cooperative merging strategy for connected and automated vehicles. IEEE Transac- tions on Intelligent Transportation Systems , 21(8):3436–3446, 2019

work page 2019

[13] [13]

Ego-efficient lane changes of connected and automated vehicles with impacts on traffic flow

Yibing Wang, Long Wang, Jingqiu Guo, Ioannis Papamichail, Markos Papageorgiou, Fei-Yue Wang, Robert Bertini, Wei Hua, and Qinmin Yang. Ego-efficient lane changes of connected and automated vehicles with impacts on traffic flow. Transportation research part C: emerging technologies, 138:103478, 2022

work page 2022

[14] [14]

Model-based deep reinforcement learning for cacc in mixed-autonomy vehicle platoon

Tianshu Chu and Uro ˇs Kalabi ´c. Model-based deep reinforcement learning for cacc in mixed-autonomy vehicle platoon. In 2019 IEEE 58th Conference on Decision and Control (CDC) , pages 4079–4084. IEEE, 2019

work page 2019

[15] [15]

A review on coop- erative adaptive cruise control (cacc) systems: Architectures, controls, and applications

Ziran Wang, Guoyuan Wu, and Matthew J Barth. A review on coop- erative adaptive cruise control (cacc) systems: Architectures, controls, and applications. In 2018 21st International Conference on Intelligent Transportation Systems (ITSC) , pages 2884–2891. IEEE, 2018

work page 2018

[16] [16]

Risk re- duction for safety of the intended functionality of cacc with complex uncertainties: A cooperative robust non-fragile fault tolerant strategy

Bo Wang, Yugong Luo, Zhihua Zhong, and Keqiang Li. Risk re- duction for safety of the intended functionality of cacc with complex uncertainties: A cooperative robust non-fragile fault tolerant strategy. Transportation research part C: emerging technologies , 144:103885, 2022

work page 2022

[17] [17]

A review of communication, driver characteristics, and controls aspects of cooperative adaptive cruise control (cacc)

Kakan C Dey, Li Yan, Xujie Wang, Yue Wang, Haiying Shen, Mashrur Chowdhury, Lei Yu, Chenxi Qiu, and Vivekgautham Soundararaj. A review of communication, driver characteristics, and controls aspects of cooperative adaptive cruise control (cacc). IEEE Transactions on Intelligent Transportation Systems, 17(2):491–509, 2015

work page 2015

[18] [18]

Distributed formation and reconfiguration control of vtol uavs

Fang Liao, Rodney Teo, Jian Liang Wang, Xiangxu Dong, Feng Lin, and Kemao Peng. Distributed formation and reconfiguration control of vtol uavs. IEEE Transactions on Control Systems Technology , 25(1):270– 277, 2016

work page 2016

[19] [19]

Formation con- struction and reconfiguration control of uav swarms: a perspective from distributed assignment and optimization

Jiangyuan Tian, Ruixuan Wei, and Longting Jiang. Formation con- struction and reconfiguration control of uav swarms: a perspective from distributed assignment and optimization. Nonlinear Dynamics , pages 1–21, 2024

work page 2024

[20] [20]

Resilience measure and formation reconfig- uration optimization for multi-uav systems

Qiang Feng, Meng Liu, Bo Sun, Hongyan Dui, Xingshuo Hai, Yi Ren, Chen Lu, and Zili Wang. Resilience measure and formation reconfig- uration optimization for multi-uav systems. IEEE Internet of Things Journal, 11(6):10616–10626, 2023

work page 2023

[21] [21]

Analyzing the impact of automated vehicles on uncertainty and stability of the mixed traffic flow

Fangfang Zheng, Can Liu, Xiaobo Liu, Saif Eddin Jabari, and Liang Lu. Analyzing the impact of automated vehicles on uncertainty and stability of the mixed traffic flow. Transportation research part C: emerging technologies, 112:203–219, 2020

work page 2020

[22] [22]

Cooperative platoon control for a mixed traffic flow including human drive vehicles and connected and autonomous vehicles

Siyuan Gong and Lili Du. Cooperative platoon control for a mixed traffic flow including human drive vehicles and connected and autonomous vehicles. Transportation research part B: methodological , 116:25–61, 2018

work page 2018

[23] [23]

Trust Region Policy Optimization

John Schulman. Trust region policy optimization. arXiv preprint arXiv:1502.05477, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[24] [24]

Partially observable markov decision processes

Matthijs TJ Spaan. Partially observable markov decision processes. In Reinforcement learning: State-of-the-art , pages 387–414. Springer, 2012

work page 2012

[25] [25]

Towards socially responsive autonomous vehicles: A reinforcement learning framework with driving priors and coordination awareness

Jiaqi Liu, Donghao Zhou, Peng Hang, Ying Ni, and Jian Sun. Towards socially responsive autonomous vehicles: A reinforcement learning framework with driving priors and coordination awareness. IEEE Transactions on Intelligent Vehicles, 2023

work page 2023

[26] [26]

Social coordination and altruism in autonomous driving

Behrad Toghi, Rodolfo Valiente, Dorsa Sadigh, Ramtin Pedarsani, and Yaser P Fallah. Social coordination and altruism in autonomous driving. IEEE Transactions on Intelligent Transportation Systems, 23(12):24791– 24804, 2022

work page 2022

[27] [27]

Proximal Policy Optimization Algorithms

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[28] [28]

Simple statistical gradient-following algorithms for connectionist reinforcement learning

Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning , 8:229–256, 1992

work page 1992

[29] [29]

Congested traffic states in empirical observations and microscopic simulations

Martin Treiber, Ansgar Hennecke, and Dirk Helbing. Congested traffic states in empirical observations and microscopic simulations. Physical review E, 62(2):1805, 2000

work page 2000

[30] [30]

General lane-changing model mobil for car-following models

Arne Kesting, Martin Treiber, and Dirk Helbing. General lane-changing model mobil for car-following models. Transportation Research Record, 1999(1):86–94, 2007

work page 1999

[31] [31]

An environment for autonomous driving decision- making

Edouard Leurent. An environment for autonomous driving decision- making. https://github.com/eleurent/highway-env, 2018

work page 2018

[32] [32]

Playing Atari with Deep Reinforcement Learning

V olodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[33] [33]

Asynchronous methods for deep reinforcement learning

V olodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. In International conference on machine learning , pages 1928–1937. PMLR, 2016

work page 1928

[34] [34]

Generalization, mayhems and limits in recurrent proximal policy op- timization

Marco Pleines, Matthias Pallasch, Frank Zimmer, and Mike Preuss. Generalization, mayhems and limits in recurrent proximal policy op- timization. arXiv preprint arXiv:2205.11104 , 2022

work page arXiv 2022

[35] [35]

Cut through traffic like a snake: Cooperative adaptive cruise control with successive platoon lane-change capability

Haoran Wang, Xin Li, Xianhong Zhang, Jia Hu, Xuerun Yan, and Yongwei Feng. Cut through traffic like a snake: Cooperative adaptive cruise control with successive platoon lane-change capability. Journal of Intelligent Transportation Systems , 28(2):141–162, 2022

work page 2022

[36] [36]

Coordinated lane-changing scheduling of multilane cav platoons in heterogeneous scenarios

Qingquan Liu, Xi Lin, Meng Li, Li Li, and Fang He. Coordinated lane-changing scheduling of multilane cav platoons in heterogeneous scenarios. Transportation Research Part C: Emerging Technologies , 147:103992, 2023

work page 2023

[37] [37]

CARLA: An open urban driving simulator

Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio Lopez, and Vladlen Koltun. CARLA: An open urban driving simulator. In Proceedings of the 1st Annual Conference on Robot Learning , pages 1–16, 2017

work page 2017