Generative Models and Connected and Automated Vehicles: A Survey in Exploring the Intersection of Transportation and AI
Pith reviewed 2026-05-24 02:46 UTC · model grok-4.3
The pith
Generative models can enhance predictive modeling, simulation accuracy, and decision-making in connected and automated vehicles.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By focusing on the application of generative models within the context of CAVs, the study aims to unravel how this integration could enhance predictive modeling, simulation accuracy, and decision-making processes in autonomous vehicles, while discussing the benefits and challenges of integration and the potential for advancements in safety and innovation.
What carries the argument
Survey of generative model applications to CAVs, centered on their use for predictive modeling, simulation, and decision processes.
Load-bearing premise
The reviewed literature on generative models and CAVs is sufficient to identify meaningful benefits and challenges in their integration without gaps in coverage or unstated technical barriers.
What would settle it
A follow-up review that identifies major unaddressed technical barriers preventing generative models from delivering measurable gains in CAV simulation accuracy or decision reliability.
Figures
read the original abstract
This report investigates the history and impact of Generative Models and Connected and Automated Vehicles (CAVs), two groundbreaking forces pushing progress in technology and transportation. By focusing on the application of generative models within the context of CAVs, the study aims to unravel how this integration could enhance predictive modeling, simulation accuracy, and decision-making processes in autonomous vehicles. This thesis discusses the benefits and challenges of integrating generative models and CAV technology in transportation. It aims to highlight the progress made, the remaining obstacles, and the potential for advancements in safety and innovation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is a survey reviewing the history and impact of generative models and connected and automated vehicles (CAVs). It focuses on their integration to improve predictive modeling, simulation accuracy, and decision-making processes in autonomous vehicles, while outlining benefits, challenges, progress made, remaining obstacles, and potential advancements in safety and innovation.
Significance. If the survey delivers a balanced and reasonably comprehensive synthesis of the relevant literature, it could serve as a helpful entry point for researchers working at the AI-transportation intersection by organizing existing work and flagging open issues. Its value rests entirely on the quality of the literature selection and the clarity of the thematic organization rather than on any new technical results.
major comments (1)
- [Abstract] Abstract: the central goal of identifying 'meaningful benefits and challenges' and 'unravel[ing] how this integration could enhance' predictive modeling etc. rests on an unstated assumption that the reviewed literature is sufficiently complete and representative; no search strategy, databases, keywords, or inclusion/exclusion criteria are supplied, making it impossible to assess coverage or potential gaps.
minor comments (1)
- [Abstract] Abstract: the text refers to both 'this report' and 'this thesis' in consecutive sentences; terminology should be made consistent throughout.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our survey manuscript. We address the single major comment below and will incorporate the suggested clarification in the revised version.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central goal of identifying 'meaningful benefits and challenges' and 'unravel[ing] how this integration could enhance' predictive modeling etc. rests on an unstated assumption that the reviewed literature is sufficiently complete and representative; no search strategy, databases, keywords, or inclusion/exclusion criteria are supplied, making it impossible to assess coverage or potential gaps.
Authors: We agree that explicitly documenting the literature search process strengthens the credibility of any survey and allows readers to better evaluate coverage. Although the current manuscript presents a narrative synthesis rather than a formal systematic review, the absence of a methods description is a valid limitation. In the revised manuscript we will insert a new subsection (likely under Introduction) that specifies the databases consulted (IEEE Xplore, ACM Digital Library, Google Scholar, arXiv), the primary keyword combinations employed, the time window considered, and the inclusion/exclusion criteria applied. This addition will directly address the referee’s concern without altering the survey’s scope or conclusions. revision: yes
Circularity Check
No significant circularity in literature survey synthesis
full rationale
This is a survey paper whose central claim is a synthesis of existing external literature on generative models in CAVs. It advances no original equations, predictions, fitted parameters, derivations, or uniqueness theorems. No load-bearing step reduces to a self-definition, fitted input renamed as prediction, or self-citation chain. The coverage assumption is definitional to any survey and does not create internal circularity. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Automated Vehicles for Safety — NHTSA,
NHTSA, “Automated Vehicles for Safety — NHTSA,” www.nhtsa.gov. https://www.nhtsa.gov/vehicle-safety/automated-vehicles-safety
-
[2]
Autonomous driving’s future: Convenient and connected — McKinsey,
McKinsey, “Autonomous driving’s future: Convenient and connected — McKinsey,” www.mckinsey.com, Jan. 06, 2023. https://www.mckinsey.com/industries/automotive-and-assembly/our- insights/autonomous-drivings-future-convenient-and-connected
work page 2023
-
[3]
The 6 Challenges of Autonomous Vehicles and How to Overcome Them,
R. McCauley, “The 6 Challenges of Autonomous Vehicles and How to Overcome Them,” Govtech.com, 2019. https://www.govtech.com/fs/The-6-Challenges-of-Autonomous- Vehicles-and-How-to-Overcome-Them.html
work page 2019
-
[4]
Zhou, Andy, Bo Li, and Haohan Wang. ”Robust prompt optimization for defending language models against jailbreaking attacks.” arXiv preprint arXiv:2401.17263 (2024)
-
[5]
”Assessing prompt injection risks in 200+ custom gpts.” arXiv preprint arXiv:2311.11538 (2023)
Yu, Jiahao, et al. ”Assessing prompt injection risks in 200+ custom gpts.” arXiv preprint arXiv:2311.11538 (2023)
-
[6]
Jin, Mingyu, et al. ”AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models.” arXiv preprint arXiv:2401.09002 (2024)
-
[7]
Divya Saxena and Jiannong Cao. 2021. Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions. ACM Comput. Surv. 54, 3, Article 63 (April 2022), 42 pages. https://doi.org/10.1145/3446374
-
[8]
H. Lin, Y . Liu, S. Li and X. Qu, ”How Generative Adversarial Networks Promote the Development of Intelligent Transportation Systems: A Survey,” in IEEE/CAA Journal of Automatica Sinica, vol. 10, no. 9, pp. 1781-1796, September 2023, doi: 10.1109/JAS.2023.123744
-
[9]
S. Aradi, ”Survey of Deep Reinforcement Learning for Motion Plan- ning of Autonomous Vehicles,” in IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 2, pp. 740-759, Feb. 2022, doi: 10.1109/TITS.2020.3024655
-
[10]
A. Alharin, T. -N. Doan and M. Sartipi, ”Reinforcement Learning Interpretation Methods: A Survey,” in IEEE Access, vol. 8, pp. 171058- 171077, 2020, doi: 10.1109/ACCESS.2020.3023394
-
[11]
Rishubh Parihar, Ankit Dhiman, Tejan Karmali, and Venkatesh R
-
[12]
In Proceedings of the 30th ACM International Conference on Multimedia (MM ’22)
Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration. In Proceedings of the 30th ACM International Conference on Multimedia (MM ’22). Association for Computing Machinery, New York, NY , USA, 1828–1836. https://doi.org/10.1145/3503161.3547972
-
[13]
Omer Tov, Yuval Alaluf, Yotam Nitzan, Or Patashnik, and Daniel Cohen- Or. 2021. Designing an encoder for StyleGAN image manipulation. ACM Trans. Graph. 40, 4, Article 133 (August 2021), 14 pages. https://doi.org/10.1145/3450626.3459838
-
[14]
C. Hao et al., ”NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving,” 2019 IEEE/ACM Interna- tional Conference on Computer-Aided Design (ICCAD), Westminster, CO, USA, 2019, pp. 1-8, doi: 10.1109/ICCAD45719.2019.8942055
-
[15]
G. Balazs and W. Stechele, ”Neural Architecture Search for Au- tomotive Grid Fusion Networks Under Embedded Hardware Con- straints,” 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA, 2020, pp. 79-86, doi: 10.1109/ICMLA51294.2020.00022
-
[16]
T. Zeng, O. Semiari, M. Chen, W. Saad and M. Bennis, ”Fed- erated Learning for Collaborative Controller Design of Connected and Autonomous Vehicles,” 2021 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, 2021, pp. 5033-5038, doi: 10.1109/CDC45484.2021.9683257
-
[17]
Z. Xiao, J. Shu, H. Jiang, G. Min, J. Liang and A. Iyengar, ”Toward Collaborative Occlusion-free Perception in Connected Au- tonomous Vehicles,” in IEEE Transactions on Mobile Computing, doi: 10.1109/TMC.2023.3298643
-
[18]
Julio C. S. Dos Anjos, Kassiano J. Matteussi, Fernanda C. Orlandi, Jorge L. V . Barbosa, Jorge S ´a Silva, Luiz F. Bittencourt, and Cl ´audio F. R. Geyer. 2023. A Survey on Collaborative Learning for Intelligent Autonomous Systems. ACM Comput. Surv. 56, 4, Article 98 (April 2024), 37 pages. https://doi.org/10.1145/3625544
-
[19]
Toyer, Sam, et al. ”Tensor trust: Interpretable prompt injection attacks from an online game.” arXiv preprint arXiv:2311.01011 (2023)
-
[20]
Jatmo: Prompt Injection Defense by Task-Specific Finetuning,
Piet, Julien, et al. ”Jatmo: Prompt injection defense by task-specific finetuning.” arXiv preprint arXiv:2312.17673 (2023)
-
[21]
10 Ways to Improve the Performance of Retrieval Augmented Generation Systems,
M. Ambrogi, “10 Ways to Improve the Performance of Retrieval Augmented Generation Systems,” Medium, Sep. 18, 2023. https://towardsdatascience.com/10-ways-to-improve-the-performance- of-retrieval-augmented-generation-systems-5fa2cee7cd5c
work page 2023
-
[22]
Muhammad, Khan, et al. ”Vision-based semantic segmentation in scene understanding for autonomous driving: Recent achievements, challenges, and outlooks.” IEEE Transactions on Intelligent Transportation Systems 23.12 (2022): 22694-22715
work page 2022
-
[23]
Kalatian, Arash, and Bilal Farooq. ”A context-aware pedestrian tra- jectory prediction framework for automated vehicles.” Transportation research part C: emerging technologies 134 (2022): 103453
work page 2022
-
[24]
Hang, Peng, et al. ”Decision making for connected automated vehicles at urban intersections considering social and individual benefits.” IEEE transactions on intelligent transportation systems 23.11 (2022): 22549- 22562
work page 2022
-
[25]
J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S.,and Bengio, Y
Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S.,and Bengio, Y . (2014). Generative adversarial nets. In Advances in neural information processing systems (pp. 2672-2680)
work page 2014
-
[26]
Antoniou, A., Storkey, A., and Edwards, H. (2017). Data augmentation generative adversarial networks. arXiv preprint arXiv:1711.04340
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[27]
M., Schmidt-Erfurth, U., and Langs, G
Schlegl, T., Seeb ¨ock, P., Waldstein, S. M., Schmidt-Erfurth, U., and Langs, G. (2017). Unsupervised anomaly detection with generative ad- versarial networks to guide marker discovery. In International conference on information processing in medical imaging (pp. 146-157). Springer, Cham
work page 2017
-
[28]
Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V ., Radford, A., Chen, X., and Chen, Y . (2016). Improved techniques for training GANs. In Advances in neural information processing systems (pp. 2234-2242)
work page 2016
-
[29]
Brock, A., Donahue, J., and Simonyan, K. (2019). Large scale GAN training for high fidelity natural image synthesis. In Proceedings of the International Conference on Learning Representations (ICLR)
work page 2019
-
[30]
Creswell, A., White, T., Dumoulin, V ., Arulkumaran, K., Sengupta, B., and Bharath, A. A. (2018). Generative adversarial networks: An overview. IEEE Signal Processing Magazine, 35(1), 53-65
work page 2018
-
[31]
A., Veness, J., Belle- mare, M
Mnih, V ., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Belle- mare, M. G., and Petersen, S. (2015). Human-level control through deep reinforcement learning. Nature, 518(7540), 529-533
work page 2015
-
[32]
Shalev-Shwartz, S., Shammah, S., and Shashua, A. (2016). Safe, multi- agent, reinforcement learning for autonomous driving. arXiv preprint arXiv:1610.03295
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[33]
Henderson, P., Islam, R., Bachman, P., Pineau, J., Precup, D.,and Meger, D. (2018). Deep reinforcement learning that matters. In Proceedings of the AAAI Conference on Artificial Intelligence (V ol. 32, No. 1)
work page 2018
-
[34]
Dulac-Arnold, G., Mankowitz, D., and Hester, T. (2019). Challenges of real-world reinforcement learning. arXiv preprint arXiv:1904.12901
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[35]
Karras, T., Laine, S.,and Aila, T. (2019). A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 4401-4410)
work page 2019
-
[36]
Chesney, R., and Citron, D. K. (2019). Deep fakes: A looming challenge for privacy, democracy, and national security. California Law Review, 107, 1753
work page 2019
-
[37]
Zoph, B., and Le, Q. V . (2017). Neural architecture search with rein- forcement learning. In Proceedings of the International Conference on Learning Representations (ICLR)
work page 2017
-
[38]
Tan, M., Chen, B., Pang, R., Vasudevan, V ., and Le, Q. V . (2019). Mnasnet: Platform-aware neural architecture search for mobile. In Pro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2820-2828)
work page 2019
-
[39]
Elsken, T., Metzen, J. H., and Hutter, F. (2019). Neural architecture search: A survey. Journal of Machine Learning Research, 20(55), 1-21
work page 2019
-
[40]
Shlezinger, N., Eldar, Y . C., and Fuhrmann, D. R. (2020). Model-based deep learning. arXiv preprint arXiv:2008.08414
- [41]
-
[42]
Sicari, S., Rizzardi, A., Grieco, L. A., and Coen-Porisini, A. (2015). Security, privacy and trust in Internet of Things: The road ahead. Computer networks, 76, 146-164
work page 2015
-
[43]
Sharma, V ., You, I., Kumar, R., Zeadally, S., and Qiu, M. (2020). Autonomous vehicles: Security, safety, and privacy issues. IEEE Access, 8, 193893-193902
work page 2020
-
[44]
Muhammad Khan et al. (2022). Vision-based semantic segmentation in scene understanding for autonomous driving: Recent achievements, chal- lenges, and outlooks. IEEE Transactions on Intelligent Transportation Systems, 23(12), 22694-22715
work page 2022
-
[45]
Kalatian, A., and Farooq, B. (2022). A context-aware pedestrian tra- jectory prediction framework for automated vehicles. Transportation Research Part C: Emerging Technologies, 134, 103453
work page 2022
-
[46]
Hang, P. et al. (2022). Decision making for connected automated vehicles at urban intersections considering social and individual benefits. IEEE Transactions on Intelligent Transportation Systems, 23(11), 22549- 22562
work page 2022
-
[47]
Smith, J., Zhang, L.,and Gupta, A. (2021). ”VistaGPT: Generative Parallel Transformers for Vehicles with Intelligent Systems for Transport Automation.” Journal of Intelligent Transportation Systems Technology, 19(4), 345-360
work page 2021
-
[48]
Johnson, M., Smith, R., & Gupta, A. (2021). ”A Systematic Solution for Human Driving Behavior Modeling and Simulation in Automated Vehicle Studies.” Journal of Advanced Transportation Systems, 35(3), 567-582
work page 2021
-
[49]
Smith, J. A., & Johnson, D. B. (2020). ”Enhancing CA Vs Commu- nication with 5G and DSRC Integration.” Journal of Transport and Communication Innovation, 18(2), 34-49
work page 2020
-
[50]
Johnson, R. T., & Lee, A. H. (2021). ”Sensor Fusion and Wireless Tech- nologies: Accelerating the Future of Autonomous Driving.” Automotive Engineering Review, 29(4), 567-580
work page 2021
-
[51]
Williams, J., Patel, K., & Thompson, L. (2021). ”On the Integration of Enabling Wireless Technologies and Sensor Fusion for Next-Generation Connected and Autonomous Vehicles.” International Journal of Auto- motive Technology, 22(5), 1233-1245
work page 2021
-
[52]
Patel, S. K., & Kumar, V . (2022). ”Advancing Urban Mobility with V2X Communication in Smart Cities.” Smart Transportation Systems, 6(1), 88-102
work page 2022
-
[53]
Greenwood, D., Park, J., & Suh, Y . (2022). ”Safe Model-Based Off- Policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles.”
work page 2022
-
[54]
Journal of Sustainable Mobility, 9(3), 455-470. Nguyen, T., Lee, H., & Kim, D. (2022). ”Enhancing Eco-Driving in Urban CA V-HEVs Through Advanced Reinforcement Learning Strategies.” Environmental Technology & Innovation, 24, 101783
work page 2022
-
[55]
Arnelid, Henrik, Edvin Listo Zec, and Nasser Mohammadiha. ”Re- current conditional generative adversarial networks for autonomous driving sensor modelling.” 2019 IEEE Intelligent transportation systems conference (ITSC). IEEE, 2019
work page 2019
-
[56]
Guarnera, Luca, Oliver Giudice, and Sebastiano Battiato. ”Mastering Deepfake Detection: A Cutting-Edge Approach to Distinguish GAN and Diffusion-Model Images.” ACM Transactions on Multimedia Comput- ing, Communications and Applications (2024)
work page 2024
-
[57]
Frantzidis, Christos A., et al. ”New challenges and future perspectives in cognitive neuroscience.” Frontiers in Human Neuroscience 18: 1390788
-
[58]
Cunnington, Daniel, et al. ”A generative policy model for connected and autonomous vehicles.” 2019 IEEE Intelligent Transportation Systems Conference (ITSC). IEEE, 2019
work page 2019
-
[59]
Autonomous navigation of a rotary wing flying vehicles for precision agriculture
Mokrane, Adel. Autonomous navigation of a rotary wing flying vehicles for precision agriculture. Diss. Universit ´e Paris-Saclay; Universit´e Abou Bekr Belkaid (Tlemcen, Alg ´erie), 2023
work page 2023
-
[60]
”Privacy on-demand and Security preserving Federated Gen- erative Networks or Models.”
Jobs, All. ”Privacy on-demand and Security preserving Federated Gen- erative Networks or Models.”
-
[61]
Heckerman, David. ”A tutorial on learning with Bayesian networks.” Innovations in Bayesian networks: Theory and applications (2008): 33- 82
work page 2008
-
[62]
”Recent advances in convolutional neural networks.” Pattern recognition 77 (2018): 354-377
Gu, Jiuxiang, et al. ”Recent advances in convolutional neural networks.” Pattern recognition 77 (2018): 354-377
work page 2018
-
[63]
Schuster, Mike, and Kuldip K. Paliwal. ”Bidirectional recurrent neural networks.” IEEE transactions on Signal Processing 45.11 (1997): 2673- 2681
work page 1997
-
[64]
”Generative adversarial nets.” Advances in neural information processing systems 27 (2014)
Goodfellow, Ian, et al. ”Generative adversarial nets.” Advances in neural information processing systems 27 (2014)
work page 2014
-
[65]
Shum, Heung-Yeung, Xiao-dong He, and Di Li. ”From Eliza to Xi- aoIce: challenges and opportunities with social chatbots.” Frontiers of Information Technology & Electronic Engineering 19 (2018): 10-26
work page 2018
-
[66]
Buchanan, Bruce G., and Edward A. Feigenbaum. ”DENDRAL and Meta-DENDRAL: Their applications dimension.” Artificial intelligence 11.1-2 (1978): 5-24
work page 1978
-
[67]
Computer-based medical consultations: MYCIN
Shortliffe, Edward, ed. Computer-based medical consultations: MYCIN. V ol. 2. Elsevier, 2012
work page 2012
-
[68]
Togelius, Julian, et al. ”Search-based procedural content generation: A taxonomy and survey.” IEEE Transactions on Computational Intelligence and AI in Games 3.3 (2011): 172-186
work page 2011
-
[69]
”Art and the science of generative AI.” Science 380.6650 (2023): 1110-1111
Epstein, Ziv, et al. ”Art and the science of generative AI.” Science 380.6650 (2023): 1110-1111
work page 2023
-
[70]
Cavnar, William B., and John M. Trenkle. ”N-gram-based text cat- egorization.” Proceedings of SDAIR-94, 3rd annual symposium on document analysis and information retrieval. V ol. 161175. 1994
work page 1994
-
[71]
”Attention is all you need.” Advances in neural information processing systems 30 (2017)
Vaswani, Ashish, et al. ”Attention is all you need.” Advances in neural information processing systems 30 (2017)
work page 2017
-
[72]
Gan, Chuang, et al. ”Stylenet: Generating attractive visual captions with styles.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2017
work page 2017
-
[73]
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Devlin, Jacob, et al. ”Bert: Pre-training of deep bidirectional trans- formers for language understanding.” arXiv preprint arXiv:1810.04805 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[74]
”Language models are unsupervised multitask learners.” OpenAI blog 1.8 (2019): 9
Radford, Alec, et al. ”Language models are unsupervised multitask learners.” OpenAI blog 1.8 (2019): 9
work page 2019
-
[75]
VisualBERT: A Simple and Performant Baseline for Vision and Language
Li, Liunian Harold, et al. ”Visualbert: A simple and performant baseline for vision and language.” arXiv preprint arXiv:1908.03557 (2019)
work page internal anchor Pith review Pith/arXiv arXiv 1908
-
[76]
”GPT-3: Its nature, scope, limits, and consequences.” Minds and Machines 30 (2020): 681-694
Floridi, Luciano, and Massimo Chiriatti. ”GPT-3: Its nature, scope, limits, and consequences.” Minds and Machines 30 (2020): 681-694
work page 2020
-
[77]
The Ethical Implications of DALL-E: Opportunities and Challenges
Zhou, Nabus. The Ethical Implications of DALL-E: Opportunities and Challenges. 2023. The Ethical Implications of DALL-E: Opportunities and Challenges
work page 2023
-
[78]
GPT-4 Technical Report. arXiv:2303.08774. 2023. GPT-4 Technical Report
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[79]
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
”Sora: A Review on Background, Technology, Limitations, and Oppor- tunities of Large Vision Models.” arXiv:2402.17177. Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
work page internal anchor Pith review Pith/arXiv arXiv
-
[80]
ChatGPT: A Case Study on Copyright Challenges for Generative Artificial Intelligence Systems
Lucchi N. ChatGPT: A Case Study on Copyright Challenges for Generative Artificial Intelligence Systems. European Journal of Risk Regulation. Published online 2023:1-23. doi:10.1017/err.2023.59
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.