End to End Learning for Self-Driving Cars

Beat Flepp; Bernhard Firner; Daniel Dworakowski; Davide Del Testa; Jake Zhao; Jiakai Zhang; Karol Zieba; Lawrence D. Jackel; Mariusz Bojarski; Mathew Monfort

arxiv: 1604.07316 · v1 · submitted 2016-04-25 · 💻 cs.CV · cs.LG· cs.NE

End to End Learning for Self-Driving Cars

Mariusz Bojarski , Davide Del Testa , Daniel Dworakowski , Bernhard Firner , Beat Flepp , Prasoon Goyal , Lawrence D. Jackel , Mathew Monfort

show 5 more authors

Urs Muller Jiakai Zhang Xin Zhang Jake Zhao Karol Zieba

This is my paper

Pith reviewed 2026-05-12 23:19 UTC · model grok-4.3

classification 💻 cs.CV cs.LGcs.NE

keywords end-to-end learningconvolutional neural networkssteering predictionautonomous drivingdeep learningcomputer visionself-driving carsneural network control

0 comments

The pith

A convolutional neural network maps raw front-camera pixels directly to steering commands.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that a single convolutional neural network can be trained to take images from one forward-facing camera and output steering angles for a car. Training uses only human driving recordings, and the network discovers internal features like road edges on its own rather than receiving explicit labels for them. The system handles marked and unmarked roads, highways, parking lots, and unpaved surfaces at real-time speeds. If the mapping generalizes, it removes the need to hand-design separate stages for perception, planning, and control. Readers may care because the method replaces a chain of engineered modules with one jointly optimized network.

Core claim

We trained a convolutional neural network to map raw pixels from a single front-facing camera directly to steering commands. This end-to-end approach proved surprisingly powerful. With minimum training data from humans the system learns to drive in traffic on local roads with or without lane markings and on highways. It also operates in areas with unclear visual guidance such as in parking lots and on unpaved roads. The system automatically learns internal representations of the necessary processing steps such as detecting useful road features with only the human steering angle as the training signal. We never explicitly trained it to detect, for example, the outline of roads. Compared to an

What carries the argument

End-to-end convolutional neural network that converts single-camera pixel input straight into steering-angle output while jointly optimizing all internal steps.

If this is right

Internal components self-optimize for overall driving performance instead of human-chosen intermediate goals such as accurate lane-marking detection.
The complete system requires fewer processing stages and therefore can be smaller than pipelines that separate perception from control.
The same network can operate on local roads without markings, highways, parking lots, and unpaved surfaces after training on modest amounts of human data.
The learned mapping runs at 30 frames per second on automotive-grade hardware.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same direct-mapping idea could replace modular stacks in other sensor-to-action tasks where human demonstrations are cheap to record.
Safety arguments would then shift from verifying each submodule to verifying that the training distribution covers the full operating envelope.
Extending the input to include additional sensors or temporal context would be a direct next test of whether the single-network approach scales.

Load-bearing premise

Images collected while humans drive already contain enough examples of every situation the car will meet later, so the learned mapping stays safe without extra safety layers.

What would settle it

A controlled test in which the trained car is driven through a road configuration or lighting condition absent from the human-collected training set and is observed to produce incorrect steering.

read the original abstract

We trained a convolutional neural network (CNN) to map raw pixels from a single front-facing camera directly to steering commands. This end-to-end approach proved surprisingly powerful. With minimum training data from humans the system learns to drive in traffic on local roads with or without lane markings and on highways. It also operates in areas with unclear visual guidance such as in parking lots and on unpaved roads. The system automatically learns internal representations of the necessary processing steps such as detecting useful road features with only the human steering angle as the training signal. We never explicitly trained it to detect, for example, the outline of roads. Compared to explicit decomposition of the problem, such as lane marking detection, path planning, and control, our end-to-end system optimizes all processing steps simultaneously. We argue that this will eventually lead to better performance and smaller systems. Better performance will result because the internal components self-optimize to maximize overall system performance, instead of optimizing human-selected intermediate criteria, e.g., lane detection. Such criteria understandably are selected for ease of human interpretation which doesn't automatically guarantee maximum system performance. Smaller networks are possible because the system learns to solve the problem with the minimal number of processing steps. We used an NVIDIA DevBox and Torch 7 for training and an NVIDIA DRIVE(TM) PX self-driving car computer also running Torch 7 for determining where to drive. The system operates at 30 frames per second (FPS).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is the first public-scale demo of a CNN mapping single-camera pixels to steering on real roads and highways, but the results stay qualitative with no failure metrics or baselines.

read the letter

The punchline is that a single CNN trained only on human steering angles can drive a car across local roads, highways, parking lots, and unpaved surfaces from raw front-camera images. That end-to-end mapping was new at the time and showed the approach could work at driving speeds without separate lane detectors or planners. They collected human data, trained in Torch on an NVIDIA box, and ran inference at 30 FPS on their DRIVE PX hardware, which makes the claim concrete rather than purely theoretical. The paper does well by arguing that joint optimization of all steps can beat hand-engineered pipelines and by keeping the supervision minimal—just the steering signal. That simplicity is the real strength here. The soft spots are in the evidence. The description claims successful real-world operation but supplies no numbers on miles driven, intervention rates, error distributions, or comparisons to any baseline. Without those, it is difficult to assess how often the system fails or how much the training distribution actually covers test conditions. Generalization is acknowledged as the limit but not tested with held-out routes or stress cases in any detail. The math is absent because this is an empirical demonstration, and the citations are appropriate for the prior imitation-learning work they build on. This paper is for researchers exploring imitation learning or simpler AV architectures. A reader who wants to see what end-to-end driving looked like at scale will find it useful. It deserves serious referee time because the demonstration is genuine and the idea has held up in later work, even though the current version would benefit from quantitative evaluation.

Referee Report

1 major / 2 minor

Summary. The paper claims that a convolutional neural network can be trained end-to-end to map raw pixels from a single front-facing camera directly to steering commands. With minimal human-collected training data, the system learns to drive in traffic on local roads (with or without lane markings), highways, parking lots, and unpaved roads. It automatically discovers internal representations for road features using only steering angles as supervision, operates at 30 FPS on NVIDIA DRIVE PX hardware, and is argued to be more efficient than pipelines that separately handle lane detection, path planning, and control.

Significance. If the results hold, the work is significant as an early empirical demonstration that joint optimization of perception and control via deep learning can produce a functional real-world driving system without hand-engineered intermediate modules. It provides a concrete baseline for end-to-end autonomous driving research, shows real-time inference feasibility on embedded hardware, and highlights the potential for smaller, self-optimizing networks. Credit is due for the use of actual driving data and successful deployment across varied environments.

major comments (1)

[Abstract and real-world testing description] Abstract and real-world testing description: the central claim of successful operation on local roads, highways, parking lots, and unpaved surfaces is presented without any quantitative metrics (e.g., steering prediction error, autonomous distance driven, intervention rate, or failure cases). This is load-bearing for assessing generalization from the training distribution of human steering data.

minor comments (2)

[Training procedure] The manuscript would benefit from a short table or paragraph summarizing the training data volume, collection protocol, and any augmentation steps, as these details directly affect reproducibility of the reported generalization.
[System implementation] Clarify whether the 30 FPS figure refers to inference only or includes any preprocessing; this affects the practicality claim for real-time operation.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the positive assessment of the work's significance and the recommendation for minor revision. We address the single major comment point by point below.

read point-by-point responses

Referee: Abstract and real-world testing description: the central claim of successful operation on local roads, highways, parking lots, and unpaved surfaces is presented without any quantitative metrics (e.g., steering prediction error, autonomous distance driven, intervention rate, or failure cases). This is load-bearing for assessing generalization from the training distribution of human steering data.

Authors: We acknowledge that the abstract and the description of real-world operation are presented qualitatively. The manuscript's core contribution is the demonstration that a CNN can be trained end-to-end to produce steering commands directly from camera images, with internal features emerging automatically from steering supervision alone. The listed environments (local roads with/without markings, highways, parking lots, unpaved roads) were chosen precisely to illustrate generalization beyond the training distribution, as the network was never explicitly trained on lane outlines or other hand-engineered features. Quantitative metrics such as closed-loop steering error, autonomous distance, or intervention counts are not included because the evaluation was a proof-of-concept deployment with a safety driver present; defining and measuring 'intervention' or 'failure' in a reproducible way would require a separate, controlled benchmarking protocol that lies outside the paper's scope. Training-set steering prediction error is discussed in the experimental sections, but real-world closed-loop performance is inherently harder to quantify without additional instrumentation. We therefore do not view the absence of these numbers as undermining the central claim, which concerns the viability of the end-to-end paradigm rather than a head-to-head system benchmark. No revision is planned on this point. revision: no

Circularity Check

0 steps flagged

No circularity: empirical end-to-end training with external validation

full rationale

The paper reports an empirical demonstration: a CNN is trained on human-collected front-camera images paired with steering angles, then evaluated by real-world driving performance on held-out routes. No equations, uniqueness theorems, or derivations are presented that could reduce a claimed prediction to a fitted input by construction. The central argument (end-to-end optimization yields better performance than modular pipelines) is a qualitative claim supported by the observed behavior, not by any self-referential definition or self-citation chain. The generalization assumption is acknowledged as a practical limit but does not create an internal circular step within the described construction.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard supervised learning assumptions plus the domain assumption that human steering data is a sufficient training signal for safe driving.

free parameters (2)

CNN architecture and hyperparameters
Number of layers, filter sizes, learning rate, and data augmentation choices selected to achieve the reported behavior.
Training data collection protocol
Specific routes, times of day, and driver behavior used to gather the human steering examples.

axioms (1)

domain assumption The visual-to-steering mapping is learnable from finite human driving data
Invoked when claiming the network will operate on unseen roads and conditions.

pith-pipeline@v0.9.0 · 5604 in / 1212 out tokens · 26662 ms · 2026-05-12T23:19:46.034799+00:00 · methodology

discussion (0)

Forward citations

Cited by 60 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Parameterized Hardness of Zonotope Containment and Neural Network Verification
cs.CC 2025-09 unverdicted novelty 8.0

The paper proves W[1]-hardness parameterized by dimension d for positivity, zonotope containment, max approximation, and L_p-Lipschitz constants in 2- and 3-layer ReLU networks, showing enumeration methods are optimal...
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
cs.RO 2023-03 accept novelty 8.0

Diffusion Policy models robot actions as a conditional diffusion process, outperforming prior state-of-the-art methods by 46.9% on average across 12 manipulation tasks from four benchmarks.
LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging
cs.LG 2026-05 unverdicted novelty 7.0

LOSCAR-SGD combines local updates, sparse model averaging, and communication-computation overlap with a delay-corrected merge rule, providing convergence rates for smooth non-convex objectives under worker heterogeneity.
Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method
cs.LG 2026-05 unverdicted novelty 7.0

Ringmaster LMO extends delay-thresholding from ASGD to LMO-based momentum updates, providing convergence guarantees under (L0, L1)-smoothness and time-complexity bounds that recover optimal rates in the Euclidean case.
4DLidarOpen: An Open 4D FMCW Lidar Dataset for Motion-Aware Autonomous Driving
cs.RO 2026-05 unverdicted novelty 7.0

4DLidarOpen is a new open dataset providing synchronized 4D FMCW Lidar velocity measurements, multi-Lidar and camera data, and 3D bounding-box annotations with track IDs to support benchmarks on 3D detection, BEV segm...
Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations
cs.RO 2026-05 unverdicted novelty 7.0

Bench2Drive-Robust is a new closed-loop benchmark that evaluates end-to-end autonomous driving models under deployment perturbations from camera failures, ego-state errors, and compute delays, showing substantial perf...
Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling
cs.LG 2026-05 unverdicted novelty 7.0

DRATS derives a minimax objective from a feasibility formulation of MTRL to adaptively sample tasks with the largest return gaps, leading to better worst-task performance on MetaWorld benchmarks.
Optimality of Sub-network Laplace Approximations: New Results and Methods
stat.ML 2026-05 conditional novelty 7.0

Sub-network Laplace approximations always underestimate full-model predictive variance, and two new gradient-based and greedy selection rules provide theoretically grounded improvements.
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving
cs.RO 2026-05 unverdicted novelty 7.0

ReflectDrive-2 achieves 91.0 PDMS on NAVSIM with camera input by training a discrete diffusion model to self-edit trajectories via RL-aligned AutoEdit.
TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations
cs.LG 2026-05 unverdicted novelty 7.0

TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.
Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation
cs.LG 2026-05 unverdicted novelty 7.0

LHSD uses spectral filtering on the log-density Hessian to isolate tangent directions from noise and estimate local intrinsic dimension scalably via Stochastic Lanczos Quadrature.
ST-BCP: Tightening Coverage Bound for Backward Conformal Prediction via Non-Conformity Score Transformation
stat.ML 2026-02 conditional novelty 7.0

ST-BCP tightens the coverage bound in Backward Conformal Prediction by applying a computable data-dependent transformation to nonconformity scores, reducing the average gap from 4.20% to 1.12% on benchmarks while prov...
Extremal Contours: Gradient-driven contours for compact visual attribution
cs.CV 2025-11 unverdicted novelty 7.0

A training-free method using Fourier-parameterized star-convex contours optimized via gradients to generate compact, faithful visual attributions for image classifiers on benchmarks like ImageNet.
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
cs.RO 2025-06 unverdicted novelty 7.0

DSRL steers pretrained diffusion policies for robotics by applying RL to their latent noise inputs, achieving sample-efficient real-world adaptation with only black-box access.
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals
cs.LG 2026-05 unverdicted novelty 6.0

Dywave applies wavelet-based hierarchical decomposition to build dynamic, event-aligned tokens for heterogeneous IoT signals, cutting token length by up to 75% while raising accuracy up to 12% on sequence models.
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals
cs.LG 2026-05 unverdicted novelty 6.0

Dywave uses wavelet hierarchical decomposition to create event-aligned compact token sequences for heterogeneous IoT signals, yielding up to 12% accuracy gains and 75% shorter inputs on mainstream sequence models acro...
Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction
cs.CV 2026-05 unverdicted novelty 6.0

Smaller end-to-end autonomous driving models achieve optimal 3-second trajectory prediction accuracy at lower or intermediate temporal sampling frequencies, whereas larger VLA-style models perform best at the highest ...
Ensemble Distributionally Robust Bayesian Optimisation
cs.LG 2026-05 unverdicted novelty 6.0

A tractable ensemble distributionally robust Bayesian optimization method achieves improved sublinear regret bounds under context uncertainty.
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving
cs.RO 2026-05 unverdicted novelty 6.0

ReflectDrive-2 combines masked discrete diffusion with RL-aligned self-editing to generate and refine driving trajectories, reaching 91.0 PDMS on NAVSIM camera-only and 94.8 in best-of-6.
OGPO: Sample Efficient Full-Finetuning of Generative Control Policies
cs.LG 2026-05 unverdicted novelty 6.0

OGPO is a sample-efficient off-policy method for full finetuning of generative control policies that reaches SOTA on robotic manipulation tasks and can recover from poor behavior-cloning initializations without expert data.
Empirical Insights of Test Selection Metrics under Multiple Testing Objectives and Distribution Shifts
cs.SE 2026-04 unverdicted novelty 6.0

A broad empirical benchmark shows how 15 existing test selection metrics perform for fault detection, performance estimation, and retraining under corrupted, adversarial, temporal, natural, and label shifts across ima...
FingerViP: Learning Real-World Dexterous Manipulation with Fingertip Visual Perception
cs.RO 2026-04 conditional novelty 6.0

FingerViP equips each finger with a miniature camera and trains a multi-view diffusion policy that achieves 80.8% success on real-world dexterous tasks previously limited by wrist-camera occlusion.
MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving
cs.RO 2026-04 unverdicted novelty 6.0

MVAdapt conditions end-to-end autonomous driving policies on explicit vehicle physics to achieve better zero-shot transfer and few-shot calibration across different vehicles in CARLA simulation.
Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems
cs.LG 2026-04 unverdicted novelty 6.0

MOSAIC is a scaling-aware data selection framework that outperforms baselines in training end-to-end autonomous driving planners, achieving comparable or better EPDMS scores with up to 80% less data.
Safety-Aligned 3D Object Detection: Single-Vehicle, Cooperative, and End-to-End Perspectives
cs.CV 2026-04 unverdicted novelty 6.0

Safety-aware metrics and losses for 3D detection improve critical error handling in autonomous vehicle perception across single-vehicle, cooperative, and end-to-end settings.
SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space
cs.RO 2026-03 unverdicted novelty 6.0

SutureFormer models needle tip movement in video as sequential pixel-space actions via goal-conditioned offline RL with spline-based reward densification, cutting average displacement error by 58.6% on a new 1,158-tra...
Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving
cs.RO 2026-02 unverdicted novelty 6.0

The paper introduces Hyper Diffusion Planner (HDP), a diffusion-based E2E AD framework that identifies insights on loss space, trajectory representation and data scaling, adds RL post-training, and reports 10x perform...
LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving
cs.CV 2025-12 accept novelty 6.0

Reducing expert-student asymmetries in visibility, uncertainty, and route specification enables a new TransFuser v6 policy that reaches 95 DS on Bench2Drive and more than doubles prior scores on Longest6 v2 and Town13.
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
cs.RO 2025-10 conditional novelty 6.0

Alpamayo-R1 introduces a VLA model with a Chain of Causation dataset and multi-stage SFT-plus-RL training that reports 12% better planning accuracy and 35% fewer close encounters versus trajectory-only baselines in dr...
Automated Test Validators for Flaky Cyber-Physical System Simulators: Approach and Evaluation
cs.SE 2025-08 unverdicted novelty 6.0

Test validators generated via genetic programming using the Ochiai SBFL formula are more accurate and robust to flakiness than alternatives from Tarantula, Naish, decision trees, or rules, with 88.7% alignment to know...
EMMA: End-to-End Multimodal Model for Autonomous Driving
cs.CV 2024-10 unverdicted novelty 6.0

EMMA is an end-to-end multimodal LLM that converts camera data into trajectories, objects, and road graphs via text prompts and reports state-of-the-art motion planning on nuScenes plus competitive detection results on Waymo.
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
cs.AI 2024-08 conditional novelty 6.0

Empirical analysis shows scaling inference compute via strategies like tree search can be more efficient than scaling model parameters, with 7B models plus novel search outperforming 34B models.
Octo: An Open-Source Generalist Robot Policy
cs.RO 2024-05 unverdicted novelty 6.0

Octo is an open-source transformer-based generalist robot policy pretrained on 800k trajectories that serves as an effective initialization for finetuning across diverse robotic platforms.
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
cs.RO 2024-02 conditional novelty 6.0

3D Diffuser Actor unifies diffusion policies with 3D scene features to set new state-of-the-art results on RLBench and CALVIN robot benchmarks.
GPT-Driver: Learning to Drive with GPT
cs.CV 2023-10 conditional novelty 6.0

GPT-3.5 is turned into an autonomous-vehicle motion planner by representing driving scenes and trajectories as language tokens and applying a prompting-reasoning-finetuning pipeline, with results shown on nuScenes.
Emergence of Exploratory Look-Around Behaviors through Active Observation Completion
cs.CV 2019-06 unverdicted novelty 6.0

An RL agent learns to actively explore by being rewarded for inferring unobserved scene parts after short glimpse sequences, with sidekick policy learning enabling generalization to other active perception tasks.
Rules of the Road: Predicting Driving Behavior with a Convolutional Model of Semantic Interactions
cs.CV 2019-06 unverdicted novelty 6.0

A grid-based convolutional architecture fuses semantic maps and 3D perceptions to model driving interactions and predict future agent states, evaluated on a new industry-grade dataset.
Lost in Fog: Sensor Perturbations Expose Reasoning Fragility in Driving VLAs
cs.RO 2026-05 unverdicted novelty 5.0

Sensor perturbations in driving VLAs cause Chain-of-Causation reasoning changes that correlate strongly with 5.3x higher trajectory deviation, while enabling such reasoning improves accuracy by 11.8%.
Anomaly-Informed Confidence Calibration for Vision-Based Safety Prediction
cs.RO 2026-05 unverdicted novelty 5.0

Fusing perceptual and dynamics anomaly scores enables online temperature scaling that cuts expected calibration error by 37% on physical DonkeyCar tests with four unseen anomaly types.
C-CoT: Counterfactual Chain-of-Thought with Vision-Language Models for Safe Autonomous Driving
cs.CV 2026-05 unverdicted novelty 5.0

C-CoT applies VLMs to autonomous driving via five-stage reasoning with a meta-action tree for counterfactuals, yielding 81.9% risk recall, 3.52% collision rate, and 1.98 m L2 error on a new dataset.
Rennala MVR: Improved Time Complexity for Parallel Stochastic Optimization via Momentum-Based Variance Reduction
math.OC 2026-05 unverdicted novelty 5.0

Rennala MVR improves time complexity over Rennala SGD for smooth nonconvex stochastic optimization in heterogeneous parallel systems under a mean-squared smoothness assumption.
InterFuserDVS: Event-Enhanced Sensor Fusion for Safe RL-Based Decision Making
cs.CV 2026-05 unverdicted novelty 5.0

Integrating DVS event data into InterFuser through token fusion yields a driving score of 77.2 and 100% route completion on CARLA benchmarks, indicating improved robustness in dynamic conditions.
UniAda: Universal Adaptive Multi-objective Adversarial Attack for End-to-End Autonomous Driving Systems
cs.SE 2026-04 unverdicted novelty 5.0

UniAda introduces a white-box multi-objective attack using adaptive weighting to generate perturbations that jointly affect steering and speed in E2E ADS, outperforming benchmarks with average deviations of 3.54-29 de...
MetaErr: Towards Predicting Error Patterns in Deep Neural Networks
cs.CV 2026-04 unverdicted novelty 5.0

MetaErr introduces a meta-model that forecasts per-sample prediction errors in deep neural networks solely from base model performance observations, outperforming baselines and boosting pseudo-labeling on three comput...
End-to-End ILC for Repetitive Untrackable Tasks: A Cooperative Game Perspective
eess.SY 2026-04 unverdicted novelty 5.0

An end-to-end ILC for untrackable repetitive tasks is formulated as a cooperative game between reference and feedforward updates, yielding a sufficient condition for lower cost than norm-optimal ILC.
Artificial Intelligence for Modeling and Simulation of Mixed Automated and Human Traffic
cs.AI 2026-04 unverdicted novelty 5.0

This survey synthesizes AI techniques for mixed autonomy traffic simulation and introduces a taxonomy spanning agent-level behavior models, environment-level methods, and cognitive/physics-informed approaches.
State-Conditional Adversarial Learning: An Off-Policy Visual Domain Transfer Method for End-to-End Imitation Learning
cs.RO 2025-12 unverdicted novelty 5.0

SCAL derives an upper bound on target-domain imitation loss using source loss plus state-conditional latent KL divergence and aligns distributions via a discriminator-based adversarial estimator.
Reliable and Real-Time Highway Trajectory Planning via Hybrid Learning-Optimization Frameworks
cs.RO 2025-08 unverdicted novelty 5.0

Hybrid learning-optimization framework for highway trajectory planning that reports over 97% scenario success rate and 54 ms average cycle time on the HighD dataset while enforcing formal safety via MIQP.
Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings
cs.CV 2025-05 unverdicted novelty 5.0

TEA is a new targeted adversarial attack that incorporates edge information from the target image to reduce query count and improve performance in low-query black-box hard-label settings.
Survival of the Cheapest: Cost-Aware Hardware Adaptation for Adversarial Robustness
cs.CR 2024-09 unverdicted novelty 5.0

A decision-support framework applies AFT models to show Nvidia L4 GPUs yield 20% longer adversarial survival time at 75% lower cost than V100, with inference latency as the strongest robustness predictor.
Analyzing Adversarial Inputs in Deep Reinforcement Learning
cs.LG 2024-02 unverdicted novelty 5.0

Introduces the Adversarial Rate metric and associated tools to systematically evaluate and visualize the impact of adversarial inputs on DRL policies using formal verification.
PyTorch Distributed: Experiences on Accelerating Data Parallel Training
cs.DC 2020-06 accept novelty 5.0

PyTorch distributed data parallel attains near-linear scalability on 256 GPUs through gradient bucketing, computation-communication overlap, and selective synchronization skipping.
Towards Generalizing Sensorimotor Control Across Weather Conditions
cs.LG 2019-07 unverdicted novelty 5.0

A teacher-student framework with domain translation transfers steering control from one weather condition to multiple others using only source-domain labels.
NeuroTrajectory: A Neuroevolutionary Approach to Local State Trajectory Learning for Autonomous Vehicles
cs.RO 2019-06 unverdicted novelty 5.0

NeuroTrajectory is a neuroevolutionary method that trains deep neural networks via genetic algorithms to estimate multi-objective optimal state trajectories over a finite horizon for autonomous vehicle motion planning.
Real-Time Evaluation of Autonomous Systems under Adversarial Attacks
cs.AI 2026-05 unverdicted novelty 4.0

A framework trains and compares MLP, transformer, and GAIL-based trajectory models on real driving data, finding that architectural differences cause large variations in robustness to PGD attacks despite similar nomin...
Multimodal embodiment-aware navigation transformer
cs.RO 2026-04 unverdicted novelty 4.0

ViLiNT improves goal-conditioned navigation success rates by 166% on average over vision-only baselines across simulations and real rover tests by combining multimodal sensing with embodiment-conditioned diffusion tra...
Event-Centric World Modeling with Memory-Augmented Retrieval for Embodied Decision-Making
cs.LG 2026-04 unverdicted novelty 4.0

An event-centric framework encodes environments as semantic events and retrieves weighted prior maneuvers from a knowledge bank to enable interpretable, physics-aware decision-making for UAVs.
From Virtual Environments to Real-World Trials: Emerging Trends in Autonomous Driving
cs.AI 2026-03 unverdicted novelty 4.0

A survey organizes synthetic data use, digital twin simulation, and domain adaptation techniques for autonomous driving while identifying open challenges like Sim2Real transfer.
Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
cs.AI 2025-10 unverdicted novelty 4.0

A survey of physical AI that distinguishes theoretical physics reasoning from applied understanding and synthesizes advances in symbolic reasoning, embodied systems, and generative models to advocate for physics-groun...
ADAPS: Autonomous Driving Via Principled Simulations
cs.RO 2019-07 unverdicted novelty 4.0

ADAPS generates accident data via simulations and employs a memory-enabled hierarchical policy with efficient online learning to produce robust autonomous driving controllers tested in simulation.

Reference graph

Works this paper leans on

9 extracted references · 9 canonical work pages · cited by 64 Pith papers

[1]

LeCun, B

Y . LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Backprop- agation applied to handwritten zip code recognition. Neural Computation , 1(4):541–551, Winter 1989. URL: http://yann.lecun.org/exdb/publis/pdf/lecun-89e.pdf

work page 1989
[2]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classiﬁcation with deep convolutional neural networks. In F. Pereira, C. J. C. Burges, L. Bottou, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 25 , pages 1097–1105. Curran Associates, Inc., 2012. URL: http://papers.nips.cc/paper/ 4824-imagenet-classificat...

work page 2012
[3]

L. D. Jackel, D. Sharman, Stenard C. E., Strom B. I., , and D Zuckert. Optical character recognition for self-service banking. AT&T Technical Journal, 74(1):16–24, 1995

work page 1995
[4]

URL: http://www.image-net.org/ challenges/LSVRC/

Large scale visual recognition challenge (ILSVRC). URL: http://www.image-net.org/ challenges/LSVRC/

work page
[5]

Autonomous off-road vehicle control using end-to-end learning, July 2004

Net-Scale Technologies, Inc. Autonomous off-road vehicle control using end-to-end learning, July 2004. Final technical report. URL: http://net-scale.com/doc/net-scale-dave-report.pdf

work page 2004
[6]

Pomerleau

Dean A. Pomerleau. ALVINN, an autonomous land vehicle in a neural network. Technical report, Carnegie Mellon University, 1989. URL: http://repository.cmu.edu/cgi/viewcontent. cgi?article=2874&context=compsci

work page 1989
[7]

DARPA LAGR program

Wikipedia.org. DARPA LAGR program. http://en.wikipedia.org/wiki/DARPA_LAGR_ Program

work page
[8]

Trajectory planning for a four-wheel-steering vehicle

Danwei Wang and Feng Qi. Trajectory planning for a four-wheel-steering vehicle. In Proceedings of the 2001 IEEE International Conference on Robotics & Automation , May 21–26 2001. URL: http: //www.ntu.edu.sg/home/edwwang/confpapers/wdwicar01.pdf

work page 2001
[9]

URL: https://drive.google.com/open?id= 0B9raQzOpizn1TkRIa241ZnBEcjQ

DA VE 2 driving a lincoln. URL: https://drive.google.com/open?id= 0B9raQzOpizn1TkRIa241ZnBEcjQ. 9

work page

[1] [1]

LeCun, B

Y . LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Backprop- agation applied to handwritten zip code recognition. Neural Computation , 1(4):541–551, Winter 1989. URL: http://yann.lecun.org/exdb/publis/pdf/lecun-89e.pdf

work page 1989

[2] [2]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classiﬁcation with deep convolutional neural networks. In F. Pereira, C. J. C. Burges, L. Bottou, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 25 , pages 1097–1105. Curran Associates, Inc., 2012. URL: http://papers.nips.cc/paper/ 4824-imagenet-classificat...

work page 2012

[3] [3]

L. D. Jackel, D. Sharman, Stenard C. E., Strom B. I., , and D Zuckert. Optical character recognition for self-service banking. AT&T Technical Journal, 74(1):16–24, 1995

work page 1995

[4] [4]

URL: http://www.image-net.org/ challenges/LSVRC/

Large scale visual recognition challenge (ILSVRC). URL: http://www.image-net.org/ challenges/LSVRC/

work page

[5] [5]

Autonomous off-road vehicle control using end-to-end learning, July 2004

Net-Scale Technologies, Inc. Autonomous off-road vehicle control using end-to-end learning, July 2004. Final technical report. URL: http://net-scale.com/doc/net-scale-dave-report.pdf

work page 2004

[6] [6]

Pomerleau

Dean A. Pomerleau. ALVINN, an autonomous land vehicle in a neural network. Technical report, Carnegie Mellon University, 1989. URL: http://repository.cmu.edu/cgi/viewcontent. cgi?article=2874&context=compsci

work page 1989

[7] [7]

DARPA LAGR program

Wikipedia.org. DARPA LAGR program. http://en.wikipedia.org/wiki/DARPA_LAGR_ Program

work page

[8] [8]

Trajectory planning for a four-wheel-steering vehicle

Danwei Wang and Feng Qi. Trajectory planning for a four-wheel-steering vehicle. In Proceedings of the 2001 IEEE International Conference on Robotics & Automation , May 21–26 2001. URL: http: //www.ntu.edu.sg/home/edwwang/confpapers/wdwicar01.pdf

work page 2001

[9] [9]

URL: https://drive.google.com/open?id= 0B9raQzOpizn1TkRIa241ZnBEcjQ

DA VE 2 driving a lincoln. URL: https://drive.google.com/open?id= 0B9raQzOpizn1TkRIa241ZnBEcjQ. 9

work page