hub Mixed citations

End to End Learning for Self-Driving Cars

Mariusz Bojarski, Davide Del Testa, Daniel Dworakowski, Bernhard Firner, Beat Flepp, Prasoon Goyal · 2016 · cs.CV · arXiv 1604.07316

Mixed citation behavior. Most common role is background (50%).

64 Pith papers citing it

Background 50% of classified citations

open full Pith review browse 64 citing papers arXiv PDF

abstract

We trained a convolutional neural network (CNN) to map raw pixels from a single front-facing camera directly to steering commands. This end-to-end approach proved surprisingly powerful. With minimum training data from humans the system learns to drive in traffic on local roads with or without lane markings and on highways. It also operates in areas with unclear visual guidance such as in parking lots and on unpaved roads. The system automatically learns internal representations of the necessary processing steps such as detecting useful road features with only the human steering angle as the training signal. We never explicitly trained it to detect, for example, the outline of roads. Compared to explicit decomposition of the problem, such as lane marking detection, path planning, and control, our end-to-end system optimizes all processing steps simultaneously. We argue that this will eventually lead to better performance and smaller systems. Better performance will result because the internal components self-optimize to maximize overall system performance, instead of optimizing human-selected intermediate criteria, e.g., lane detection. Such criteria understandably are selected for ease of human interpretation which doesn't automatically guarantee maximum system performance. Smaller networks are possible because the system learns to solve the problem with the minimal number of processing steps. We used an NVIDIA DevBox and Torch 7 for training and an NVIDIA DRIVE(TM) PX self-driving car computer also running Torch 7 for determining where to drive. The system operates at 30 frames per second (FPS).

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 9 method 4 other 1

citation-polarity summary

background 7 use method 4 unclear 2 support 1

representative citing papers

Parameterized Hardness of Zonotope Containment and Neural Network Verification

cs.CC · 2025-09-26 · unverdicted · novelty 8.0

The paper proves W[1]-hardness parameterized by dimension d for positivity, zonotope containment, max approximation, and L_p-Lipschitz constants in 2- and 3-layer ReLU networks, showing enumeration methods are optimal under ETH.

Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

cs.RO · 2023-03-07 · accept · novelty 8.0

Diffusion Policy models robot actions as a conditional diffusion process, outperforming prior state-of-the-art methods by 46.9% on average across 12 manipulation tasks from four benchmarks.

LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

LOSCAR-SGD combines local updates, sparse model averaging, and communication-computation overlap with a delay-corrected merge rule, providing convergence rates for smooth non-convex objectives under worker heterogeneity.

Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

Ringmaster LMO extends delay-thresholding from ASGD to LMO-based momentum updates, providing convergence guarantees under (L0, L1)-smoothness and time-complexity bounds that recover optimal rates in the Euclidean case.

4DLidarOpen: An Open 4D FMCW Lidar Dataset for Motion-Aware Autonomous Driving

cs.RO · 2026-05-18 · unverdicted · novelty 7.0

4DLidarOpen is a new open dataset providing synchronized 4D FMCW Lidar velocity measurements, multi-Lidar and camera data, and 3D bounding-box annotations with track IDs to support benchmarks on 3D detection, BEV segmentation, flow prediction, and motion forecasting.

Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations

cs.RO · 2026-05-18 · unverdicted · novelty 7.0

Bench2Drive-Robust is a new closed-loop benchmark that evaluates end-to-end autonomous driving models under deployment perturbations from camera failures, ego-state errors, and compute delays, showing substantial performance degradation beyond image-level tests.

Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

DRATS derives a minimax objective from a feasibility formulation of MTRL to adaptively sample tasks with the largest return gaps, leading to better worst-task performance on MetaWorld benchmarks.

Optimality of Sub-network Laplace Approximations: New Results and Methods

stat.ML · 2026-05-09 · conditional · novelty 7.0

Sub-network Laplace approximations always underestimate full-model predictive variance, and two new gradient-based and greedy selection rules provide theoretically grounded improvements.

TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations

cs.LG · 2026-05-04 · unverdicted · novelty 7.0

TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.

Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation

cs.LG · 2026-05-02 · unverdicted · novelty 7.0

LHSD uses spectral filtering on the log-density Hessian to isolate tangent directions from noise and estimate local intrinsic dimension scalably via Stochastic Lanczos Quadrature.

ST-BCP: Tightening Coverage Bound for Backward Conformal Prediction via Non-Conformity Score Transformation

stat.ML · 2026-02-02 · conditional · novelty 7.0

ST-BCP tightens the coverage bound in Backward Conformal Prediction by applying a computable data-dependent transformation to nonconformity scores, reducing the average gap from 4.20% to 1.12% on benchmarks while proving superiority over the identity baseline.

Extremal Contours: Gradient-driven contours for compact visual attribution

cs.CV · 2025-11-03 · unverdicted · novelty 7.0

A training-free method using Fourier-parameterized star-convex contours optimized via gradients to generate compact, faithful visual attributions for image classifiers on benchmarks like ImageNet.

Steering Your Diffusion Policy with Latent Space Reinforcement Learning

cs.RO · 2025-06-18 · unverdicted · novelty 7.0

DSRL steers pretrained diffusion policies for robotics by applying RL to their latent noise inputs, achieving sample-efficient real-world adaptation with only black-box access.

Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals

cs.LG · 2026-05-13 · unverdicted · novelty 6.0 · 2 refs

Dywave uses wavelet hierarchical decomposition to create event-aligned compact token sequences for heterogeneous IoT signals, yielding up to 12% accuracy gains and 75% shorter inputs on mainstream sequence models across five datasets.

Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

Smaller end-to-end autonomous driving models achieve optimal 3-second trajectory prediction accuracy at lower or intermediate temporal sampling frequencies, whereas larger VLA-style models perform best at the highest frequencies across Waymo, nuScenes, and PAVE datasets.

Ensemble Distributionally Robust Bayesian Optimisation

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

A tractable ensemble distributionally robust Bayesian optimization method achieves improved sublinear regret bounds under context uncertainty.

ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving

cs.RO · 2026-05-06 · unverdicted · novelty 6.0 · 2 refs

ReflectDrive-2 combines masked discrete diffusion with RL-aligned self-editing to generate and refine driving trajectories, reaching 91.0 PDMS on NAVSIM camera-only and 94.8 in best-of-6.

OGPO: Sample Efficient Full-Finetuning of Generative Control Policies

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

OGPO is a sample-efficient off-policy method for full finetuning of generative control policies that reaches SOTA on robotic manipulation tasks and can recover from poor behavior-cloning initializations without expert data.

Empirical Insights of Test Selection Metrics under Multiple Testing Objectives and Distribution Shifts

cs.SE · 2026-04-25 · unverdicted · novelty 6.0

A broad empirical benchmark shows how 15 existing test selection metrics perform for fault detection, performance estimation, and retraining under corrupted, adversarial, temporal, natural, and label shifts across image, text, and Android data.

FingerViP: Learning Real-World Dexterous Manipulation with Fingertip Visual Perception

cs.RO · 2026-04-23 · conditional · novelty 6.0

FingerViP equips each finger with a miniature camera and trains a multi-view diffusion policy that achieves 80.8% success on real-world dexterous tasks previously limited by wrist-camera occlusion.

MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving

cs.RO · 2026-04-13 · unverdicted · novelty 6.0

MVAdapt conditions end-to-end autonomous driving policies on explicit vehicle physics to achieve better zero-shot transfer and few-shot calibration across different vehicles in CARLA simulation.

Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems

cs.LG · 2026-04-09 · unverdicted · novelty 6.0

MOSAIC is a scaling-aware data selection framework that outperforms baselines in training end-to-end autonomous driving planners, achieving comparable or better EPDMS scores with up to 80% less data.

Safety-Aligned 3D Object Detection: Single-Vehicle, Cooperative, and End-to-End Perspectives

cs.CV · 2026-04-02 · unverdicted · novelty 6.0

Safety-aware metrics and losses for 3D detection improve critical error handling in autonomous vehicle perception across single-vehicle, cooperative, and end-to-end settings.

SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space

cs.RO · 2026-03-19 · unverdicted · novelty 6.0

SutureFormer models needle tip movement in video as sequential pixel-space actions via goal-conditioned offline RL with spline-based reward densification, cutting average displacement error by 58.6% on a new 1,158-trajectory kidney suturing dataset.

citing papers explorer

Showing 50 of 64 citing papers.

Parameterized Hardness of Zonotope Containment and Neural Network Verification cs.CC · 2025-09-26 · unverdicted · none · ref 4 · internal anchor
The paper proves W[1]-hardness parameterized by dimension d for positivity, zonotope containment, max approximation, and L_p-Lipschitz constants in 2- and 3-layer ReLU networks, showing enumeration methods are optimal under ETH.
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion cs.RO · 2023-03-07 · accept · none · ref 2 · internal anchor
Diffusion Policy models robot actions as a conditional diffusion process, outperforming prior state-of-the-art methods by 46.9% on average across 12 manipulation tasks from four benchmarks.
LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging cs.LG · 2026-05-20 · unverdicted · none · ref 58 · internal anchor
LOSCAR-SGD combines local updates, sparse model averaging, and communication-computation overlap with a delay-corrected merge rule, providing convergence rates for smooth non-convex objectives under worker heterogeneity.
Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method cs.LG · 2026-05-18 · unverdicted · none · ref 56 · internal anchor
Ringmaster LMO extends delay-thresholding from ASGD to LMO-based momentum updates, providing convergence guarantees under (L0, L1)-smoothness and time-complexity bounds that recover optimal rates in the Euclidean case.
4DLidarOpen: An Open 4D FMCW Lidar Dataset for Motion-Aware Autonomous Driving cs.RO · 2026-05-18 · unverdicted · none · ref 26 · internal anchor
4DLidarOpen is a new open dataset providing synchronized 4D FMCW Lidar velocity measurements, multi-Lidar and camera data, and 3D bounding-box annotations with track IDs to support benchmarks on 3D detection, BEV segmentation, flow prediction, and motion forecasting.
Bench2Drive-Robust: Benchmarking Closed-Loop Autonomous Driving under Deployment Perturbations cs.RO · 2026-05-18 · unverdicted · none · ref 16 · internal anchor
Bench2Drive-Robust is a new closed-loop benchmark that evaluates end-to-end autonomous driving models under deployment perturbations from camera failures, ego-state errors, and compute delays, showing substantial performance degradation beyond image-level tests.
Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling cs.LG · 2026-05-14 · unverdicted · none · ref 170 · internal anchor
DRATS derives a minimax objective from a feasibility formulation of MTRL to adaptively sample tasks with the largest return gaps, leading to better worst-task performance on MetaWorld benchmarks.
Optimality of Sub-network Laplace Approximations: New Results and Methods stat.ML · 2026-05-09 · conditional · none · ref 15 · internal anchor
Sub-network Laplace approximations always underestimate full-model predictive variance, and two new gradient-based and greedy selection rules provide theoretically grounded improvements.
TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations cs.LG · 2026-05-04 · unverdicted · none · ref 235 · internal anchor
TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.
Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation cs.LG · 2026-05-02 · unverdicted · none · ref 123 · internal anchor
LHSD uses spectral filtering on the log-density Hessian to isolate tangent directions from noise and estimate local intrinsic dimension scalably via Stochastic Lanczos Quadrature.
ST-BCP: Tightening Coverage Bound for Backward Conformal Prediction via Non-Conformity Score Transformation stat.ML · 2026-02-02 · conditional · none · ref 5 · internal anchor
ST-BCP tightens the coverage bound in Backward Conformal Prediction by applying a computable data-dependent transformation to nonconformity scores, reducing the average gap from 4.20% to 1.12% on benchmarks while proving superiority over the identity baseline.
Extremal Contours: Gradient-driven contours for compact visual attribution cs.CV · 2025-11-03 · unverdicted · none · ref 7 · internal anchor
A training-free method using Fourier-parameterized star-convex contours optimized via gradients to generate compact, faithful visual attributions for image classifiers on benchmarks like ImageNet.
Steering Your Diffusion Policy with Latent Space Reinforcement Learning cs.RO · 2025-06-18 · unverdicted · none · ref 26 · internal anchor
DSRL steers pretrained diffusion policies for robotics by applying RL to their latent noise inputs, achieving sample-efficient real-world adaptation with only black-box access.
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals cs.LG · 2026-05-13 · unverdicted · none · ref 125 · 2 links · internal anchor
Dywave uses wavelet hierarchical decomposition to create event-aligned compact token sequences for heterogeneous IoT signals, yielding up to 12% accuracy gains and 75% shorter inputs on mainstream sequence models across five datasets.
Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction cs.CV · 2026-05-11 · unverdicted · none · ref 3 · internal anchor
Smaller end-to-end autonomous driving models achieve optimal 3-second trajectory prediction accuracy at lower or intermediate temporal sampling frequencies, whereas larger VLA-style models perform best at the highest frequencies across Waymo, nuScenes, and PAVE datasets.
Ensemble Distributionally Robust Bayesian Optimisation cs.LG · 2026-05-08 · unverdicted · none · ref 93 · internal anchor
A tractable ensemble distributionally robust Bayesian optimization method achieves improved sublinear regret bounds under context uncertainty.
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving cs.RO · 2026-05-06 · unverdicted · none · ref 93 · 2 links · internal anchor
ReflectDrive-2 combines masked discrete diffusion with RL-aligned self-editing to generate and refine driving trajectories, reaching 91.0 PDMS on NAVSIM camera-only and 94.8 in best-of-6.
OGPO: Sample Efficient Full-Finetuning of Generative Control Policies cs.LG · 2026-05-04 · unverdicted · none · ref 24 · internal anchor
OGPO is a sample-efficient off-policy method for full finetuning of generative control policies that reaches SOTA on robotic manipulation tasks and can recover from poor behavior-cloning initializations without expert data.
Empirical Insights of Test Selection Metrics under Multiple Testing Objectives and Distribution Shifts cs.SE · 2026-04-25 · unverdicted · none · ref 10 · internal anchor
A broad empirical benchmark shows how 15 existing test selection metrics perform for fault detection, performance estimation, and retraining under corrupted, adversarial, temporal, natural, and label shifts across image, text, and Android data.
FingerViP: Learning Real-World Dexterous Manipulation with Fingertip Visual Perception cs.RO · 2026-04-23 · conditional · none · ref 7 · internal anchor
FingerViP equips each finger with a miniature camera and trains a multi-view diffusion policy that achieves 80.8% success on real-world dexterous tasks previously limited by wrist-camera occlusion.
MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving cs.RO · 2026-04-13 · unverdicted · none · ref 17 · internal anchor
MVAdapt conditions end-to-end autonomous driving policies on explicit vehicle physics to achieve better zero-shot transfer and few-shot calibration across different vehicles in CARLA simulation.
Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems cs.LG · 2026-04-09 · unverdicted · none · ref 4 · internal anchor
MOSAIC is a scaling-aware data selection framework that outperforms baselines in training end-to-end autonomous driving planners, achieving comparable or better EPDMS scores with up to 80% less data.
Safety-Aligned 3D Object Detection: Single-Vehicle, Cooperative, and End-to-End Perspectives cs.CV · 2026-04-02 · unverdicted · none · ref 27 · internal anchor
Safety-aware metrics and losses for 3D detection improve critical error handling in autonomous vehicle perception across single-vehicle, cooperative, and end-to-end settings.
SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space cs.RO · 2026-03-19 · unverdicted · none · ref 3 · internal anchor
SutureFormer models needle tip movement in video as sequential pixel-space actions via goal-conditioned offline RL with spline-based reward densification, cutting average displacement error by 58.6% on a new 1,158-trajectory kidney suturing dataset.
Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving cs.RO · 2026-02-26 · unverdicted · none · ref 6 · internal anchor
The paper introduces Hyper Diffusion Planner (HDP), a diffusion-based E2E AD framework that identifies insights on loss space, trajectory representation and data scaling, adds RL post-training, and reports 10x performance gains over 200 km of real-world testing across 6 scenarios.
LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving cs.CV · 2025-12-23 · accept · none · ref 3 · internal anchor
Reducing expert-student asymmetries in visibility, uncertainty, and route specification enables a new TransFuser v6 policy that reaches 95 DS on Bench2Drive and more than doubles prior scores on Longest6 v2 and Town13.
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail cs.RO · 2025-10-30 · conditional · none · ref 6 · internal anchor
Alpamayo-R1 introduces a VLA model with a Chain of Causation dataset and multi-stage SFT-plus-RL training that reports 12% better planning accuracy and 35% fewer close encounters versus trajectory-only baselines in driving tasks.
Automated Test Validators for Flaky Cyber-Physical System Simulators: Approach and Evaluation cs.SE · 2025-08-28 · unverdicted · none · ref 49 · internal anchor
Test validators generated via genetic programming using the Ochiai SBFL formula are more accurate and robust to flakiness than alternatives from Tarantula, Naish, decision trees, or rules, with 88.7% alignment to known requirements in CPS case studies.
EMMA: End-to-End Multimodal Model for Autonomous Driving cs.CV · 2024-10-30 · unverdicted · none · ref 114 · internal anchor
EMMA is an end-to-end multimodal LLM that converts camera data into trajectories, objects, and road graphs via text prompts and reports state-of-the-art motion planning on nuScenes plus competitive detection results on Waymo.
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models cs.AI · 2024-08-01 · conditional · none · ref 29 · internal anchor
Empirical analysis shows scaling inference compute via strategies like tree search can be more efficient than scaling model parameters, with 7B models plus novel search outperforming 34B models.
Octo: An Open-Source Generalist Robot Policy cs.RO · 2024-05-20 · unverdicted · none · ref 8 · internal anchor
Octo is an open-source transformer-based generalist robot policy pretrained on 800k trajectories that serves as an effective initialization for finetuning across diverse robotic platforms.
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations cs.RO · 2024-02-16 · conditional · none · ref 25 · internal anchor
3D Diffuser Actor unifies diffusion policies with 3D scene features to set new state-of-the-art results on RLBench and CALVIN robot benchmarks.
GPT-Driver: Learning to Drive with GPT cs.CV · 2023-10-02 · conditional · none · ref 2 · internal anchor
GPT-3.5 is turned into an autonomous-vehicle motion planner by representing driving scenes and trajectories as language tokens and applying a prompting-reasoning-finetuning pipeline, with results shown on nuScenes.
Emergence of Exploratory Look-Around Behaviors through Active Observation Completion cs.CV · 2019-06-27 · unverdicted · none · ref 50 · internal anchor
An RL agent learns to actively explore by being rewarded for inferring unobserved scene parts after short glimpse sequences, with sidekick policy learning enabling generalization to other active perception tasks.
Rules of the Road: Predicting Driving Behavior with a Convolutional Model of Semantic Interactions cs.CV · 2019-06-21 · unverdicted · none · ref 8 · internal anchor
A grid-based convolutional architecture fuses semantic maps and 3D perceptions to model driving interactions and predict future agent states, evaluated on a new industry-grade dataset.
Anomaly-Informed Confidence Calibration for Vision-Based Safety Prediction cs.RO · 2026-05-20 · unverdicted · none · ref 28 · internal anchor
Fusing perceptual and dynamics anomaly scores enables online temperature scaling that cuts expected calibration error by 37% on physical DonkeyCar tests with four unseen anomaly types.
C-CoT: Counterfactual Chain-of-Thought with Vision-Language Models for Safe Autonomous Driving cs.CV · 2026-05-11 · unverdicted · none · ref 1 · internal anchor
C-CoT applies VLMs to autonomous driving via five-stage reasoning with a meta-action tree for counterfactuals, yielding 81.9% risk recall, 3.52% collision rate, and 1.98 m L2 error on a new dataset.
Rennala MVR: Improved Time Complexity for Parallel Stochastic Optimization via Momentum-Based Variance Reduction math.OC · 2026-05-09 · unverdicted · none · ref 54 · internal anchor
Rennala MVR improves time complexity over Rennala SGD for smooth nonconvex stochastic optimization in heterogeneous parallel systems under a mean-squared smoothness assumption.
InterFuserDVS: Event-Enhanced Sensor Fusion for Safe RL-Based Decision Making cs.CV · 2026-05-05 · unverdicted · none · ref 2 · internal anchor
Integrating DVS event data into InterFuser through token fusion yields a driving score of 77.2 and 100% route completion on CARLA benchmarks, indicating improved robustness in dynamic conditions.
UniAda: Universal Adaptive Multi-objective Adversarial Attack for End-to-End Autonomous Driving Systems cs.SE · 2026-04-25 · unverdicted · none · ref 6 · internal anchor
UniAda introduces a white-box multi-objective attack using adaptive weighting to generate perturbations that jointly affect steering and speed in E2E ADS, outperforming benchmarks with average deviations of 3.54-29 degrees and 11-22 km/h.
MetaErr: Towards Predicting Error Patterns in Deep Neural Networks cs.CV · 2026-04-25 · unverdicted · none · ref 4 · internal anchor
MetaErr introduces a meta-model that forecasts per-sample prediction errors in deep neural networks solely from base model performance observations, outperforming baselines and boosting pseudo-labeling on three computer vision datasets.
End-to-End ILC for Repetitive Untrackable Tasks: A Cooperative Game Perspective eess.SY · 2026-04-18 · unverdicted · none · ref 4 · internal anchor
An end-to-end ILC for untrackable repetitive tasks is formulated as a cooperative game between reference and feedforward updates, yielding a sufficient condition for lower cost than norm-optimal ILC.
Artificial Intelligence for Modeling and Simulation of Mixed Automated and Human Traffic cs.AI · 2026-04-14 · unverdicted · none · ref 15 · internal anchor
This survey synthesizes AI techniques for mixed autonomy traffic simulation and introduces a taxonomy spanning agent-level behavior models, environment-level methods, and cognitive/physics-informed approaches.
Reliable and Real-Time Highway Trajectory Planning via Hybrid Learning-Optimization Frameworks cs.RO · 2025-08-06 · unverdicted · none · ref 29 · internal anchor
Hybrid learning-optimization framework for highway trajectory planning that reports over 97% scenario success rate and 54 ms average cycle time on the HighD dataset while enforcing formal safety via MIQP.
Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings cs.CV · 2025-05-22 · unverdicted · none · ref 1 · internal anchor
TEA is a new targeted adversarial attack that incorporates edge information from the target image to reduce query count and improve performance in low-query black-box hard-label settings.
Survival of the Cheapest: Cost-Aware Hardware Adaptation for Adversarial Robustness cs.CR · 2024-09-11 · unverdicted · none · ref 6 · internal anchor
A decision-support framework applies AFT models to show Nvidia L4 GPUs yield 20% longer adversarial survival time at 75% lower cost than V100, with inference latency as the strongest robustness predictor.
Analyzing Adversarial Inputs in Deep Reinforcement Learning cs.LG · 2024-02-07 · unverdicted · none · ref 2 · internal anchor
Introduces the Adversarial Rate metric and associated tools to systematically evaluate and visualize the impact of adversarial inputs on DRL policies using formal verification.
PyTorch Distributed: Experiences on Accelerating Data Parallel Training cs.DC · 2020-06-28 · accept · none · ref 21 · internal anchor
PyTorch distributed data parallel attains near-linear scalability on 256 GPUs through gradient bucketing, computation-communication overlap, and selective synchronization skipping.
Towards Generalizing Sensorimotor Control Across Weather Conditions cs.LG · 2019-07-25 · unverdicted · none · ref 1 · internal anchor
A teacher-student framework with domain translation transfers steering control from one weather condition to multiple others using only source-domain labels.
NeuroTrajectory: A Neuroevolutionary Approach to Local State Trajectory Learning for Autonomous Vehicles cs.RO · 2019-06-26 · unverdicted · none · ref 2 · internal anchor
NeuroTrajectory is a neuroevolutionary method that trains deep neural networks via genetic algorithms to estimate multi-objective optimal state trajectories over a finite horizon for autonomous vehicle motion planning.

End to End Learning for Self-Driving Cars

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer