pith. machine review for the scientific record. sign in

arxiv: 1612.04340 · v1 · submitted 2016-12-13 · 📊 stat.ML · cs.LG· cs.RO

Recognition: unknown

End-to-End Deep Reinforcement Learning for Lane Keeping Assist

Ahmad El Sallab , Mohammed Abdou , Etienne Perot , Senthil Yogamani

Authors on Pith no claims yet
classification 📊 stat.ML cs.LGcs.RO
keywords learningdeepwillactionsautonomouscategorydrivingenvironment
0
0 comments X
read the original abstract

Reinforcement learning is considered to be a strong AI paradigm which can be used to teach machines through interaction with the environment and learning from their mistakes, but it has not yet been successfully used for automotive applications. There has recently been a revival of interest in the topic, however, driven by the ability of deep learning algorithms to learn good representations of the environment. Motivated by Google DeepMind's successful demonstrations of learning for games from Breakout to Go, we will propose different methods for autonomous driving using deep reinforcement learning. This is of particular interest as it is difficult to pose autonomous driving as a supervised learning problem as it has a strong interaction with the environment including other vehicles, pedestrians and roadworks. As this is a relatively new area of research for autonomous driving, we will formulate two main categories of algorithms: 1) Discrete actions category, and 2) Continuous actions category. For the discrete actions category, we will deal with Deep Q-Network Algorithm (DQN) while for the continuous actions category, we will deal with Deep Deterministic Actor Critic Algorithm (DDAC). In addition to that, We will also discover the performance of these two categories on an open source car simulator for Racing called (TORCS) which stands for The Open Racing car Simulator. Our simulation results demonstrate learning of autonomous maneuvering in a scenario of complex road curvatures and simple interaction with other vehicles. Finally, we explain the effect of some restricted conditions, put on the car during the learning phase, on the convergence time for finishing its learning phase.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. MTA-RL: Robust Urban Driving via Multi-modal Transformer-based 3D Affordances and Reinforcement Learning

    cs.CV 2026-05 unverdicted novelty 5.0

    MTA-RL predicts 3D driving affordances from multi-modal sensors with a transformer and uses them as the observation space for an RL policy, yielding better route completion and generalization than baselines in CARLA u...