A Commute in Data: The comma2k19 Dataset

Andrew Haden; Eder Santana; Harald Schafer; Riccardo Biasini

arxiv: 1812.05752 · v1 · pith:ZFJB342Lnew · submitted 2018-12-14 · 💻 cs.RO

A Commute in Data: The comma2k19 Dataset

Harald Schafer , Eder Santana , Andrew Haden , Riccardo Biasini This is my paper

classification 💻 cs.RO

keywords gnssdatadatasetcommacomma2k19laikaalgorithmscalifornia

0 comments

read the original abstract

comma.ai presents comma2k19, a dataset of over 33 hours of commute in California's 280 highway. This means 2019 segments, 1 minute long each, on a 20km section of highway driving between California's San Jose and San Francisco. The dataset was collected using comma EONs that have sensors similar to those of any modern smartphone including a road-facing camera, phone GPS, thermometers and a 9-axis IMU. Additionally, the EON captures raw GNSS measurements and all CAN data sent by the car with a comma grey panda. Laika, an open-source GNSS processing library, is also introduced here. Laika produces 40% more accurate positions than the GNSS module used to collect the raw data. This dataset includes pose (position + orientation) estimates in a global reference frame of the recording camera. These poses were computed with a tightly coupled INS/GNSS/Vision optimizer that relies on data processed by Laika. comma2k19 is ideal for development and validation of tightly coupled GNSS algorithms and mapping algorithms that work with commodity sensors.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

ParkingScenes: A Structured Dataset for End-to-End Autonomous Parking in Simulation Scenes
cs.CV 2026-04 unverdicted novelty 6.0

ParkingScenes is a new multimodal dataset of 704 structured reverse and parallel parking episodes generated in CARLA with Hybrid A* and MPC trajectories, showing better model performance than unstructured simulation data.
All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles
cs.CV 2025-10 unverdicted novelty 4.0

A survey synthesizing sensor fusion strategies, AV datasets, and emerging LLM/VLM-powered object detection pipelines for autonomous vehicles.