TorontoCity: Seeing the World with a Million Eyes

Bin Yang; Gellert Mattyus; Hang Chu; Joel Cheverie; Justin Liang; Min Bai; Raquel Urtasun; Sanja Fidler; Shenlong Wang; Wenjie Luo

arxiv: 1612.00423 · v1 · pith:APZDYPRDnew · submitted 2016-12-01 · 💻 cs.CV

TorontoCity: Seeing the World with a Million Eyes

Shenlong Wang , Min Bai , Gellert Mattyus , Hang Chu , Wenjie Luo , Bin Yang , Justin Liang , Joel Cheverie

show 2 more authors

Sanja Fidler Raquel Urtasun

This is my paper

classification 💻 cs.CV

keywords buildingaroundbenchmarkdifferentextractionlabelingmapsroad

0 comments

read the original abstract

In this paper we introduce the TorontoCity benchmark, which covers the full greater Toronto area (GTA) with 712.5 $km^2$ of land, 8439 $km$ of road and around 400,000 buildings. Our benchmark provides different perspectives of the world captured from airplanes, drones and cars driving around the city. Manually labeling such a large scale dataset is infeasible. Instead, we propose to utilize different sources of high-precision maps to create our ground truth. Towards this goal, we develop algorithms that allow us to align all data sources with the maps while requiring minimal human supervision. We have designed a wide variety of tasks including building height estimation (reconstruction), road centerline and curb extraction, building instance segmentation, building contour extraction (reorganization), semantic labeling and scene type classification (recognition). Our pilot study shows that most of these tasks are still difficult for modern convolutional neural networks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Delaunay Canopy: Building Wireframe Reconstruction from Airborne LiDAR Point Clouds via Delaunay Graph
cs.CV 2026-04 unverdicted novelty 6.0

Delaunay Canopy uses Delaunay graphs as a geometric prior with region-wise curvature scoring to reconstruct accurate building wireframes from sparse and noisy airborne LiDAR point clouds.