pith. sign in

arxiv: 1902.06326 · v3 · pith:YZJFOIJ7new · submitted 2019-02-17 · 💻 cs.CV

PIXOR: Real-time 3D Object Detection from Point Clouds

classification 💻 cs.CV
keywords detectionobjectcloudspixorpointreal-timebenchmarkcomputation
0
0 comments X
read the original abstract

We address the problem of real-time 3D object detection from point clouds in the context of autonomous driving. Computation speed is critical as detection is a necessary component for safety. Existing approaches are, however, expensive in computation due to high dimensionality of point clouds. We utilize the 3D data more efficiently by representing the scene from the Bird's Eye View (BEV), and propose PIXOR, a proposal-free, single-stage detector that outputs oriented 3D object estimates decoded from pixel-wise neural network predictions. The input representation, network architecture, and model optimization are especially designed to balance high accuracy and real-time efficiency. We validate PIXOR on two datasets: the KITTI BEV object detection benchmark, and a large-scale 3D vehicle detection benchmark. In both datasets we show that the proposed detector surpasses other state-of-the-art methods notably in terms of Average Precision (AP), while still runs at >28 FPS.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data

    cs.CV 2024-11 unverdicted novelty 4.0

    Describes a camera-radar fusion network that uses raw RD spectra and BEV-polar camera features for BEV object detection, evaluated for accuracy and compute on the RADIal dataset.