Monocular 3D Object Detection for Autonomous Driving 论文

2016引用 1075
Advanced Neural Network ApplicationsRobotics and Sensor-Based LocalizationVisual Attention and Saliency Detection

详细信息

发表日期
2016-06-01
发表年份
2016

关键词

Advanced Neural Network ApplicationsRobotics and Sensor-Based LocalizationVisual Attention and Saliency Detection

摘要

The goal of this paper is to perform 3D object detection from a single monocular image in the domain of autonomous driving. Our method first aims to generate a set of candidate class-specific object proposals, which are then run through a standard CNN pipeline to obtain high-quality object detections. The focus of this paper is on proposal generation. In particular, we propose an energy minimization approach that places object candidates in 3D using the fact that objects should be on the ground-plane. We then score each candidate box projected to the image plane via several intuitive potentials encoding semantic segmentation, contextual information, size and location priors and typical object shape. Our experimental evaluation demonstrates that our object proposal generation approach significantly outperforms all monocular approaches, and achieves the best detection performance on the challenging KITTI benchmark, among published monocular competitors.