Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes 论文

2011引用 646
Robotics and Sensor-Based LocalizationAdvanced Image and Video Retrieval TechniquesAdvanced Neural Network Applications

摘要

We present a method for detecting 3D objects using multi-modalities. While it is generic, we demonstrate it on the combination of an image and a dense depth map which give complementary object information. It works in real-time, under heavy clutter, does not require a time consuming training stage, and can handle untextured objects. It is based on an efficient representation of templates that capture the different modalities, and we show in many experiments on commodity hardware that our approach significantly outperforms state-of-the-art methods on single modalities.