Object detection using a max-margin Hough transform 论文

20092009 IEEE Conference on Computer Vision and Pattern Recognition引用 302
Image and Object Detection TechniquesAdvanced Image and Video Retrieval TechniquesRobotics and Sensor-Based Localization

摘要

We present a discriminative Hough transform based object detector where each local part casts a weighted vote for the possible locations of the object center. We show that the weights can be learned in a max-margin framework which directly optimizes the classification performance. The discriminative training takes into account both the codebook appearance and the spatial distribution of its position with respect to the object center to derive its importance. On various datasets we show that the discriminative training improves the Hough detector. Combined with a verification step using a SVM based classifier, our approach achieves a detection rate of 91.9% at 0.3 false positives per image on the ETHZ shape dataset, a significant improvement over the state of the art, while running the verification step on at least an order of magnitude fewer windows than in a sliding window approach.