Finding Things: Image Parsing with Regions and Per-Exemplar Detectors (paper)

2013. Citations: 226

Artificial Intelligence: Advanced Neural Network Applications, Advanced Image and Video Retrieval Techniques, Multimodal Machine Learning Applications

Abstract

This paper presents a system for image parsing, or labeling each pixel in an image with its semantic category, aimed at achieving broad coverage across hundreds of object categories, many of them sparsely sampled. The system combines region-level features with per-exemplar sliding window detectors. Per-exemplar detectors are better suited for our parsing task than traditional bounding box detectors: they perform well on classes with little training data and high intra-class variation, and they allow object masks to be transferred into the test image for pixel-level segmentation. The proposed system achieves state-of-the-art accuracy on three challenging datasets, the largest of which contains 45,676 images and 232 labels.
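The mask-transfer idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how region and detector evidence are combined, so the weighted sum below is a placeholder assumption, and the names `resize_mask_nearest`, `detector_score_map`, `parse_image`, the `weight` parameter, and the detection dictionary layout (`label`, `box`, `score`, `mask`) are all hypothetical. Each detection pastes its exemplar's object mask into the test image at the detected box, and the resulting per-pixel detector scores are added to region-based per-pixel class scores before taking the best label at each pixel.

```python
import numpy as np

def resize_mask_nearest(mask, out_h, out_w):
    """Nearest-neighbor resize of a 2-D mask to shape (out_h, out_w)."""
    in_h, in_w = mask.shape
    rows = np.arange(out_h) * in_h // out_h
    cols = np.arange(out_w) * in_w // out_w
    return mask[rows[:, None], cols[None, :]]

def detector_score_map(detections, height, width, num_classes):
    """Accumulate transferred exemplar masks into per-pixel class scores.

    Each detection is a dict (hypothetical layout) with:
      label: int class index
      box:   (x0, y0, x1, y1) in pixels, x1 and y1 exclusive
      score: float detector confidence
      mask:  2-D binary array, the exemplar's object mask
    """
    det = np.zeros((height, width, num_classes), dtype=float)
    for d in detections:
        x0, y0, x1, y1 = d["box"]
        # Clip the box to the image; skip boxes that fall outside or are empty.
        cx0, cy0 = max(x0, 0), max(y0, 0)
        cx1, cy1 = min(x1, width), min(y1, height)
        if cx1 <= cx0 or cy1 <= cy0:
            continue
        # Stretch the exemplar mask to the full box, then crop to the visible part.
        mask = resize_mask_nearest(d["mask"], y1 - y0, x1 - x0)
        mask = mask[cy0 - y0:cy1 - y0, cx0 - x0:cx1 - x0]
        det[cy0:cy1, cx0:cx1, d["label"]] += d["score"] * mask
    return det

def parse_image(region_scores, detections, weight=1.0):
    """Label each pixel from region scores plus weighted detector scores.

    The weighted sum is an assumption made for illustration only.
    """
    h, w, c = region_scores.shape
    combined = region_scores + weight * detector_score_map(detections, h, w, c)
    return combined.argmax(axis=2)
```

A call would pass an `(H, W, C)` array of region-based class scores and a list of detections; the result is an `(H, W)` array of class indices. Because each detection contributes only where its transferred mask is nonzero, the output follows the exemplar's object shape rather than the rectangular detection box.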

Authors

Svetlana Lazebnik

Joseph Tighe
