Siamese Network for RGB-D Salient Object Detection and Beyond 论文

2021IEEE Transactions on Pattern Analysis and Machine Intelligence引用 232

Visual Attention and Saliency DetectionFace Recognition and PerceptionImage and Video Quality Assessment

Image and Video Quality Assessment Visual Attention and Saliency Detection Face Recognition and Perception

作者

摘要

Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as independent information and design separate networks for feature extraction from each. Such schemes can easily be constrained by a limited amount of training data or over-reliance on an elaborately designed training process. Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture. In this paper, we propose two effective components: joint learning (JL), and densely cooperative fusion (DCF). The JL module provides robust saliency feature learning by exploiting cross-modal commonality via a Siamese network, while the DCF module is introduced for complementary feature discovery. Comprehensive experiments using 5 popular metrics show that the designed framework yields a robust RGB-D saliency detector with good generalization. As a result, JL-DCF significantly advances the SOTAs by an average of ~2.0% (F-measure) across 7 challenging datasets. In addition, we show that JL-DCF is readily applicable to other related multi-modal detection tasks, including RGB-T SOD and video SOD, achieving comparable or better performance.

作者查看全部 (6)

Ce Zhu

Jianbing Shen

Qijun Zhao

Ge-Peng Ji

Siamese Network for RGB-D Salient Object Detection and Beyond 论文

详细信息

摘要

作者查看全部 (6)

相关技术查看全部 (2)

相关事件

相关文章