Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers 论文

20212021 IEEE/CVF International Conference on Computer Vision (ICCV)引用 296

Advanced Vision and ImagingAdvanced Image Processing TechniquesOptical measurement and interference techniques

Advanced Image Processing Techniques Advanced Vision and Imaging Optical measurement and interference techniques

作者

摘要

Stereo depth estimation relies on optimal correspondence matching between pixels on epipolar lines in the left and right images to infer depth. In this work, we revisit the problem from a sequence-to-sequence correspondence perspective to replace cost volume construction with dense pixel matching using position information and attention. This approach, named STereo TRansformer (STTR), has several advantages: It 1) relaxes the limitation of a fixed disparity range, 2) identifies occluded regions and provides confidence estimates, and 3) imposes uniqueness constraints during the matching process. We report promising results on both synthetic and real-world datasets and demonstrate that STTR generalizes across different domains, even without fine-tuning.

作者查看全部 (7)

Mathias Unberath

Russell H. Taylor

Francis X. Creighton

Andy S. Ding

Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers 论文

摘要

作者查看全部 (7)

相关技术查看全部 (1)

相关事件

相关文章