FlowIt: Global Matching via Hierarchical Transformers and Optimal Transport for Optical Flow 文章

ArXiv CS.CV2026-06-02NEWSen作者: Sadra Safadoust, Fabio Tosi, Matteo Poggi, Fatma G\"uney

摘要

arXiv:2603.28759v2 Announce Type: replace Abstract: We present FlowIt, a novel architecture for optical flow estimation that combines global matching with confidence and occlusion-guided refinement. At its core, FlowIt leverages a hierarchical transformer architecture that captures extensive global context, enabling the model to effectively model long-range correspondences. To overcome the limitations of localized matching, we formulate the flow initialization as an optimal transport problem. This formulation yields a highly robust initial flow field, alongside explicitly derived occlusion and confidence maps. These cues are then seamlessly integrated into a guided refinement stage, where the network actively propagates reliable motion estimates from high-confidence regions into ambiguous, low-confidence areas. Extensive experiments across the Sintel, KITTI, Spring, and LayeredFlow datasets validate the effectiveness of our approach.