Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification 论文

20212021 IEEE/CVF International Conference on Computer Vision (ICCV)引用 349
Advanced Neural Network ApplicationsVideo Surveillance and Tracking MethodsDomain Adaptation and Few-Shot Learning

详细信息

发表期刊/会议
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
发表日期
2021-10-01
发表年份
2021

关键词

Advanced Neural Network ApplicationsVideo Surveillance and Tracking MethodsDomain Adaptation and Few-Shot Learning

摘要

Attention mechanism has demonstrated great potential in fine-grained visual recognition tasks. In this paper, we present a counterfactual attention learning method to learn more effective attention based on causal inference. Unlike most existing methods that learn visual attention based on conventional likelihood, we propose to learn the attention with counterfactual causality, which provides a tool to measure the attention quality and a powerful supervisory signal to guide the learning process. Specifically, we analyze the effect of the learned visual attention on network prediction through counterfactual intervention and maximize the effect to encourage the network to learn more useful attention for fine-grained image recognition. Empirically, we evaluate our method on a wide range of fine-grained recognition tasks where attention plays a crucial role, including fine-grained image categorization, person re-identification, and vehicle re-identification. The consistent improvement on all benchmarks demonstrates the effectiveness of our method. <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup>