Towards Resolving Optimization Conflicts Between Image- and Text-Based Person Re-Identification 文章

ArXiv CS.CV2026-06-02NEWSen作者: Karina Kvanchiani, Timur Mamedov

摘要

arXiv:2606.02242v1 Announce Type: new Abstract: The joint optimization of image-based (I2I) and text-based (T2I) person re-identification (ReID) is hindered by modality discrepancies and conflicting training objectives, leading to suboptimal shared representations. While I2I ReID focuses on identity-level invariance across images of the same person, T2I ReID is driven by instance-specific textual descriptions tied to unique visual traits. This paper explores the fundamental difference between two ReID tasks and their optimization processes for effective training. Since I2I and T2I ReID are often studied separately, the loss functions optimized for one retrieval setting may negatively affect the representation quality required by the other. Motivated by these findings, we propose a decoupled two-stage training pipeline for learning a shared representation across image and text modalities.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据