可见的红外人员重新识别的反事实干预功能转移

论文标题

可见的红外人员重新识别的反事实干预功能转移

Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification

论文作者

Li, Xulin, Lu, Yan, Liu, Bin, Liu, Yating, Yin, Guojun, Chu, Qi, Huang, Jinyang, Zhu, Feng, Zhao, Rui, Yu, Nenghai

论文摘要

基于图形的模型最近在人的重新识别任务中取得了巨大的成功，该任务首先计算了不同人的图形拓扑结构（亲和力），然后将信息传递给他们的信息以实现更强的功能。但是，我们发现在可见的红外人员重新识别任务（VI-REID）中，现有的基于图的方法由于两个问题而遭受不良概括：1）火车测试模式平衡差距，这是VI-REID任务的属性。在训练阶段，两个模式数据的数量是平衡的，但推理极为不平衡，导致基于图的VI-REID方法的概括较低。 2）由图形模块的端到端学习方式引起的亚最佳拓扑结构。我们分析训练有素的输入特征会削弱图形拓扑的学习，从而使其在推理过程中不够概括。在本文中，我们提出了一种反事实干预特征转移（CIFT）方法来解决这些问题。具体而言，均匀且异质的特征传输（H2FT）旨在通过两种独立的设计的图形模块和不平衡的方案模拟来减少火车测试模态差距。此外，提出了反事实关系干预（CRI）来利用反事实干预和因果效应工具来突出拓扑结构在整个训练过程中的作用，这使得图形拓扑结构更加可靠。对标准VI-REID基准测试的广泛实验表明，CIFT在各种设置下都优于最新方法。

Graph-based models have achieved great success in person re-identification tasks recently, which compute the graph topology structure (affinities) among different people first and then pass the information across them to achieve stronger features. But we find existing graph-based methods in the visible-infrared person re-identification task (VI-ReID) suffer from bad generalization because of two issues: 1) train-test modality balance gap, which is a property of VI-ReID task. The number of two modalities data are balanced in the training stage, but extremely unbalanced in inference, causing the low generalization of graph-based VI-ReID methods. 2) sub-optimal topology structure caused by the end-to-end learning manner to the graph module. We analyze that the well-trained input features weaken the learning of graph topology, making it not generalized enough during the inference process. In this paper, we propose a Counterfactual Intervention Feature Transfer (CIFT) method to tackle these problems. Specifically, a Homogeneous and Heterogeneous Feature Transfer (H2FT) is designed to reduce the train-test modality balance gap by two independent types of well-designed graph modules and an unbalanced scenario simulation. Besides, a Counterfactual Relation Intervention (CRI) is proposed to utilize the counterfactual intervention and causal effect tools to highlight the role of topology structure in the whole training process, which makes the graph topology structure more reliable. Extensive experiments on standard VI-ReID benchmarks demonstrate that CIFT outperforms the state-of-the-art methods under various settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题