论文标题
车辆:学习重新识别的强大视觉表示
VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification
论文作者
论文摘要
鉴于不同摄像头视图的阶层内车辆的显着变化,车辆重新识别(RE-ID)的一个基本挑战是学习强大而歧视性的视觉表示。由于现有的车辆数据集在训练图像和观点方面受到限制,因此我们建议通过利用四个公共车辆数据集来构建独特的大型车辆数据集(称为车辆),并设计一种简单而有效的两阶段渐进式方法,以学习从车辆莱尼内特学习更多强大的可靠视觉表现。我们方法的第一阶段是通过训练常规分类损失来学习所有域(即源车数据集)的通用表示。这个阶段放松了训练和测试域之间的完全比对,因为它对目标车辆域不可知。第二阶段是通过最大程度地减少我们的车辆和任何目标域之间的分布差异来微调纯粹基于目标车辆的训练模型。我们讨论了我们提出的多源数据集车辆车辆,并通过广泛的实验评估了两阶段渐进式表示学习的有效性。我们在AICITY挑战的私人测试集上实现了86.07%的最新准确性,并在另外两个公共车辆重新ID数据集(即Veri-776和warterID)上获得了竞争成果。我们希望这个新的VehicleNet数据集和博学的强大表示形式可以为实际环境中的车辆重新ID铺平道路。
One fundamental challenge of vehicle re-identification (re-id) is to learn robust and discriminative visual representation, given the significant intra-class vehicle variations across different camera views. As the existing vehicle datasets are limited in terms of training images and viewpoints, we propose to build a unique large-scale vehicle dataset (called VehicleNet) by harnessing four public vehicle datasets, and design a simple yet effective two-stage progressive approach to learning more robust visual representation from VehicleNet. The first stage of our approach is to learn the generic representation for all domains (i.e., source vehicle datasets) by training with the conventional classification loss. This stage relaxes the full alignment between the training and testing domains, as it is agnostic to the target vehicle domain. The second stage is to fine-tune the trained model purely based on the target vehicle set, by minimizing the distribution discrepancy between our VehicleNet and any target domain. We discuss our proposed multi-source dataset VehicleNet and evaluate the effectiveness of the two-stage progressive representation learning through extensive experiments. We achieve the state-of-art accuracy of 86.07% mAP on the private test set of AICity Challenge, and competitive results on two other public vehicle re-id datasets, i.e., VeRi-776 and VehicleID. We hope this new VehicleNet dataset and the learned robust representations can pave the way for vehicle re-id in the real-world environments.