RobusTReID: Defending Vision Transformer for Robust Image ReID

Hua Zhang, Tingting Xiao, Li Sun*, Qingli Li

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Vision Transformer (ViT) achieves competitive results in person ReID, not only due to the powerful ability on feature representation, but also its resistance to attacks. However, they are vulnerable to specially designed attacks. To enhance their robustness, this paper proposes RobusTReID to defend the ViT-based model against perturbed images without obvious performance drop on clean data. The basic idea is to incorporate the adversarial co-training into ReID, which first disturbs pixels by minimizing adversarial loss in primary feature branch, then optimizes model by ReID task loss computed in all branches on clean and perturbed data. We separate the representation paths for clean and perturbed images. Particularly, a learnable [ADV] token and low-rank positional embeddings (PE) are incorporated to build the feature for perturbed image. Extensive experiments on several ReID datasets show that our method effectively increases the robustness of ReID model under different types of attacks.

Original languageEnglish
Title of host publication2025 IEEE International Conference on Multimedia and Expo
Subtitle of host publicationJourney to the Center of Machine Imagination, ICME 2025 - Conference Proceedings
PublisherIEEE Computer Society
ISBN (Electronic)9798331594954
DOIs
StatePublished - 2025
Event2025 IEEE International Conference on Multimedia and Expo, ICME 2025 - Nantes, France
Duration: 30 Jun 20254 Jul 2025

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2025 IEEE International Conference on Multimedia and Expo, ICME 2025
Country/TerritoryFrance
CityNantes
Period30/06/254/07/25

Keywords

  • Adversarial attack and defense
  • Image Retrieval
  • Person Re-Identification
  • Vision Transformer

Fingerprint

Dive into the research topics of 'RobusTReID: Defending Vision Transformer for Robust Image ReID'. Together they form a unique fingerprint.

Cite this