Skip to main content
Article
PSTR: End-to-End One-Step Person Search With Transformers
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
  • Jiale Cao, Tianjin University, China & Mohamed bin Zayed University of Artificial Intelligence
  • Pang Yanwei, Tianjin University, China
  • Rao Anwer, Mohamed bin Zayed University of Artificial Intelligence
  • Hisham Cholakkal, Mohamed bin Zayed University of Artificial Intelligence
  • Jin Xie, Mohamed bin Zayed University of Artificial Intelligence & University of Central Florida, United States
  • Mubarak Shah, University of Central Florida
  • Fahad Shahbaz Khan, Mohamed bin Zayed University of Artificial Intelligence & Linköping University, Sweden
Document Type
Conference Proceeding
Abstract

We propose a novel one-step transformer-based person search framework, PSTR, that jointly performs person detection and re-identification (re-id) in a single architecture. PSTR comprises a person search-specialized (PSS) module that contains a detection encoder-decoder for person detection along with a discriminative re-id decoder for person re-id. The discriminative re-id decoder utilizes a multi-level supervision scheme with a shared decoder for discriminative re-id feature learning and also comprises a part attention block to encode relationship between different parts of a person. We further introduce a simple multi-scale scheme to support re-id across person instances at different scales. PSTR jointly achieves the diverse objectives of object-level recognition (detection) and instance-level matching (re-id). To the best of our knowledge, we are the first to propose an end-to-end one-step transformer-based person search framework. Experiments are performed on two popular benchmarks: CUHK-SYSU and PRW. Our extensive ablations reveal the merits of the proposed contributions. Further, the proposed PSTR sets a new state-of-the-art on both benchmarks. On the challenging PRW benchmark, PSTR achieves a mean average precision (mAP) score of 56.5%. The source code is available at https://github.com/JialeCao001/PSTR. © 2022 IEEE.

DOI
10.1109/CVPR52688.2022.00924
Publication Date
9-27-2022
Keywords
  • categorization,
  • Recognition,
  • detection,
  • retrieval
Comments

Open access available at CVPR 2022

Archived, thanks to CVPR Open Access

Uploaded: Feb 09, 2023

Citation Information
J. Cao et al., "PSTR: End-to-End One-Step Person Search With Transformers," 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 2022, pp. 9448-9457, doi: 10.1109/CVPR52688.2022.00924.