跳到主要导航 跳到搜索 跳到主要内容

PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation

  • Qiqi Gu
  • , Qianyu Zhou
  • , Minghao Xu
  • , Zhengyang Feng
  • , Guangliang Cheng
  • , Xuequan Lu
  • , Jianping Shi
  • , Lizhuang Ma*
  • *此作品的通讯作者
  • Shanghai Jiao Tong University
  • SenseTime Group Research
  • Shanghai AI Laboratory
  • Deakin University
  • Qing Yuan Research Institute
  • MoE Key Lab of Artificial Intelligence

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Cross-domain object detection and semantic segmentation have witnessed impressive progress recently. Existing approaches mainly consider the domain shift resulting from external environments including the changes of background, illumination or weather, while distinct camera intrinsic parameters appear commonly in different domains and their influence for domain adaptation has been very rarely explored. In this paper, we observe that the Field of View (FoV) gap induces noticeable instance appearance differences between the source and target domains. We further discover that the FoV gap between two domains impairs domain adaptation performance under both the FoV-increasing (source FoV < target FoV) and FoV-decreasing cases. Motivated by the observations, we propose the Position-Invariant Transform (PIT) to better align images in different domains. We also introduce a reverse PIT for mapping the transformed/aligned images back to the original image space, and design a loss re-weighting strategy to accelerate the training process. Our method can be easily plugged into existing cross-domain detection/segmentation frameworks, while bringing about negligible computational overhead. Extensive experiments demonstrate that our method can soundly boost the performance on both cross-domain object detection and segmentation for state-of-the-art techniques. Our code is available at https://github.com/sheepooo/PIT-Position-Invariant-Transform.

源语言英语
主期刊名Proceedings - 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021
出版商Institute of Electrical and Electronics Engineers Inc.
8741-8750
页数10
ISBN(电子版)9781665428125
DOI
出版状态已出版 - 2021
活动18th IEEE/CVF International Conference on Computer Vision, ICCV 2021 - Virtual, Online, 加拿大
期限: 11 10月 202117 10月 2021

出版系列

姓名Proceedings of the IEEE International Conference on Computer Vision
ISSN(印刷版)1550-5499

会议

会议18th IEEE/CVF International Conference on Computer Vision, ICCV 2021
国家/地区加拿大
Virtual, Online
时期11/10/2117/10/21

指纹

探究 'PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation' 的科研主题。它们共同构成独一无二的指纹。

引用此