TY - JOUR
T1 - Structure Destruction and Content Combination for Generalizable Anti-Spoofing
AU - Hu, Chengyang
AU - Cao, Junyi
AU - Zhang, Ke Yue
AU - Yao, Taiping
AU - Ding, Shouhong
AU - Ma, Lizhuang
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2022/10/1
Y1 - 2022/10/1
N2 - In securing the face verification systems, prior face anti-spoofing studies excavate hidden cues in original images to discriminate real persons and diverse attacks with the assistance of auxiliary supervision. However, limited by several inherent shortcomings in their training process: 1) Neglect of the multi-scale nature of spoof cues; 2) Complete integral structure in a single image; 3) Implicit subdomains in the whole dataset, these methods are weak to mine comprehensive spoof patterns and may stick on memorization of the entire training dataset, incurring overfitting. In this paper, we propose a new framework named Destruction and Combination Network (DCN) including Multi-scale Representation Extraction Module, Structure Destruction Module, and Content Combination Module to address these limitations respectively. The first mechanism exploits multi-scale representation to learn spoof cues comprehensively. The second one destroys images into patches to construct non-structural inputs, and the last scheme recombines patches from different subdomains or classes into a mixup construction. Based on the above splitting-and-splicing operation, we further introduce Local Relation Modeling Module to model the second-order relationship between patches. To show the generalizable capacity of the proposed framework, besides simple intra-dataset testing, we test our method in cross-domain, cross-content, and cross-attack scenarios. Extensive experiments on different scenarios demonstrate the reliability of our method against state-of-the-art competitors.
AB - In securing the face verification systems, prior face anti-spoofing studies excavate hidden cues in original images to discriminate real persons and diverse attacks with the assistance of auxiliary supervision. However, limited by several inherent shortcomings in their training process: 1) Neglect of the multi-scale nature of spoof cues; 2) Complete integral structure in a single image; 3) Implicit subdomains in the whole dataset, these methods are weak to mine comprehensive spoof patterns and may stick on memorization of the entire training dataset, incurring overfitting. In this paper, we propose a new framework named Destruction and Combination Network (DCN) including Multi-scale Representation Extraction Module, Structure Destruction Module, and Content Combination Module to address these limitations respectively. The first mechanism exploits multi-scale representation to learn spoof cues comprehensively. The second one destroys images into patches to construct non-structural inputs, and the last scheme recombines patches from different subdomains or classes into a mixup construction. Based on the above splitting-and-splicing operation, we further introduce Local Relation Modeling Module to model the second-order relationship between patches. To show the generalizable capacity of the proposed framework, besides simple intra-dataset testing, we test our method in cross-domain, cross-content, and cross-attack scenarios. Extensive experiments on different scenarios demonstrate the reliability of our method against state-of-the-art competitors.
KW - Face anti-spoofing
KW - generic object anti-spoofing
KW - structure destruction
UR - https://www.scopus.com/pages/publications/85141597462
U2 - 10.1109/TBIOM.2022.3220406
DO - 10.1109/TBIOM.2022.3220406
M3 - 文章
AN - SCOPUS:85141597462
SN - 2637-6407
VL - 4
SP - 508
EP - 521
JO - IEEE Transactions on Biometrics, Behavior, and Identity Science
JF - IEEE Transactions on Biometrics, Behavior, and Identity Science
IS - 4
ER -