TY - GEN
T1 - Layer-Wise Prompt-Guided Interaction with Prototypical Contrastive Learning for Face Anti-Spoofing Generalization
AU - Fu, Jiahao
AU - Li, Qingli
AU - Wang, Yan
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Recently, Vision-Language Models (VLMs) have demonstrated remarkable success in Face Anti-Spoofing (FAS). However, most of these VLM methods adopt a CLIP-like in-teraction strategy, where image and text features interact only once at the final layer of their respective encoders. This strategy tends to emphasize global features while largely overlooking local information, which is crucial for FAS tasks, thereby compromising subsequent classification accuracy. To address this issue, we propose layer-wise prompts that facilitate cross-modal interaction at three layers, fully leveraging the potential of textual guidance. Furthermore, to enhance the generalization capability of the FAS model, we introduce a prototype-based contrastive loss to encourage a clear separation between the two classes during training. Experimental results on several datasets demonstrate that our method outperforms current state-of-the-art approaches, confirming its effectiveness and superiority.
AB - Recently, Vision-Language Models (VLMs) have demonstrated remarkable success in Face Anti-Spoofing (FAS). However, most of these VLM methods adopt a CLIP-like in-teraction strategy, where image and text features interact only once at the final layer of their respective encoders. This strategy tends to emphasize global features while largely overlooking local information, which is crucial for FAS tasks, thereby compromising subsequent classification accuracy. To address this issue, we propose layer-wise prompts that facilitate cross-modal interaction at three layers, fully leveraging the potential of textual guidance. Furthermore, to enhance the generalization capability of the FAS model, we introduce a prototype-based contrastive loss to encourage a clear separation between the two classes during training. Experimental results on several datasets demonstrate that our method outperforms current state-of-the-art approaches, confirming its effectiveness and superiority.
KW - FAS
KW - Generalization
KW - Layer-wise Prompts
KW - Prototype-Based Contrastive Loss
KW - VLMs
UR - https://www.scopus.com/pages/publications/105025443910
U2 - 10.1109/CISP-BMEI68103.2025.11259267
DO - 10.1109/CISP-BMEI68103.2025.11259267
M3 - 会议稿件
AN - SCOPUS:105025443910
T3 - Proceedings - 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025
BT - Proceedings - 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025
A2 - Li, Qingli
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025
Y2 - 25 October 2025 through 27 October 2025
ER -