Layer-Wise Prompt-Guided Interaction with Prototypical Contrastive Learning for Face Anti-Spoofing Generalization

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Recently, Vision-Language Models (VLMs) have demonstrated remarkable success in Face Anti-Spoofing (FAS). However, most of these VLM methods adopt a CLIP-like interaction strategy, where image and text features interact only once at the final layer of their respective encoders. This strategy tends to emphasize global features while largely overlooking local information, which is crucial for FAS tasks, thereby compromising subsequent classification accuracy. To address this issue, we propose layer-wise prompts that facilitate cross-modal interaction at three layers, fully leveraging the potential of textual guidance. Furthermore, to enhance the generalization capability of the FAS model, we introduce a prototype-based contrastive loss to encourage a clear separation between the two classes during training. Experimental results on several datasets demonstrate that our method outperforms current state-of-the-art approaches, confirming its effectiveness and superiority.
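
The abstract does not give the form of the prototype-based contrastive loss. As a rough illustration only, the sketch below shows one common generic formulation: each class (live, spoof) is summarized by a prototype, taken here as the normalized mean of its embeddings, and each sample is pulled toward its own prototype and pushed from the other via a softmax over cosine similarities. The function names, the temperature value, and the choice of mean prototypes are all assumptions for illustration and are not taken from the paper.

```python
import math

def l2_normalize(v):
    # Scale a vector to unit length (left unchanged if it is all zeros).
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def class_prototypes(embeddings, labels):
    # Hypothetical helper: prototype = normalized mean of each class's
    # normalized embeddings. The paper may build prototypes differently.
    sums, counts = {}, {}
    for z, y in zip(embeddings, labels):
        z = l2_normalize(z)
        if y not in sums:
            sums[y] = [0.0] * len(z)
            counts[y] = 0
        sums[y] = [s + x for s, x in zip(sums[y], z)]
        counts[y] += 1
    return {y: l2_normalize([s / counts[y] for s in sums[y]]) for y in sums}

def prototype_contrastive_loss(embeddings, labels, temperature=0.1):
    # Assumed form: mean over samples of
    #   -log( exp(sim(z, p_y) / t) / sum_c exp(sim(z, p_c) / t) )
    # with cosine similarity and an assumed temperature t.
    protos = class_prototypes(embeddings, labels)
    total = 0.0
    for z, y in zip(embeddings, labels):
        z = l2_normalize(z)
        logits = {c: sum(a * b for a, b in zip(z, p)) / temperature
                  for c, p in protos.items()}
        m = max(logits.values())  # subtract the max for numerical stability
        log_denom = m + math.log(sum(math.exp(v - m) for v in logits.values()))
        total += log_denom - logits[y]
    return total / len(embeddings)

# Toy 2-D embeddings: label 0 = live, label 1 = spoof (illustrative data only).
separated = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
mixed = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
labels = [0, 0, 1, 1]
loss_separated = prototype_contrastive_loss(separated, labels)
loss_mixed = prototype_contrastive_loss(mixed, labels)
```

In this toy setting the loss is lower when the two classes form distinct clusters than when they are interleaved, which is the kind of class separation the abstract says the loss is meant to encourage.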

Original language: English
Title of host publication: Proceedings - 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025
Editors: Qingli Li
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798331577360
DOIs
State: Published - 2025
Event: 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025 - Qingdao, China
Duration: 25 Oct 2025 to 27 Oct 2025

Publication series

Name: Proceedings - 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025

Conference

Conference: 2025 18th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2025
Country/Territory: China
City: Qingdao
Period: 25/10/25 to 27/10/25

Keywords

  • FAS
  • Generalization
  • Layer-wise Prompts
  • Prototype-Based Contrastive Loss
  • VLMs
