Channel Robust Strategies with Data Augmentation for Audio Anti-spoofing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Robustness against channel variability remains a formidable challenge in audio anti-spoofing for speaker verification systems. Channel effects can significantly degrade the performance of countermeasure systems, making them susceptible to spoofing attacks. To address this challenge, we present a comprehensive approach that integrates channel-robust preprocessing with advanced graph-based neural networks to enhance detection reliability. Raw audio waveforms are preprocessed with data augmentation to simulate diverse acoustic conditions, and encoded using a modified RawNet2-based encoder to extract critical features. An adaptive graph module processes these features into spectral and temporal graphs. Our proposed method dynamically combines these graphs using a Heterogeneous Stacking Graph Attention Layer (HS-GAL), facilitating deeper integration and processing of audio data. The Max Graph Operation (MGO) further refines feature selection, crucial for identifying spoofed content. Additionally, our model incorporates adversarial and multi-task learning strategies, significantly enhancing its generalization capabilities across various datasets. Experimental results demonstrate that our approach reduces the Equal Error Rate (EER) by over 20% and the minimum tandem detection cost function (min t-DCF) by 25% relative to the current state-of-the-art, substantiating its efficacy in improving the security of speaker verification systems against channel-induced vulnerabilities.

Original languageEnglish
Title of host publicationInformation Security - 27th International Conference, ISC 2024, Proceedings
EditorsNicky Mouha, Nick Nikiforakis
PublisherSpringer Science and Business Media Deutschland GmbH
Pages121-139
Number of pages19
ISBN (Print)9783031757631
DOIs
StatePublished - 2025
Event27th Information Security Conference, ISC 2024 - Arlington, United States
Duration: 23 Oct 202425 Oct 2024

Publication series

NameLecture Notes in Computer Science
Volume15258 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference27th Information Security Conference, ISC 2024
Country/TerritoryUnited States
CityArlington
Period23/10/2425/10/24

Keywords

  • Audio Anti-Spoofing
  • Channel Robustness
  • Graph Neural Networks
  • Speaker Verification

Fingerprint

Dive into the research topics of 'Channel Robust Strategies with Data Augmentation for Audio Anti-spoofing'. Together they form a unique fingerprint.

Cite this