A 28-nW Noise-Robust Voice Activity Detector with Background Aware Feature Extraction

Jingsen Yang, Liangjian Lyu, Zirui Dong, Heyu Ren, C. J.Richard Shi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In light of the increasing number of Internet of Things (loT) devices, such as intelligent vehicles and smart assistants, it has become imperative to develop low-power Voice Activity Detection (VAD) devices. The always-on VAD devices detect the voice to wake up the target system, thus dominating the standby power consumption of the loT devices. The typical VAD consists of a feature extractor and a neural-network-based classifier. The algorithm using the frequency-domain features which can be obtained by modulation frequency [1], fast Fourier transform (FFT) [2], and analog filter banks, can achieve high detection accuracy. However, the feature extractor induces high power consumption due to the complex operations. Alternatively, the time-domain analog VADs [5]-[6] achieve low power consumption, due to the lack of a frequency extractor, but also suffers from reduced accuracy that the audio amplitude is interfered with noise easily, especially in noisy environments with a signal-to-noise ratio (SNR) is lower than OdB. In summary, achieving high accuracy and low power consumption simultaneously in VAD devices is a critical challenge.

Original languageEnglish
Title of host publication2023 IEEE Asian Solid-State Circuits Conference, A-SSCC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350330038
DOIs
StatePublished - 2023
Event19th IEEE Asian Solid-State Circuits Conference, A-SSCC 2023 - Haikou, China
Duration: 5 Nov 20238 Nov 2023

Publication series

Name2023 IEEE Asian Solid-State Circuits Conference, A-SSCC 2023

Conference

Conference19th IEEE Asian Solid-State Circuits Conference, A-SSCC 2023
Country/TerritoryChina
CityHaikou
Period5/11/238/11/23

Fingerprint

Dive into the research topics of 'A 28-nW Noise-Robust Voice Activity Detector with Background Aware Feature Extraction'. Together they form a unique fingerprint.

Cite this