VoiceBit: GPU-Accelerated Real-Time Human Voice Separation for Mobile Phones

  • Gang Chen*
  • , Zhaoheng Zhou
  • , Shengyu He
  • , Yi Zheng
  • , Wang Yi
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In mobile speech communication, the speech quality can be severely degraded when the mobile device users are in a noisy acoustic environment. To suppress environmental noises, deep learning based monaural speech separation methods have achieved remarkable progress on boosting the performance of the separation accuracy. However, the latency and computational cost of these methods remain far insufficient for mobile devices. Performance and power constraints make it still challenging to deploy such methods on mobile devices due to their high computational complexity. In this paper, we present VoiceBit, an efficient and light-weight human voice separation framework for real-time speech sep-aration on mobile devices. Specifically, we propose a light-weight speech separation network with reduced computation complexity and memory footprint for minimal compromise in accuracy, to segregate human voice and interfering noises directly from time-domain signals. Furthermore, we present a set of parallel optimizations to accelerate the operations in VoiceBit. Our experiment results show that VoiceBit achieves significant speedup and energy efficiency compared with state-of-the-art frameworks.

Original languageEnglish
Title of host publicationProceedings - 24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1987-1994
Number of pages8
ISBN (Electronic)9798350319934
DOIs
StatePublished - 2022
Externally publishedYes
Event24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022 - Chengdu, China
Duration: 18 Dec 202220 Dec 2022

Publication series

NameProceedings - 24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022

Conference

Conference24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022
Country/TerritoryChina
CityChengdu
Period18/12/2220/12/22

Fingerprint

Dive into the research topics of 'VoiceBit: GPU-Accelerated Real-Time Human Voice Separation for Mobile Phones'. Together they form a unique fingerprint.

Cite this