跳到主要导航 跳到搜索 跳到主要内容

Speech enhancement via combination of Wiener filter and blind source separation

  • Hongmei Hu*
  • , Jalil Taghia
  • , Jinqiu Sang
  • , Jalal Taghia
  • , Nasser Mohammadiha
  • , Masoumeh Azarpour
  • , Rajyalakshmi Dokku
  • , Shouyan Wang
  • , Mark E. Lutman
  • , Stefan Bleeck
  • *此作品的通讯作者
  • University of Southampton
  • Jiangsu University
  • KTH Royal Institute of Technology
  • Ruhr University Bochum

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Automatic speech recognition (ASR) often fails in acoustically noisy environments. Aimed to improve speech recognition scores of an ASR in a real-life like acoustical environment, a speech pre-processing system is proposed in this paper, which consists of several stages: First, a convolutive blind source separation (BSS) is applied to the spectrogram of the signals that are pre-processed by binaural Wiener filtering (BWF). Secondly, the target speech is detected by an ASR system recognition rate based on a Hidden Markov Model (HMM). To evaluate the performance of the proposed algorithm, the signal-to-interference ratio (SIR), the improvement signal-to-noise ratio (ISNR) and the speech recognition rates of the output signals were calculated using the signal corpus of the CHiME database. The results show an improvement in SIR and ISNR, but no obvious improvement of speech recognition scores. Improvements for future research are suggested.

源语言英语
主期刊名Practical Applications of Intelligent Systems
主期刊副标题Proceedings of the Sixth International Conference on Intelligent Systems and Knowledge Engineering, Shanghai, China, Dec 2011 (ISKE2011)
编辑Yinglin Wang, Tianrui Li
485-494
页数10
DOI
出版状态已出版 - 2011
已对外发布

出版系列

姓名Advances in Intelligent and Soft Computing
124
ISSN(印刷版)1867-5662

指纹

探究 'Speech enhancement via combination of Wiener filter and blind source separation' 的科研主题。它们共同构成独一无二的指纹。

引用此