跳到主要导航 跳到搜索 跳到主要内容

Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy

  • Desheng Wang
  • , Yangjie Wei*
  • , Ke Zhang
  • , Dong Ji
  • , Yi Wang
  • *此作品的通讯作者
  • Key Laboratory of Medical Image Computing (Northeastern University)

科研成果: 期刊稿件文章同行评审

摘要

Automatic speech recognition (ASR) is an essential technique of human–computer interactions; gain control is a commonly used operation in ASR. However, inappropriate gain control strategies can lead to an increase in the word error rate (WER) of ASR. As there is a current lack of sufficient theoretical analyses and proof of the relationship between gain control and WER, various unconstrained gain control strategies have been adopted on realistic ASR systems, and the optimal gain control with respect to the lowest WER, is rarely achieved. A gain control strategy named maximized original signal transmission (MOST) is proposed in this study to minimize the adverse impact of gain control on ASR systems. First, by modeling the gain control strategy, the quantitative relationship between the gain control strategy and the ASR performance was established using the noise figure index. Second, through an analysis of the quantitative relationship, an optimal MOST gain control strategy with minimal performance degradation was theoretically deduced. Finally, comprehensive comparative experiments on a Mandarin dataset show that the proposed MOST gain control strategy can significantly reduce the WER of the experimental ASR system, with a 10% mean absolute WER reduction at −9 dB gain.

源语言英语
文章编号3027
期刊Sensors
22
8
DOI
出版状态已出版 - 1 4月 2022
已对外发布

指纹

探究 'Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy' 的科研主题。它们共同构成独一无二的指纹。

引用此