Speaking rhythmically improves speech recognition under "cocktail-party" conditions

Mengyuan Wang, Lingzhi Kong, Changxin Zhang, Xihong Wu, Liang Li*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

19 Scopus citations

Abstract

This study examines whether speech rhythm affects speech recognition under "cocktail-party" conditions. Against a two-talker masker, but not a speech-spectrum noise masker, recognition of the last (third) keyword in a normal rhythmic sentence was significantly better than that of the first keyword. However, this word-position-related speech-recognition improvement disappeared for rhythmically hybrid target sentences that were constructed by grouping parts from different sentences with different artificially modulated rhythms (rates) (fast, normal, or slow). Thus, the normal rhythm with a constant rate plays a role in improving speech recognition against informational speech masking, probably through a build-up of temporal prediction for target words.

Original languageEnglish
Pages (from-to)EL255-EL259
JournalJournal of the Acoustical Society of America
Volume143
Issue number4
DOIs
StatePublished - 1 Apr 2018

Fingerprint

Dive into the research topics of 'Speaking rhythmically improves speech recognition under "cocktail-party" conditions'. Together they form a unique fingerprint.

Cite this