跳到主要导航 跳到搜索 跳到主要内容

Effective Domain Adaptation for Robust Dysarthric Speech Recognition

  • Shanhu Wang
  • , Jing Zhao*
  • , Shiliang Sun
  • *此作品的通讯作者
  • East China Normal University
  • Ministry of Education of the People's Republic of China

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

By transferring knowledge from abundant normal speech to limited dysarthric speech, dysarthric speech recognition (DSR) has witnessed significant progress. However, existing adaptation techniques mainly focus on the full leverage of normal speech, discarding the sparse nature of dysarthric speech, which poses a great challenge for DSR training in low-resource scenarios. In this paper, we present an effective domain adaptation framework to build robust DSR systems with scarce target data. Joint data preprocessing strategy is employed to alleviate the sparsity of dysarthric speech and close the gap between source and target domains. To enhance the adaptability of dysarthric speakers across different severity levels, the Domain-adapted Transformer model is devised to learn both domain-invariant and domain-specific features. All experimental results demonstrate that the proposed methods achieve impressive performance on both speaker-dependent and speaker-independent DSR tasks. Particularly, even with half of the target training data, our DSR systems still maintain high accuracy on speakers with severe dysarthria.

源语言英语
主期刊名Neural Information Processing - 30th International Conference, ICONIP 2023, Proceedings
编辑Biao Luo, Long Cheng, Zheng-Guang Wu, Hongyi Li, Chaojie Li
出版商Springer Science and Business Media Deutschland GmbH
62-73
页数12
ISBN(印刷版)9789819981403
DOI
出版状态已出版 - 2024
活动30th International Conference on Neural Information Processing, ICONIP 2023 - Changsha, 中国
期限: 20 11月 202323 11月 2023

出版系列

姓名Communications in Computer and Information Science
1964 CCIS
ISSN(印刷版)1865-0929
ISSN(电子版)1865-0937

会议

会议30th International Conference on Neural Information Processing, ICONIP 2023
国家/地区中国
Changsha
时期20/11/2323/11/23

指纹

探究 'Effective Domain Adaptation for Robust Dysarthric Speech Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此