摘要
Automatic speech recognition (ASR) systems have a wide range of applications in classroom analysis. However, due to the unique structure of classroom dialogue, existing ASR systems often struggle to accurately recognize and organize spoken utterances, creating significant challenges for downstream tasks in educational dialogue analysis. To address this issue, we propose EPIC, a post-processing framework for classroom ASR error correction. We begin by extracting error patterns to gain a deeper understanding of the distribution of ASR errors. Next, we utilize large language models (LLMs) to reconstruct contextual information based on these error patterns, offering a viable solution for error correction with limited labeled data. Finally, after fine-tuning an error correction model, we implement a candidate selection process to identify the most appropriate hypothesis for each context. Extensive experiments with our proposed method demonstrate substantial improvements in word error rate (WER) and overall robustness in ASR error correction, enabling more reliable analysis of educational dialogues and offering deeper insights for educational research.
| 源语言 | 英语 |
|---|---|
| 期刊 | Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing |
| DOI | |
| 出版状态 | 已出版 - 2025 |
| 活动 | 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, 印度 期限: 6 4月 2025 → 11 4月 2025 |
指纹
探究 'EPIC: Error Pattern Informed Correction for Classroom ASR with Limited Labeled Data' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver