跳到主要导航 跳到搜索 跳到主要内容

Towards Robust Chinese Spelling Check Systems: Multi-round Error Correction with Ensemble Enhancement

  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Chinese Spelling Check requires a system to automatically correct spelling errors in a sentence. There are diverse methods proposed to solve this task. A few methods improve the robustness of the model through data augmentation, but they have some weaknesses. Errors inserted randomly might disturb the real distribution of data. Moreover, different models may produce different results when predicting the same error sentence. Based on these intuitions, we develop a multi-round error correction method with ensemble enhancement, which is robust in solving Chinese Spelling Check challenges. Specifically, multi-round error correction follows an iterative correction pipeline, where a single error is corrected at each round, and the subsequent correction is conducted based on the previous results. Furthermore, we proposed two strategies of ensemble enhancement. For each predicted correction, results of multiple models are mutually authenticated by weighted voting and dominate voting. Experiments have proved the effectiveness of our system. It achieves the best performance on NLPCC 2023 CSC shared tasks. More analyses verify that both multi-round error correction and ensemble enhancement contribute to its good results. Our code is publicly available on GitHub.

源语言英语
主期刊名Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings
编辑Fei Liu, Nan Duan, Qingting Xu, Yu Hong
出版商Springer Science and Business Media Deutschland GmbH
325-336
页数12
ISBN(印刷版)9783031446986
DOI
出版状态已出版 - 2023
活动12th National CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2023 - Foshan, 中国
期限: 12 10月 202315 10月 2023

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14304 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议12th National CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2023
国家/地区中国
Foshan
时期12/10/2315/10/23

指纹

探究 'Towards Robust Chinese Spelling Check Systems: Multi-round Error Correction with Ensemble Enhancement' 的科研主题。它们共同构成独一无二的指纹。

引用此