跳到主要导航 跳到搜索 跳到主要内容

Towards Explainable Chinese Native Learner Essay Fluency Assessment: Dataset, Tasks, and Method

  • Xinshu Shen
  • , Hongyi Wu
  • , Yadong Zhang
  • , Man Lan*
  • , Xiaopeng Bai
  • , Shaoguang Mao
  • , Yuanbin Wu
  • , Xinlin Zhuang
  • , Li Cai
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Grammatical Error Correction (GEC) is a crucial technique in Automated Essay Scoring (AES) for evaluating the fluency of essays. However, in Chinese, existing GEC datasets often fail to consider the importance of specific grammatical error types within compositional scenarios, lack research on data collected from native Chinese speakers, and largely overlook cross-sentence grammatical errors. Furthermore, the measurement of the overall fluency of an essay is often overlooked. To address these issues, we present CEFA (Chinese Essay Fluency Assessment), an extensive corpus that is derived from essays authored by native Chinese-speaking primary and secondary students and encapsulates essay fluency scores along with both coarse and fine-grained grammatical error types and corrections. Experiments employing various benchmark models on CEFA substantiate the challenge of our dataset. Our findings further highlight the significance of fine-grained annotations in fluency assessment and the mutually beneficial relationship between error types and corrections.

源语言英语
主期刊名EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024
编辑Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
出版商Association for Computational Linguistics (ACL)
15515-15528
页数14
ISBN(电子版)9798891761681
DOI
出版状态已出版 - 2024
活动2024 Findings of the Association for Computational Linguistics, EMNLP 2024 - Hybrid, Miami, 美国
期限: 12 11月 202416 11月 2024

出版系列

姓名EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024

会议

会议2024 Findings of the Association for Computational Linguistics, EMNLP 2024
国家/地区美国
Hybrid, Miami
时期12/11/2416/11/24

指纹

探究 'Towards Explainable Chinese Native Learner Essay Fluency Assessment: Dataset, Tasks, and Method' 的科研主题。它们共同构成独一无二的指纹。

引用此