跳到主要导航 跳到搜索 跳到主要内容

Metacognitive symbolic distillation framework for multi-choice machine reading comprehension

  • Jiacheng Yao
  • , Xin Xu
  • , Guoxiu He*
  • *此作品的通讯作者
  • East China Normal University

科研成果: 期刊稿件文章同行评审

摘要

Symbolic knowledge distillation can transfer the reasoning abilities of large language models (LLMs) effectively to smaller models. However, in the context of multi-choice machine reading comprehension (MMRC), traditional distillation methods focus primarily on learning from the rationales of the correct options generated by the large teacher model, overlooking the educational significance of reasoning behind incorrect options. In human education, metacognition emphasizes the importance of actively identifying errors, enhancing the overall understanding. Inspired by this approach, we propose an innovative framework that incorporates metacognition into symbolic distillation. Initially, we prompt the teacher LLM to generate rationales for all options in the MMRC dataset. Subsequently, the small student model is fine-tuned using these rationales, including those for incorrect options. Our experiments on two MMRC datasets demonstrate that this framework improves the performance of the small student model significantly compared to standard fine-tuned and distilled models. We further find that when the student model is sufficiently large, upgrading the teacher model could yield further improvements. However, the effectiveness of our framework is constrained by the performance of the teacher model on more complex MMRC tasks.

源语言英语
文章编号113130
期刊Knowledge-Based Systems
312
DOI
出版状态已出版 - 15 3月 2025

指纹

探究 'Metacognitive symbolic distillation framework for multi-choice machine reading comprehension' 的科研主题。它们共同构成独一无二的指纹。

引用此