跳到主要导航 跳到搜索 跳到主要内容

Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources

  • Taolin Zhang
  • , Chengyu Wang
  • , Minghui Qiu
  • , Bite Yang
  • , Zerui Cai
  • , Xiaofeng He*
  • , Jun Huang
  • *此作品的通讯作者
  • East China Normal University
  • Alibaba Group Holding Ltd.
  • DXY

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Machine Reading Comprehension (MRC) aims to extract answers to questions given a passage, which has been widely studied recently especially in open domains. However, few efforts have been made on closed-domain MRC, mainly due to the lack of large-scale training data. In this paper, we introduce a multi-target MRC task for the medical domain, whose goal is to predict answers to medical questions and the corresponding support sentences from medical information sources simultaneously, in order to ensure the high reliability of medical knowledge serving. A high-quality dataset (more than 18k samples) is manually constructed for the purpose, named Multi-task Chinese Medical MRC dataset (CMedMRC), with detailed analysis conducted. We further propose a Chinese medical BERT model for the task (CMedBERT), which fuses medical knowledge into pre-trained language models by the dynamic fusion mechanism of heterogeneous features and the multi-task learning strategy. Experiments show that CMedBERT consistently outperforms strong baselines by fusing context-aware and knowledge-aware token representations.

源语言英语
主期刊名Findings of the Association for Computational Linguistics
主期刊副标题ACL-IJCNLP 2021
编辑Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
出版商Association for Computational Linguistics (ACL)
2237-2249
页数13
ISBN(电子版)9781954085541
DOI
出版状态已出版 - 2021
活动Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 - Virtual, Online
期限: 1 8月 20216 8月 2021

出版系列

姓名Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

会议

会议Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Virtual, Online
时期1/08/216/08/21

指纹

探究 'Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources' 的科研主题。它们共同构成独一无二的指纹。

引用此