Prompt Enhanced Generative MRC Framework for Pancreatic Cancer NER

  • Zhen Dong Tan
  • Yan Yang*
  • Bo Li
  • Beilei Wang
  • Gang Jin
  • Chengcai Chen
  • Liang He

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

Medical Named Entity Recognition (NER) is a fundamental but challenging task because specialized entity datasets are scarce, for example for tumor entities, which are often overlapped and discontinuous. In this paper, we propose a novel Prompt Enhanced Generative Machine Reading Comprehension Framework (PGMRC) to improve NER performance on overlapped and discontinuous entities. Specifically, we formulate NER as a Machine Reading Comprehension (MRC) task and employ a pre-trained encoder-decoder module to generate entity span sequences according to the corresponding entity query. In this way, the query guides the model to focus on answer entities in the context, which naturally handles entity overlap and alleviates the exposure bias of the generative model. Then, we introduce continuous prompts into the self-attention mechanism of the Transformer to reduce the dependence on manually constructed queries. In addition, we annotate 875 pathological documents of pancreatic cancer and construct a Chinese pathological NER dataset (PAN) containing overlapped and discontinuous entities. Finally, we conduct experiments on three widely used benchmarks (GENIA, ACE04, ACE05) and on our dataset PAN. The experiments demonstrate the effectiveness of PGMRC, which outperforms state-of-the-art methods.
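
The MRC formulation described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the exact target format, so the flattened index sequence, the `SEP` marker, the English example sentence, and every name below (`MRCExample`, `build_examples`, `decode_target`, `surface`) are assumptions made for illustration only. The sketch shows how one query per entity type, paired with a target sequence of span indices, can represent entities that overlap or are discontinuous.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# A fragment is an inclusive (start, end) token range. An entity is a list of
# fragments, so a discontinuous entity simply has more than one fragment.
Fragment = Tuple[int, int]
Entity = List[Fragment]

SEP = -1  # hypothetical marker that closes one entity in the target sequence


@dataclass
class MRCExample:
    query: str          # natural-language question for one entity type
    context: List[str]  # tokens of the input sentence
    target: List[int]   # flattened span indices the decoder should generate


def build_examples(tokens: List[str],
                   entities_by_type: Dict[str, List[Entity]],
                   queries: Dict[str, str]) -> List[MRCExample]:
    """Create one (query, context, target) example per entity type."""
    examples = []
    for etype, query in queries.items():
        target: List[int] = []
        for entity in sorted(entities_by_type.get(etype, [])):
            for start, end in entity:
                target.extend([start, end])
            target.append(SEP)
        examples.append(MRCExample(query, tokens, target))
    return examples


def decode_target(target: List[int]) -> List[Entity]:
    """Invert build_examples: split on SEP and pair indices into fragments."""
    entities, current = [], []
    for value in target:
        if value == SEP:
            if current:
                entities.append(list(zip(current[0::2], current[1::2])))
            current = []
        else:
            current.append(value)
    return entities


def surface(tokens: List[str], entity: Entity) -> str:
    """Render an entity as text by joining the tokens of its fragments."""
    return " ".join(" ".join(tokens[s:e + 1]) for s, e in entity)


# "muscle pain" and "muscle fatigue" share the token "muscle" (overlap),
# and "muscle fatigue" skips "pain and" (discontinuity).
tokens = "muscle pain and fatigue".split()
gold = {"symptom": [[(0, 1)], [(0, 0), (3, 3)]]}
queries = {"symptom": "Find all symptoms mentioned in the text."}

for example in build_examples(tokens, gold, queries):
    print(example.query, example.target)
    print([surface(tokens, e) for e in decode_target(example.target)])
```

Because each entity is written out as its own run of indices, two entities can reuse the same token without conflict, which a single tag per token cannot express. A type with no entities in the sentence yields an empty target.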

Original language: English
Title of host publication: Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
Editors: Donald Adjeroh, Qi Long, Xinghua Shi, Fei Guo, Xiaohua Hu, Srinivas Aluru, Giri Narasimhan, Jianxin Wang, Mingon Kang, Ananda M. Mondal, Jin Liu
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 817-820
Number of pages: 4
ISBN (Electronic): 9781665468190
State: Published - 2022
Event: 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022 - Las Vegas, United States
Duration: 6 Dec 2022 to 8 Dec 2022

Publication series

Name: Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022

Conference

Conference: 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
Country/Territory: United States
City: Las Vegas
Period: 6/12/22 to 8/12/22

Keywords

  • Machine Reading Comprehension
  • NER
  • Pancreatic Cancer
  • Prompt
  • Sequence-to-Sequence
