跳到主要导航 跳到搜索 跳到主要内容

大模型提示工程能够替代经典深度学习模型吗?——基于医学文本实体关系抽取任务的对比研究

  • Yufeng Duan*
  • , Jiahong Xie
  • , Ping Bai
  • , Tianyang Gong
  • *此作品的通讯作者
  • East China Normal University

科研成果: 期刊稿件文章同行评审

摘要

[Objective] To explore whether large language models (LLMs) and prompt engineering can replace classical deep learning models in the task of entity relation extraction from Chinese medical texts with high professionalism and domain characteristics. [Methods] This study uses three LLMs (GLM-4, ERNIE-4-Turbo, and DeepSeek-R1), and three classical deep learning models (CBLUE, CasRel, and GPLinker), to systematically compare the performance differences between LLMs based on prompt engineering and classical deep learning models. The comparison is conducted by varying the number of relation types to be extracted, the number of examples in the prompt for LLMs, and the training data size for classical deep learning models. We use BERT-Base and RoBERTa as encoders for classical deep learning models. [Results] Experimental results on the CMeIEV2 dataset show that: (I) RoBERTa-CBLUE and RoBERTa-GPLinker achieve the best extraction results. When extracting one relation type, the F1 score reaches 0.5826 and 0.5853, and when extracting ten relation types, the F1 score is 0.5112 and 0.4934; (II) LLMs are not good at extracting multiple relation types simultaneously. When extracting two relation types, the F1 score of GLM-4, ERNIE-4-Turbo, and DeepSeek-R1 decrease by 0.1182, 0.0885, and 0.1310, respectively, compared to extracting one relation type; (III) adding examples to the prompt can improve the extraction performance of LLMs, but adding more examples does not necessarily lead to better results. [Limitations] This study is based on a single dataset, and future work could extend the experiments to datasets from other domains. [Conclusions] The prompt engineering approach for LLMs is currently difficult to replace classical deep learning models and can only be considered as an alternative when labeled samples are limited.

投稿的翻译标题Can Large Language Model and Prompt Engineering Replace Classical Deep Learning Model? A Comparative Study Based on Medical Text Entity Relation Extraction Task
源语言繁体中文
页(从-至)61-75
页数15
期刊Data Analysis and Knowledge Discovery
10
1
DOI
出版状态已出版 - 25 1月 2026

关键词

  • Chinese Medical Text
  • Deep Learning Model
  • Entity Relation Extraction
  • Large Language Model
  • Prompt Engineering

指纹

探究 '大模型提示工程能够替代经典深度学习模型吗?——基于医学文本实体关系抽取任务的对比研究' 的科研主题。它们共同构成独一无二的指纹。

引用此