跳到主要导航 跳到搜索 跳到主要内容

A hybrid model with pre-trained entity-aware transformer for relation extraction

  • Jinxin Yao
  • , Min Zhang*
  • , Biyang Wang
  • , Xianda Xu
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Distantly supervised relation extraction is an efficient method to extract novel relational facts from unstructed text. Most previous neural methods adopt Convolutional Neural Network (CNN) or Recurrent Neural Network (RNN) to encode sentences. However, CNN is difficult to learn long-range dependencies and the parallelization of training RNN is precluded by its sequential nature. In this paper, we propose a novel hybrid model that combines Piece-wise Convolutional Neural Network (PCNN) and Entity-Aware Transformer to extract local features and learn the dependencies between distant positions jointly. The entity-aware Transformer is able to take semantic and syntax information under consideration and acquire entity-specific representations. The inner-sentence attention mechanism is then used over Transformer to alleviate the noise caused by irrelevant words. We concatenate outputs of PCNN and Transformer with word embeddings of entity mentions and then send them to the classifier, which can boost the performance of our model further. A transfer learning based strategy is applied, where the entity-aware Transformer is initialized with a priori knowledge learned from the related task of entity typing to improve the robustness of our model. The experimental results on a large-scale benchmark dataset show that our hybrid model with the pre-training strategy gets AUC score of 0.432 and outperforms the state-of-the-art baselines.

源语言英语
主期刊名Knowledge Science, Engineering and Management - 13th International Conference, KSEM 2020, Proceedings, Part 1
编辑Gang Li, Heng Tao Shen, Ye Yuan, Xiaoyang Wang, Huawen Liu, Xiang Zhao
出版商Springer
148-160
页数13
ISBN(印刷版)9783030551292
DOI
出版状态已出版 - 2020
活动13th International Conference on Knowledge Science, Engineering and Management, KSEM 2020 - Hangzhou, 中国
期限: 28 8月 202030 8月 2020

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12274 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议13th International Conference on Knowledge Science, Engineering and Management, KSEM 2020
国家/地区中国
Hangzhou
时期28/08/2030/08/20

指纹

探究 'A hybrid model with pre-trained entity-aware transformer for relation extraction' 的科研主题。它们共同构成独一无二的指纹。

引用此