跳到主要导航 跳到搜索 跳到主要内容

A Crowdsourcing Based Human-in-the-Loop Framework for Denoising UUs in Relation Extraction Tasks

  • Mengting Li
  • , Jian Jin*
  • , Wen Wu
  • , Yan Yang
  • , Liang He
  • , Jing Yang
  • *此作品的通讯作者
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In relation extraction tasks, distant supervision methods expand dataset by aligning entity pairs in different knowledge bases and completing the relations between two entities. However, these methods ignore the fact that sentences labels generated by distant supervision methods with high confidence are often incorrect in the real world called Unknown Unknowns (UUs). To deal with this challenge, we propose a crowdsourcing based human-in-the-loop denoising framework which iteratively discovers UUs and corrects them by crowdsourcing to better extract relations. During each epoch of iterations, we choose one sentence bag and repeat two steps: Firstly, attention based Long Short-Term Memory network is applied as a selector to discover potential UUs. Secondly, these UUs are annotated by crowdsourcing with two answer collecting strategies and fed back into selector as positive samples. Until the accuracy of selector reaches a threshold, all annotated samples are added into relation classifier as cleaned train set and framework moves on to next epoch with new sentence bags. The experiments on the New York Times dataset and analysis of potential UUs demonstrate that our framework denoise the dataset and outperforms all the baselines on distant supervision relation extraction tasks.

源语言英语
主期刊名2019 International Joint Conference on Neural Networks, IJCNN 2019
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781728119854
DOI
出版状态已出版 - 7月 2019
活动2019 International Joint Conference on Neural Networks, IJCNN 2019 - Budapest, 匈牙利
期限: 14 7月 201919 7月 2019

出版系列

姓名Proceedings of the International Joint Conference on Neural Networks
2019-July

会议

会议2019 International Joint Conference on Neural Networks, IJCNN 2019
国家/地区匈牙利
Budapest
时期14/07/1919/07/19

指纹

探究 'A Crowdsourcing Based Human-in-the-Loop Framework for Denoising UUs in Relation Extraction Tasks' 的科研主题。它们共同构成独一无二的指纹。

引用此