
SpanProto: A Two-stage Span-based Prototypical Network for Few-shot Named Entity Recognition

  • Jianing Wang
  • Chengyu Wang
  • Chuanqi Tan
  • Minghui Qiu
  • Songfang Huang
  • Jun Huang
  • Ming Gao*
  • *Corresponding author for this work
  • East China Normal University
  • Alibaba Group Holding Ltd.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Few-shot Named Entity Recognition (NER) aims to identify named entities with very little annotated data. Previous methods solve this problem based on token-wise classification, which ignores the information of entity boundaries, and inevitably the performance is affected by the massive non-entity tokens. To this end, we propose a seminal span-based prototypical network (SpanProto) that tackles few-shot NER via a two-stage approach, including span extraction and mention classification. In the span extraction stage, we transform the sequential tags into a global boundary matrix, enabling the model to focus on the explicit boundary information. For mention classification, we leverage prototypical learning to capture the semantic representations for each labeled span and make the model better adapt to novel-class entities. To further improve the model performance, we split out the false positives generated by the span extractor but not labeled in the current episode set, and then present a margin-based loss to separate them from each prototype region. Experiments over multiple benchmarks demonstrate that our model outperforms strong baselines by a large margin.
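
The two stages named in the abstract can be illustrated with a minimal sketch. It assumes BIO-tagged input and toy two-dimensional span embeddings; the function names, the use of Euclidean distance, and the hinge form of the margin penalty are all assumptions made for illustration and are not taken from the authors' implementation.

```python
import math


def tags_to_boundary_matrix(tags):
    """Turn BIO tags into an n x n 0/1 matrix where
    matrix[i][j] == 1 iff tokens i..j (inclusive) form one entity span."""
    n = len(tags)
    matrix = [[0] * n for _ in range(n)]
    start = None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        if start is not None and not tag.startswith("I-"):
            matrix[start][i - 1] = 1
            start = None
        if tag.startswith("B-"):
            start = i
    return matrix


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_prototypes(support):
    """support: list of (span_embedding, label) pairs.
    Each prototype is the mean embedding of the spans with that label."""
    by_label = {}
    for emb, label in support:
        by_label.setdefault(label, []).append(emb)
    return {
        label: [sum(col) / len(embs) for col in zip(*embs)]
        for label, embs in by_label.items()
    }


def classify(span_emb, prototypes):
    """Assign the label of the nearest prototype."""
    return min(prototypes, key=lambda lab: euclidean(span_emb, prototypes[lab]))


def margin_loss(false_positive_emb, prototypes, margin):
    """Assumed hinge-style penalty: zero once the extracted-but-unlabeled
    span lies at least `margin` away from every prototype."""
    return sum(
        max(0.0, margin - euclidean(false_positive_emb, proto))
        for proto in prototypes.values()
    )


# Toy usage with made-up embeddings.
matrix = tags_to_boundary_matrix(["B-PER", "I-PER", "O", "B-LOC"])
support = [([1.0, 0.0], "PER"), ([3.0, 0.0], "PER"), ([0.0, 4.0], "LOC")]
prototypes = build_prototypes(support)
label = classify([1.5, 0.5], prototypes)
loss = margin_loss([2.0, 3.0], prototypes, margin=5.0)
```

In a trained model the embeddings would come from an encoder and the distances would drive a differentiable loss; the plain-list version here only shows how the boundary matrix, the prototypes, and the margin term relate to one another.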

Original language: English
Title of host publication: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Editors: Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Publisher: Association for Computational Linguistics (ACL)
Pages: 3466-3476
Number of pages: 11
ISBN (Electronic): 9781959429401
DOI
Publication status: Published - 2022
Event: 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 - Hybrid, Abu Dhabi, United Arab Emirates
Duration: 7 Dec 2022 to 11 Dec 2022

Publication series

Name: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022

Conference

Conference: 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Country/Territory: United Arab Emirates
Location: Hybrid, Abu Dhabi
Period: 7 Dec 2022 to 11 Dec 2022
