跳到主要导航 跳到搜索 跳到主要内容

Sequence Labeling With Meta-Learning

  • Jing Li
  • , Peng Han*
  • , Xiangnan Ren
  • , Jilin Hu
  • , Lisi Chen
  • , Shuo Shang*
  • *此作品的通讯作者
  • University of Electronic Science and Technology of China
  • King Abdullah University of Science and Technology
  • Inception Institute of Artificial Intelligence
  • Aalborg University

科研成果: 期刊稿件文章同行评审

摘要

Recent neural architectures in sequence labeling have yielded state-of-the-art performance on single domain data such as newswires. However, they still suffer from (i) requiring massive amounts of training data to avoid overfitting; (ii) huge performance degradation when there is a domain shift in the data distribution between training and testing. To make a sequence labeling system more broadly useful, it is crucial to reduce its training data requirements and transfer knowledge to other domains. In this paper, we investigate the problem of domain adaptation for sequence labeling under homogeneous and heterogeneous settings. We propose MetaSeq, a novel meta-learning approach for domain adaptation in sequence labeling. Specifically, MetaSeq incorporates meta-learning and adversarial training strategies to encourage robust, general and transferable representations for sequence labeling. The key advantage of MetaSeq is that it is capable of adapting to new unseen domains with a small amount of annotated data from those domains. We extensively evaluate MetaSeq on named entity recognition, part-of-speech tagging and slot filling under homogeneous and heterogeneous settings. The experimental results show that MetaSeq achieves state-of-the-art performance against eight baselines. Impressively, MetaSeq surpasses the in-domain performance using only 16.17% and 7% of target domain data on average for homogeneous settings, and 34.76%, 24%, 22.5% of target domain data on average for heterogeneous settings.

源语言英语
页(从-至)3072-3086
页数15
期刊IEEE Transactions on Knowledge and Data Engineering
35
3
DOI
出版状态已出版 - 1 3月 2023
已对外发布

指纹

探究 'Sequence Labeling With Meta-Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此