FastNER: Speeding up Inferences for Named Entity Recognition Tasks

Yuming Zhang, Xiangxiang Gao, Wei Zhu, Xiaoling Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

BERT and its variants are the most performing models for named entity recognition (NER), a fundamental information extraction task. We must apply inference speedup methods for BERT-based NER models to be deployed in the industrial setting. Early exiting allows the model to use only the shallow layers to process easy samples, thus reducing the average latency. In this work, we introduce FastNER, a novel framework for early exiting with a BERT biaffine NER model, which supports both flat NER tasks and nested NER tasks. First, we introduce a convolutional bypass module to provide suitable features for the current layer’s biaffine prediction head. This way, an intermediate layer can focus more on delivering high-quality semantic representations for the next layer. Second, we introduce a series of early exiting mechanisms for BERT biaffine model, which is the first in the literature. We conduct extensive experiments on 6 benchmark NER datasets, 3 of which are nested NER tasks. The experiments show that: (a) Our proposed convolutional bypass method can significantly improve the overall performances of the multi-exit BERT biaffine NER model. (b) our proposed early exiting mechanisms can effectively speed up the inference of BERT biaffine model. Comprehensive ablation studies are conducted and demonstrate the validity of our design for our FastNER framework.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 19th International Conference, ADMA 2023, Proceedings
EditorsXiaochun Yang, Bin Wang, Heru Suhartanto, Guoren Wang, Jing Jiang, Bing Li, Huaijie Zhu, Ningning Cui
PublisherSpringer Science and Business Media Deutschland GmbH
Pages185-199
Number of pages15
ISBN (Print)9783031466601
DOIs
StatePublished - 2023
Event19th International Conference on Advanced Data Mining and Applications, ADMA 2023 - Shenyang, China
Duration: 21 Aug 202323 Aug 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14176 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Advanced Data Mining and Applications, ADMA 2023
Country/TerritoryChina
CityShenyang
Period21/08/2323/08/23

Keywords

  • Early Exiting
  • Inference speed-up
  • Pre-trained language models

Fingerprint

Dive into the research topics of 'FastNER: Speeding up Inferences for Named Entity Recognition Tasks'. Together they form a unique fingerprint.

Cite this