Skip to main navigation Skip to search Skip to main content

Large Language Model Judged Self-Training for Named Entity Recognition

  • Shisong Chen
  • , Jiaan Wang
  • , Chengyi Yang
  • , Yanghua Xiao*
  • , Zhixu Li*
  • , Xin Lin
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Self-training for Named Entity Recognition (NER) aims at identifying named entities and their types in the text using self-training to fully make use of the limited labeled data and a large amount of unlabeled data. The major challenge in self-training is confirmation bias where incorrect pseudo-labels increase errors. Many efforts have been made to address this challenge, but few labeled data limit their performance. In this paper, we introduce Large Language Model (LLM) into self-training to select high-quality pseudo-labels leveraging its rich knowledge and few-shot learning capability. Specifically, we design a comprehensive prompt to improve the judgment performance of LLM, where the prompt incorporates task rules mined by LLM itself to fully leverage labeled data. In addition, to reduce the impact of LLM's hallucinations, we adopt a collaborative pseudo-label selection based on combined confidence and calibration-guided probability smoothing. Our empirical study conducted on several NER datasets shows that our method outperforms state-of-the-art approaches. The code is available at https://github.com/cheniison/llm-judged-ST.

Original languageEnglish
Title of host publicationWSDM 2026 - Proceedings of the 19th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery, Inc
Pages69-78
Number of pages10
ISBN (Electronic)9798400722929
DOIs
StatePublished - 21 Feb 2026
Event19th ACM International Conference on Web Search and Data Mining, WSDM 2026 - Boise, United States
Duration: 22 Feb 202626 Feb 2026

Publication series

NameWSDM 2026 - Proceedings of the 19th ACM International Conference on Web Search and Data Mining

Conference

Conference19th ACM International Conference on Web Search and Data Mining, WSDM 2026
Country/TerritoryUnited States
CityBoise
Period22/02/2626/02/26

Keywords

  • large language model
  • named entity recognition
  • self-training

Fingerprint

Dive into the research topics of 'Large Language Model Judged Self-Training for Named Entity Recognition'. Together they form a unique fingerprint.

Cite this