Chinese organization name recognition based on multiple features

  • Yajuan Ling
  • , Jing Yang
  • , Liang He*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Recognition of Chinese organization names is the key of the recognition of Chinese named entities. However, the lack of a single unified naming system to capture all types of organizations and the uncertainty in word segmentation, make the recognition of Chinese organization names especially difficult. In this paper, we focus on the recognition of Chinese organization names and propose an approach that takes advantage of various types of features of Chinese organization names to address it. First of all, we pre-process inputs to make the recognition more convenient. Secondly, we use the features of the left and right boundary to determine the candidate Chinese organization names automatically. Thirdly, we evaluate and refine the initial recognition results with the features of behaviors and debugging structure patterns to improve the performance of the recognition. From the experimental results on People's Daily testing data set, the approach proposed in this paper outperforms the method based on role tagging more than 7%. And through designing a series of other experiments, we have proved that the proposed approach can perfectly complete the task of recognizing Chinese organization names and is particularly effective in nested cases.

Original languageEnglish
Title of host publicationIntelligence and Security Informatics - Pacific Asia Workshop, PAISI 2012, Proceedings
Pages136-144
Number of pages9
DOIs
StatePublished - 2012
EventPacific Asia Workshop on Intelligence and Security Informatics, PAISI 2012 - Kuala Lumpur, Malaysia
Duration: 29 May 201229 May 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7299 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferencePacific Asia Workshop on Intelligence and Security Informatics, PAISI 2012
Country/TerritoryMalaysia
CityKuala Lumpur
Period29/05/1229/05/12

Keywords

  • Behavior feature
  • Chinese organization name recognition
  • Core feature word
  • Debugging structure patterns
  • Leftbounder rule

Fingerprint

Dive into the research topics of 'Chinese organization name recognition based on multiple features'. Together they form a unique fingerprint.

Cite this