跳到主要导航 跳到搜索 跳到主要内容

Chinese organization name recognition based on multiple features

  • Yajuan Ling
  • , Jing Yang
  • , Liang He*
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recognition of Chinese organization names is the key of the recognition of Chinese named entities. However, the lack of a single unified naming system to capture all types of organizations and the uncertainty in word segmentation, make the recognition of Chinese organization names especially difficult. In this paper, we focus on the recognition of Chinese organization names and propose an approach that takes advantage of various types of features of Chinese organization names to address it. First of all, we pre-process inputs to make the recognition more convenient. Secondly, we use the features of the left and right boundary to determine the candidate Chinese organization names automatically. Thirdly, we evaluate and refine the initial recognition results with the features of behaviors and debugging structure patterns to improve the performance of the recognition. From the experimental results on People's Daily testing data set, the approach proposed in this paper outperforms the method based on role tagging more than 7%. And through designing a series of other experiments, we have proved that the proposed approach can perfectly complete the task of recognizing Chinese organization names and is particularly effective in nested cases.

源语言英语
主期刊名Intelligence and Security Informatics - Pacific Asia Workshop, PAISI 2012, Proceedings
136-144
页数9
DOI
出版状态已出版 - 2012
活动Pacific Asia Workshop on Intelligence and Security Informatics, PAISI 2012 - Kuala Lumpur, 马来西亚
期限: 29 5月 201229 5月 2012

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
7299 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议Pacific Asia Workshop on Intelligence and Security Informatics, PAISI 2012
国家/地区马来西亚
Kuala Lumpur
时期29/05/1229/05/12

指纹

探究 'Chinese organization name recognition based on multiple features' 的科研主题。它们共同构成独一无二的指纹。

引用此