Chinese HowNet-based multi-factor word similarity algorithm integrated of result modification

Benbin Wu, Jing Yang, Liang He

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In this paper, we firstly describe a novel approach to calculate the Chinese sememe similarity based on the HowNet hierarchical sememe tree. When we calculate the sememe similarity, we not only take Semantic Distance, Node Depth and Semantic Coincidence Degree into consideration, but also propose two impact factors named Node Environment Dense (NED) and Node Layer Ratio (NLR) to optimize the calculation process. Secondly, quite a few words described by identical concept definition in HowNet should have a certain discrimination according to human perception, so we propose a hybrid modification algorithm integrated of TongYiCi CiLin (hereinafter, CiLin) to deal with this case. Experiment results of the HowNet-based multi-factor similarity hybrid algorithm shows that this approach improves the similarity of independent sememe words and the words having identical concept descriptions in HowNet, while no large bias influence on the similarity of other words.

Original languageEnglish
Title of host publicationNeural Information Processing - 19th International Conference, ICONIP 2012, Proceedings
Pages256-266
Number of pages11
EditionPART 5
DOIs
StatePublished - 2012
Event19th International Conference on Neural Information Processing, ICONIP 2012 - Doha, Qatar
Duration: 12 Nov 201215 Nov 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 5
Volume7667 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Neural Information Processing, ICONIP 2012
Country/TerritoryQatar
CityDoha
Period12/11/1215/11/12

Keywords

  • Density
  • HowNet
  • TongYiCi CiLin
  • Word Discrimination

Fingerprint

Dive into the research topics of 'Chinese HowNet-based multi-factor word similarity algorithm integrated of result modification'. Together they form a unique fingerprint.

Cite this