Skip to main navigation Skip to search Skip to main content

A new method of computing chinese word similarity based on statistics

  • Bo Zhang
  • , Lei Hong
  • , Shubin Song
  • , Liang He*
  • , Guorong Li
  • *Corresponding author for this work
  • East China Normal University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Word semantic similarity is a very subjective concept and it is very difficult to get a similarity value close to human judgment. Chinese word semantic similarity research is relatively scarce due to its inherent complexity. This paper presents an approach to compute Chinese word semantic similarity based on statistical methods with word frequency contrast introduced (WFC-WS). Word semantic vectors are first obtained using co-occurrence and then extended with HIT-IR Tongyici Cilin (Extended). Word frequency contrast is introduced to filter the semantic vectors. Experiments show that the results of WFC-WS are closer to artificial standard compared with some similar methods.

Original languageEnglish
Title of host publicationProceedings of the 2012 5th International Conference on Business Intelligence and Financial Engineering, BIFE 2012
Pages43-46
Number of pages4
DOIs
StatePublished - 2012
Event2012 5th International Conference on Business Intelligence and Financial Engineering, BIFE 2012 - Lanzhou, Gansu, China
Duration: 18 Aug 201221 Aug 2012

Publication series

NameProceedings of the 2012 5th International Conference on Business Intelligence and Financial Engineering, BIFE 2012

Conference

Conference2012 5th International Conference on Business Intelligence and Financial Engineering, BIFE 2012
Country/TerritoryChina
CityLanzhou, Gansu
Period18/08/1221/08/12

Keywords

  • Co-occurrence
  • Semantic similarity
  • Tongyici cilin

Fingerprint

Dive into the research topics of 'A new method of computing chinese word similarity based on statistics'. Together they form a unique fingerprint.

Cite this