A new method to compute the word relevance in news corpus

Liu Jinpan, He Liang, Lin Xin, Xu Mingmin, Lu Wei

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper we propose a new method to compute the relevance of term in news corpus. According to the characteristics of news corpus , we first propose that the news corpus should be divided into different channels, second we make use of the feature of news document , we divide the co-occurrence of terms into two cases, on the one hand the co-occurrence in the title of the news, On the other hand the co-occurrence in the news text, we use different methods to compute the co-occurrence. In the end, we introduce the web corpus Wikipedia to overcome some shortcomings of the news corpus

Original languageEnglish
Title of host publicationProceedings - 2010 2nd International Workshop on Intelligent Systems and Applications, ISA 2010
DOIs
StatePublished - 2010
Event2nd International Workshop on Intelligent Systems and Applications, ISA2010 - Wuhan, China
Duration: 22 May 201023 May 2010

Publication series

NameProceedings - 2010 2nd International Workshop on Intelligent Systems and Applications, ISA 2010

Conference

Conference2nd International Workshop on Intelligent Systems and Applications, ISA2010
Country/TerritoryChina
CityWuhan
Period22/05/1023/05/10

Keywords

  • Component
  • News corpus
  • Term co-occurrence
  • Wikipedia
  • Word relatedness

Cite this