Learning topic-oriented word embedding for query classification

Hebin Yang*, Qinmin Hu, Liang He

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

In this paper, we propose a topic-oriented word embedding approach to address the query classification problem. First, the topic information is encoded to generate query categories. Then, the user clickthrough information is also incorporated in the modified word embedding algorithms. After that, the short and ambiguous queries are enriched to be classified in a supervised learning way. The unique contributions are that we present four neural network strategies based on the proposed model. The experiments are designed on two open data sets, namely Baidu and Sogou, which are two famous commercial search companies. Our evaluation results show that the proposed approach is promising on both large data sets. Under the four proposed strategies, we achieve the high performance as 95. 73% in terms of Precision, 97. 79% in terms of the F1 measure.

Original languageEnglish
Title of host publicationAdvances in Knowledge Discovery and Data Mining - 19th Pacific-Asia Conference, PAKDD 2015, Proceedings
EditorsTu-Bao Ho, Hiroshi Motoda, Hiroshi Motoda, Ee-Peng Lim, Tru Cao, David Cheung, Zhi-Hua Zhou
PublisherSpringer Verlag
Pages188-198
Number of pages11
ISBN (Print)9783319180373
DOIs
StatePublished - 2015
Event19th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2015 - Ho Chi Minh City, Viet Nam
Duration: 19 May 201522 May 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9077
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2015
Country/TerritoryViet Nam
CityHo Chi Minh City
Period19/05/1522/05/15

Keywords

  • Query classification
  • Supervised learning
  • Word embedding
  • Word2vec

Fingerprint

Dive into the research topics of 'Learning topic-oriented word embedding for query classification'. Together they form a unique fingerprint.

Cite this