Integrating word embeddings and traditional NLP features to measure textual entailment and semantic relatedness of sentence pairs

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

Recent years the distributed representations of words (i.e., word embeddings) have been shown to be able to significantly improve performance in many natural language processing tasks, such as pos-of-tag tagging, chunking, named entity recognition and sentiment polarity judgement, etc. However, previous tasks only involve a single sentence. In contrast, this paper evaluates the effectiveness of word embeddings in sentence pair classification or regression problems. Specifically, we propose novel simple yet effective features based on word embeddings and extract many traditional linguistic features. Then these features serve as input of a classification/regression algorithm in isolation and in combination. Evaluations are conducted on three sentence pair classification/regression tasks, i.e., textual entailment, cross-lingual textual entailment and semantic relatedness estimation. Experiments on benchmark datasets provided by Semantic Evaluation 2013 and 2014 showed that using word embeddings is able to significantly improve the performance and our results outperform the best achieved results so far.

Original languageEnglish
Title of host publication2015 International Joint Conference on Neural Networks, IJCNN 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781479919604, 9781479919604, 9781479919604, 9781479919604
DOIs
StatePublished - 28 Sep 2015
EventInternational Joint Conference on Neural Networks, IJCNN 2015 - Killarney, Ireland
Duration: 12 Jul 201517 Jul 2015

Publication series

NameProceedings of the International Joint Conference on Neural Networks
Volume2015-September

Conference

ConferenceInternational Joint Conference on Neural Networks, IJCNN 2015
Country/TerritoryIreland
CityKillarney
Period12/07/1517/07/15

Keywords

  • Atmospheric measurements
  • Classification algorithms
  • Particle measurements
  • Pragmatics
  • Support vector machines

Fingerprint

Dive into the research topics of 'Integrating word embeddings and traditional NLP features to measure textual entailment and semantic relatedness of sentence pairs'. Together they form a unique fingerprint.

Cite this