ECNU: Using Traditional Similarity Measurements and Word Embedding for Semantic Textual Similarity Estimation

  • Jiang Zhao
  • , Man Lan*
  • , Jun Feng Tian
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Scopus citations

Abstract

This paper reports our submissions to semantic textual similarity task, i.e., task 2 in Semantic Evaluation 2015. We built our systems using various traditional features, such as string-based, corpus-based and syntactic similarity metrics, as well as novel similarity measures based on distributed word representations, which were trained using deep learning paradigms. Since the training and test datasets consist of instances collected from various domains, three different strategies of the usage of training datasets were explored: (1) use all available training datasets and build a unified supervised model for all test datasets; (2) select the most similar training dataset and separately construct a individual model for each test set; (3) adopt multi-task learning framework to make full use of available training sets. Results on the test datasets show that using all datasets as training set achieves the best averaged performance and our best system ranks 15 out of 73.

Original languageEnglish
Title of host publicationSemEval 2015 - 9th International Workshop on Semantic Evaluation, co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, NAACL-HLT 2015 - Proceedings
EditorsPreslav Nakov, Torsten Zesch, Daniel Cer, David Jurgens
PublisherAssociation for Computational Linguistics (ACL)
Pages117-122
Number of pages6
ISBN (Electronic)9781941643402
DOIs
StatePublished - 2015
Event9th International Workshop on Semantic Evaluation, SemEval 2015 co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015 - Denver, United States
Duration: 4 Jun 20155 Jun 2015

Publication series

NameSemEval 2015 - 9th International Workshop on Semantic Evaluation, co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015 - Proceedings

Conference

Conference9th International Workshop on Semantic Evaluation, SemEval 2015 co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015
Country/TerritoryUnited States
CityDenver
Period4/06/155/06/15

Fingerprint

Dive into the research topics of 'ECNU: Using Traditional Similarity Measurements and Word Embedding for Semantic Textual Similarity Estimation'. Together they form a unique fingerprint.

Cite this