An adversarial joint learning model for low-resource language semantic textual similarity

  • Junfeng Tian
  • , Man Lan*
  • , Yuanbin Wu
  • , Jingang Wang
  • , Long Qiu
  • , Sheng Li
  • , Lang Jun
  • , Luo Si
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Semantic Textual Similarity (STS) of low-resource language is a challenging research problem with practical applications. Traditional solutions employ machine translation techniques to translate the low-resource languages to some resource-rich languages such as English. Hence, the final performance is highly dependent on the quality of machine translation. To decouple the machine translation dependency while still take advantage of the data in resource-rich languages, this work proposes to jointly learn the low-resource language STS task and that of a resource-rich one, which only relies on multilingual word embeddings. In particular, we project the low-resource language word embeddings into the semantic space of the resource-rich language via a translation matrix. To make the projected word embeddings resemble that of the resource-rich language, a language discriminator is introduced as an adversarial teacher. Thus the parameters of sentence similarity neural networks of two tasks can be effectively shared. The plausibility of our model is demonstrated by extensive experimental results.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 40th European Conference on IR Research, ECIR 2018, Proceedings
EditorsLeif Azzopardi, Gabriella Pasi, Allan Hanbury, Benjamin Piwowarski
PublisherSpringer Verlag
Pages89-101
Number of pages13
ISBN (Print)9783319769400
DOIs
StatePublished - 2018
Event40th European Conference on Information Retrieval, ECIR 2018 - Grenoble, France
Duration: 26 Mar 201829 Mar 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10772 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference40th European Conference on Information Retrieval, ECIR 2018
Country/TerritoryFrance
CityGrenoble
Period26/03/1829/03/18

Keywords

  • Adversarial learning
  • Low-resource language
  • Neural networks
  • Semantic textual similarity

Fingerprint

Dive into the research topics of 'An adversarial joint learning model for low-resource language semantic textual similarity'. Together they form a unique fingerprint.

Cite this