跳到主要导航 跳到搜索 跳到主要内容

Cross-domain knowledge distillation for retrieval-based question answering systems

  • Cen Chen
  • , Chengyu Wang
  • , Minghui Qiu
  • , Dehong Gao
  • , Linbo Jin
  • , Wang Li
  • Ant Group
  • Alibaba Group Holding Ltd.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Question Answering (QA) systems have been extensively studied in both academia and the research community due to their wide real-world applications. When building such industrial-scale QA applications, we are facing two prominent challenges, i.e., i) lacking a sufficient amount of training data to learn an accurate model and ii) requiring high inference speed for online model serving. There are generally two ways to mitigate the above-mentioned problems. One is to adopt transfer learning to leverage information from other domains; the other is to distill the "dark knowledge"from a large teacher model to small student models. The former usually employs parameter sharing mechanisms for knowledge transfer, but does not utilize the "dark knowledge"of pre-trained large models. The latter usually does not consider the cross-domain information from other domains. We argue that these two types of methods can be complementary to each other. Hence in this work, we provide a new perspective on the potential of the teacher-student paradigm facilitating cross-domain transfer learning, where the teacher and student tasks belong to heterogeneous domains, with the goal to improve the student model's performance in the target domain. Our framework considers the "dark knowledge"learned from large teacher models and also leverages the adaptive hints to alleviate the domain differences between teacher and student models. Extensive experiments have been conducted on two text matching tasks for retrieval-based QA systems. Results show the proposed method has better performance than the competing methods including the existing state-of-the-art transfer learning methods. We have also deployed our method in an online production system and observed significant improvements compared to the existing approaches in terms of both accuracy and cross-domain robustness.

源语言英语
主期刊名The Web Conference 2021 - Proceedings of the World Wide Web Conference, WWW 2021
出版商Association for Computing Machinery, Inc
2613-2623
页数11
ISBN(电子版)9781450383127
DOI
出版状态已出版 - 3 6月 2021
已对外发布
活动30th World Wide Web Conference, WWW 2021 - Ljubljana, 斯洛文尼亚
期限: 19 4月 202123 4月 2021

出版系列

姓名The Web Conference 2021 - Proceedings of the World Wide Web Conference, WWW 2021

会议

会议30th World Wide Web Conference, WWW 2021
国家/地区斯洛文尼亚
Ljubljana
时期19/04/2123/04/21

指纹

探究 'Cross-domain knowledge distillation for retrieval-based question answering systems' 的科研主题。它们共同构成独一无二的指纹。

引用此