跳到主要导航 跳到搜索 跳到主要内容

OnMKD: An Online Mutual Knowledge Distillation Framework for Passage Retrieval

  • Jiali Deng
  • , Dongyang Li
  • , Taolin Zhang
  • , Xiaofeng He*
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Dense passage retriever recalls a set of relevant passages from a large corpus according to a natural language question. The dual-encoder architecture is prevalent in dense passage retrievers, which is based on large-scale pre-trained language models (PLMs). However, existing PLMs usually have thick structures and bulky parameters, resulting in large memory and time consumption. To overcome the limitation of PLMs, in this paper we apply online distillation to passage retrieval and propose an Online Mutual Knowledge Distillation framework (OnMKD). Specifically, we obtain a lightweight retriever by simultaneously updating two peer networks with the same dual-encoder structure and different initial parameters, named Online Mutual Knowledge Refinement. To further interact with the latent knowledge of intermediate layers, we utilize a novel cross-wise contrastive loss to alternate the representation of questions and passages. Experimental results indicate that our framework outperforms other small baselines with the same number of layers on multiple QA benchmarks. Compared to the heavy PLMs, OnMKD significantly accelerates the inference process and reduces storage requirements with only a slight sacrifice in performance.

源语言英语
主期刊名Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings
编辑Fei Liu, Nan Duan, Qingting Xu, Yu Hong
出版商Springer Science and Business Media Deutschland GmbH
719-731
页数13
ISBN(印刷版)9783031446955
DOI
出版状态已出版 - 2023
活动12th National CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2023 - Foshan, 中国
期限: 12 10月 202315 10月 2023

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14303 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议12th National CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2023
国家/地区中国
Foshan
时期12/10/2315/10/23

指纹

探究 'OnMKD: An Online Mutual Knowledge Distillation Framework for Passage Retrieval' 的科研主题。它们共同构成独一无二的指纹。

引用此