跳到主要导航 跳到搜索 跳到主要内容

D2LLM: Decomposed and Distilled Large Language Models for Semantic Search

  • East China Normal University
  • Ant Group

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The key challenge in semantic search is to create models that are both accurate and efficient in pinpointing relevant sentences for queries. While BERT-style bi-encoders excel in efficiency with pre-computed embeddings, they often miss subtle nuances in search tasks. Conversely, GPT-style LLMs with cross-encoder designs capture these nuances but are computationally intensive, hindering real-time applications. In this paper, we present D2LLMs-Decomposed and Distilled LLMs for semantic search-that combines the best of both worlds. We decompose a cross-encoder into an efficient bi-encoder integrated with Pooling by Multihead Attention and an Interaction Emulation Module, achieving nuanced understanding and pre-computability. Knowledge from the LLM is distilled into this model using contrastive, rank, and feature imitation techniques. Our experiments show that D2LLM surpasses five leading baselines in terms of all metrics across three tasks, particularly improving NLI task performance by at least 6.45%. The source code is available at https://github.com/codefuse-ai/D2LLM.

源语言英语
主期刊名Long Papers
编辑Lun-Wei Ku, Andre F. T. Martins, Vivek Srikumar
出版商Association for Computational Linguistics (ACL)
14798-14814
页数17
ISBN(电子版)9798891760943
DOI
出版状态已出版 - 2024
活动62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Bangkok, 泰国
期限: 11 8月 202416 8月 2024

出版系列

姓名Proceedings of the Annual Meeting of the Association for Computational Linguistics
1
ISSN(印刷版)0736-587X

会议

会议62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
国家/地区泰国
Bangkok
时期11/08/2416/08/24

指纹

探究 'D2LLM: Decomposed and Distilled Large Language Models for Semantic Search' 的科研主题。它们共同构成独一无二的指纹。

引用此