跳到主要导航 跳到搜索 跳到主要内容

Hypernetwork-Assisted Parameter-Efficient Fine-Tuning with Meta-Knowledge Distillation for Domain Knowledge Disentanglement

  • East China Normal University
  • Shanghai AI Laboratory
  • Shanghai Key Laboraiory of Multdimensional Infomation Procesing

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Domain adaptation from labeled source domains to the target domain is important in practical summarization scenarios. However, the key challenge is domain knowledge disentanglement. In this work, we explore how to disentangle domain-invariant knowledge from source domains while learning specific knowledge of the target domain. Specifically, we propose a hypernetwork-assisted encoder-decoder architecture with parameter-efficient fine-tuning. It leverages a hypernetwork instruction learning module to generate domain-specific parameters from the encoded inputs accompanied by task-related instruction. Further, to better disentangle and transfer knowledge from source domains to the target domain, we introduce a meta-knowledge distillation strategy to build a meta-teacher model that captures domain-invariant knowledge across multiple domains and use it to transfer knowledge to students. Experiments on three dialogue summarization datasets show the effectiveness of the proposed model. Human evaluations also show the superiority of our model with regard to the summary generation quality.

源语言英语
主期刊名Findings of the Association for Computational Linguistics
主期刊副标题NAACL 2024 - Findings
编辑Kevin Duh, Helena Gomez, Steven Bethard
出版商Association for Computational Linguistics (ACL)
1681-1695
页数15
ISBN(电子版)9798891761193
DOI
出版状态已出版 - 2024
活动2024 Findings of the Association for Computational Linguistics: NAACL 2024 - Hybrid, Mexico City, 墨西哥
期限: 16 6月 202421 6月 2024

出版系列

姓名Findings of the Association for Computational Linguistics: NAACL 2024 - Findings

会议

会议2024 Findings of the Association for Computational Linguistics: NAACL 2024
国家/地区墨西哥
Hybrid, Mexico City
时期16/06/2421/06/24

指纹

探究 'Hypernetwork-Assisted Parameter-Efficient Fine-Tuning with Meta-Knowledge Distillation for Domain Knowledge Disentanglement' 的科研主题。它们共同构成独一无二的指纹。

引用此