跳到主要导航 跳到搜索 跳到主要内容

Cross-model Control: Improving Multiple Large Language Models in One-time Training

  • Jiayi Wu
  • , Hao Sun
  • , Hengyi Cai
  • , Lixin Su
  • , Shuaiqiang Wang
  • , Dawei Yin
  • , Xiang Li*
  • , Ming Gao
  • *此作品的通讯作者
  • East China Normal University
  • Peking University
  • Chinese Academy of Sciences
  • Baidu Inc
  • Guizhou Zhuwen ECNU Data Power Institute

科研成果: 期刊稿件会议文章同行评审

摘要

The number of large language models (LLMs) with varying parameter scales and vocabularies is increasing. While they deliver powerful performance, they also face a set of common optimization needs to meet specific requirements or standards, such as instruction following or avoiding the output of sensitive information from the real world. However, how to reuse the fine-tuning outcomes of one model to other models to reduce training costs remains a challenge. To bridge this gap, we introduce Cross-model Control (CMC), a method that improves multiple LLMs in one-time training with a portable tiny language model. Specifically, we have observed that the logit shift before and after fine-tuning is remarkably similar across different models. Based on this insight, we incorporate a tiny language model with a minimal number of parameters. By training alongside a frozen template LLM, the tiny model gains the capability to alter the logits output by the LLMs. To make this tiny language model applicable to models with different vocabularies, we propose a novel token mapping strategy named PM-MinED. We have conducted extensive experiments on instruction tuning and unlearning tasks, demonstrating the effectiveness of CMC. Our code is available at https://github.com/wujwyi/CMC.

源语言英语
期刊Advances in Neural Information Processing Systems
37
出版状态已出版 - 2024
活动38th Conference on Neural Information Processing Systems, NeurIPS 2024 - Vancouver, 加拿大
期限: 9 12月 202415 12月 2024

指纹

探究 'Cross-model Control: Improving Multiple Large Language Models in One-time Training' 的科研主题。它们共同构成独一无二的指纹。

引用此