跳到主要导航 跳到搜索 跳到主要内容

Generation with Dynamic Vocabulary

  • East China Normal University
  • Fudan University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

We introduce a new dynamic vocabulary for language models.It can involve arbitrary text spans during generation.These text spans act as basic generation bricks, akin to tokens in the traditional static vocabularies.We show that, the ability to generate multi-tokens atomically improve both generation quality and efficiency (compared to the standard language model, the MAUVE metric is increased by 25%, the latency is decreased by 20%).The dynamic vocabulary can be deployed in a plug-and-play way, thus is attractive for various downstream applications.For example, we demonstrate that dynamic vocabulary can be applied to different domains in a training-free manner.It also helps to generate reliable citations in question answering tasks (substantially enhancing citation results without compromising answer accuracy).

源语言英语
主期刊名EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
编辑Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
出版商Association for Computational Linguistics (ACL)
18931-18948
页数18
ISBN(电子版)9798891761643
DOI
出版状态已出版 - 2024
活动2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024 - Hybrid, Miami, 美国
期限: 12 11月 202416 11月 2024

出版系列

姓名EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

会议

会议2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024
国家/地区美国
Hybrid, Miami
时期12/11/2416/11/24

指纹

探究 'Generation with Dynamic Vocabulary' 的科研主题。它们共同构成独一无二的指纹。

引用此