跳到主要导航 跳到搜索 跳到主要内容

Chimera Model of Candidate Soups for Non-Autoregressive Translation

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Non-Autoregressive Translation (NAT) models have drawn much attention because of their excellent decoding speed. However, NAT models suffer a significant drop in translation quality compared to Autoregressive Translation (AT) models. Candidate Soups (CandiSoups) is an effective method that can fully use the different candidate translations, significantly improving the translation quality for NAT models. However, it needs to use an additional AT model for re-scoring to achieve the best performance, which slows down its inference speed and takes up more computing resources. In this paper, we propose a Chimera Model framework of CandiSoups (CMCS), which can significantly accelerate inference speed while maintaining superior performance for CandiSoups. Specifically, by modifying the decoder, we fuse the AT and NAT models to construct a Chimera Model that can perform self-rescore. Moreover, we propose a novel adaptive training method to help train Chimera Models better. Experimental results on two major benchmarks demonstrate the effectiveness of our proposed approach, which can significantly improve translation quality while maintaining the excellent inference speed.

源语言英语
主期刊名Database Systems for Advanced Applications - 29th International Conference, DASFAA 2024, Proceedings
编辑Makoto Onizuka, Jae-Gil Lee, Yongxin Tong, Chuan Xiao, Yoshiharu Ishikawa, Kejing Lu, Sihem Amer-Yahia, H.V. Jagadish
出版商Springer Science and Business Media Deutschland GmbH
416-425
页数10
ISBN(印刷版)9789819757787
DOI
出版状态已出版 - 2025
活动29th International Conference on Database Systems for Advanced Applications, DASFAA 2024 - Gifu, 日本
期限: 2 7月 20245 7月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14851 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议29th International Conference on Database Systems for Advanced Applications, DASFAA 2024
国家/地区日本
Gifu
时期2/07/245/07/24

指纹

探究 'Chimera Model of Candidate Soups for Non-Autoregressive Translation' 的科研主题。它们共同构成独一无二的指纹。

引用此