跳到主要导航 跳到搜索 跳到主要内容

PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization

  • Jiayi Wu
  • , Hengyi Cai
  • , Lingyong Yan
  • , Hao Sun
  • , Xiang Li*
  • , Shuaiqiang Wang
  • , Dawei Yin
  • , Ming Gao
  • *此作品的通讯作者
  • East China Normal University
  • Chinese Academy of Sciences
  • Baidu Inc
  • Peking University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The emergence of Retrieval-augmented generation (RAG) has alleviated the issues of outdated and hallucinatory content in the generation of large language models (LLMs), yet it still reveals numerous limitations. When a general-purpose LLM serves as the RAG generator, it often suffers from inadequate response informativeness, response robustness, and citation quality. Past approaches to tackle these limitations, either by incorporating additional steps beyond generating responses or optimizing the generator through supervised fine-tuning (SFT), still failed to align with the RAG requirement thoroughly. Consequently, optimizing the RAG generator from multiple preference perspectives while maintaining its end-to-end LLM form remains a challenge. To bridge this gap, we propose Multiple Perspective Preference Alignment for Retrieval-Augmented Generation (PA-RAG), a method for optimizing the RAG generator to align with RAG requirements comprehensively. Specifically, we construct high-quality instruction fine-tuning data and multi-perspective preference data by sampling varied quality responses from the generator across different prompt documents quality scenarios. Subsequently, we optimize the generator using SFT and Direct Preference Optimization (DPO). Extensive experiments conducted on four question-answer datasets across three LLMs demonstrate that PA-RAG can significantly enhance the performance of RAG generators. Our code and datasets are available at https://github.com/wujwyi/PA-RAG.

源语言英语
主期刊名Long Papers
编辑Luis Chiruzzo, Alan Ritter, Lu Wang
出版商Association for Computational Linguistics (ACL)
9091-9112
页数22
ISBN(电子版)9798891761896
DOI
出版状态已出版 - 2025
活动2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2025 - Hybrid, Albuquerque, 美国
期限: 29 4月 20254 5月 2025

出版系列

姓名Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies: Long Papers, NAACL-HLT 2025
1

会议

会议2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2025
国家/地区美国
Hybrid, Albuquerque
时期29/04/254/05/25

指纹

探究 'PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization' 的科研主题。它们共同构成独一无二的指纹。

引用此