摘要
Fine-grained text processing is a significant domain in Natural Language Processing (NLP), including tasks such as long-document question answering, aspect-based sentiment analysis, and document summarization. Although Large Language Models (LLMs) perform excellent on many NLP tasks, they often exhibit hallucination, such as detail loss or inaccuracies in tasks that require handling fine-grained content. This shortcoming arises because LLMs’ final layers tend to lose attention to details compared to the middle layers. Existing optimization methods for LLMs lack a focus on attention mechanisms for fine-grained information. To address this issue, we propose a novel Cross-Layer Fine-Grained Attention Correction method (CiGA). CiGA includes two correction terms that integrate detail-oriented attention from middle layers into the final layers. Experimental results demonstrate that CiGA significantly improves LLMs’ performance on fine-grained text processing tasks.
| 源语言 | 英语 |
|---|---|
| 期刊 | Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing |
| DOI | |
| 出版状态 | 已出版 - 2025 |
| 活动 | 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, 印度 期限: 6 4月 2025 → 11 4月 2025 |
指纹
探究 'CiGA: A Cross-Layer Fine-Grained Attention Correction Method for Large Language Model' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver