跳到主要导航 跳到搜索 跳到主要内容

CiGA: A Cross-Layer Fine-Grained Attention Correction Method for Large Language Model

  • East China Normal University

科研成果: 期刊稿件会议文章同行评审

摘要

Fine-grained text processing is a significant domain in Natural Language Processing (NLP), including tasks such as long-document question answering, aspect-based sentiment analysis, and document summarization. Although Large Language Models (LLMs) perform excellent on many NLP tasks, they often exhibit hallucination, such as detail loss or inaccuracies in tasks that require handling fine-grained content. This shortcoming arises because LLMs’ final layers tend to lose attention to details compared to the middle layers. Existing optimization methods for LLMs lack a focus on attention mechanisms for fine-grained information. To address this issue, we propose a novel Cross-Layer Fine-Grained Attention Correction method (CiGA). CiGA includes two correction terms that integrate detail-oriented attention from middle layers into the final layers. Experimental results demonstrate that CiGA significantly improves LLMs’ performance on fine-grained text processing tasks.

指纹

探究 'CiGA: A Cross-Layer Fine-Grained Attention Correction Method for Large Language Model' 的科研主题。它们共同构成独一无二的指纹。

引用此