跳到主要导航 跳到搜索 跳到主要内容

Evaluating the Performance of Complex Text Generated by Large Language Models

  • Fenglin Bi
  • , Yantong Wang
  • , Fanyu Han
  • , Zhi Li
  • , Tao Hu
  • , Yanbin Zhang
  • , Wei Wang*
  • *此作品的通讯作者
  • East China Normal University
  • ByteDance Ltd.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The rapid advancement of Large Language Models (LLMs) has significantly enhanced text generation quality in Natural Language Processing (NLP). However, practical applications impose complex requirements, particularly in fields such as generating analysis reports. This study systematically reviews evaluation methods for LLMs and proposes a general framework for complex text generation, encompassing Generation Content, Prompt Dimension, Retrieval-Augmented Generation (RAG), and LLM Fine-tuning. We first introduce the Complex Text Generation Task Evaluation Paradigm. Based on this paradigm, we identify 15 sub-indicators with corresponding evaluation methods to comprehensively assess and improve LLM performance. Our research fills gaps in existing evaluation systems and provides a scalable framework for future studies, enhancing the applicability and impact of LLMs across various domains.

源语言英语
主期刊名Intelligent Computers, Algorithms, and Applications - 4th BenchCouncil International Symposium, IC 2024, Revised Selected Papers
编辑Chunjie Luo, Weiping Li
出版商Springer Science and Business Media Deutschland GmbH
151-167
页数17
ISBN(印刷版)9789819663095
DOI
出版状态已出版 - 2025
活动4th BenchCouncil International Symposium on Intelligent Computers, Algorithms, and Applications, IC 2024 - Guangzhou, 中国
期限: 4 12月 20246 12月 2024

出版系列

姓名Communications in Computer and Information Science
2517 CCIS
ISSN(印刷版)1865-0929
ISSN(电子版)1865-0937

会议

会议4th BenchCouncil International Symposium on Intelligent Computers, Algorithms, and Applications, IC 2024
国家/地区中国
Guangzhou
时期4/12/246/12/24

指纹

探究 'Evaluating the Performance of Complex Text Generated by Large Language Models' 的科研主题。它们共同构成独一无二的指纹。

引用此