TY - GEN
T1 - CoT Reasoning-Based Content Adaptation and Image Generation for Chinese Poetry
AU - Chen, Yutong
AU - Chen, Jihao
AU - Jiang, Jiaqi
AU - Chen, Songtao
AU - He, Gaoqi
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - When users employ LLMs to find suitable ancient Chinese poems or generate prompts for text-to-image models, LLMs often misunderstand poetic elements or metaphors, which leads to false or inaccurate generated content. In this paper, we propose PCOT (Poetry Chain-of-Thought), a method that improves the accuracy of poem content adaptation and poetry image generation by integrating poetry database and verification mechanism to enhance Chain-of-Thought (CoT) reasoning of LLMs. Through these techniques, PCOT is able to better analyze key semantic elements, emotions and metaphors, which then enables more accurate retrieval of poems that align with user intent, and generation of high-quality poetry-to-image prompts. Evaluations demonstrate that our approach outperforms pure LLM in higher accuracy and semantic consistency on tasks of poetry content adaptation and poetry image prompt generation.
AB - When users employ LLMs to find suitable ancient Chinese poems or generate prompts for text-to-image models, LLMs often misunderstand poetic elements or metaphors, which leads to false or inaccurate generated content. In this paper, we propose PCOT (Poetry Chain-of-Thought), a method that improves the accuracy of poem content adaptation and poetry image generation by integrating poetry database and verification mechanism to enhance Chain-of-Thought (CoT) reasoning of LLMs. Through these techniques, PCOT is able to better analyze key semantic elements, emotions and metaphors, which then enables more accurate retrieval of poems that align with user intent, and generation of high-quality poetry-to-image prompts. Evaluations demonstrate that our approach outperforms pure LLM in higher accuracy and semantic consistency on tasks of poetry content adaptation and poetry image prompt generation.
KW - Chain of Thought Reasoning
KW - Chinese Classical Poetry
KW - Cultural Computing
KW - Text to Image Generation
UR - https://www.scopus.com/pages/publications/105012423401
U2 - 10.1007/978-981-95-0020-8_4
DO - 10.1007/978-981-95-0020-8_4
M3 - 会议稿件
AN - SCOPUS:105012423401
SN - 9789819500192
T3 - Lecture Notes in Computer Science
SP - 40
EP - 51
BT - Advanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
A2 - Huang, De-Shuang
A2 - Zhang, Qinhu
A2 - Zhang, Chuanlei
A2 - Chen, Wei
PB - Springer Science and Business Media Deutschland GmbH
T2 - 21st International Conference on Intelligent Computing, ICIC 2025
Y2 - 26 July 2025 through 29 July 2025
ER -