跳到主要导航 跳到搜索 跳到主要内容

A Unified Framework for Knowledge-Intensive Numerical Reasoning over Financial Document

  • Long Yin
  • , Kai Yin*
  • , Hui Zhao*
  • *此作品的通讯作者
  • East China Normal University
  • The University of Auckland

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Numerical reasoning in financial document analysis constitutes a fundamental challenge for corporate financial report understanding, drawing increasing attention in both academia and industry. However, current approaches suffer from semantic misalignment in multi-hierarchical tables, reasoning disruptions from insufficient integration of financial metric formulas, and sensitivity of the model’s reasoning results to the order of evidence. To address these challenges, we propose a unified framework for knowledge-intensive numerical reasoning over financial documents. Within this framework, we introduce a Triplets-based Multi-Stage Tabular-Textual Hybrid Evidence Retrieval (THER) method to resolve semantic misalignment by converting multi-hierarchical tables into triplet representations. Furthermore, we propose the Fine Grained Knowledge Injected Chain-of-Thought (FGKI-CoT) method to enhance numerical reasoning by explicitly integrating financial conceptual formulas into the reasoning path. Building on FGKI-CoT, we introduce the Evidence Order Sampling based Self-Consistency (EOSC) method, which mitigates the model’s sensitivity to evidence order by altering the input evidence sequence. Experiments demonstrate that our framework enables a 1.5B-parameter language model to outperform GPT-3.5-turbo by 3.95% in numerical reasoning on the Multihiertt Dev dataset. Additionally, we conduct supplementary experiments to further explore the impact of table representations and reasoning step expressions on the numerical reasoning performance of language models.

源语言英语
主期刊名Document Analysis and Recognition – ICDAR 2025 - 19th International Conference, Proceedings
编辑Xu-Cheng Yin, Dimosthenis Karatzas, Daniel Lopresti
出版商Springer Science and Business Media Deutschland GmbH
38-59
页数22
ISBN(印刷版)9783032046260
DOI
出版状态已出版 - 2026
活动19th International Conference on Document Analysis and Recognition, ICDAR 2025 - Wuhan, 中国
期限: 16 9月 202521 9月 2025

出版系列

姓名Lecture Notes in Computer Science
16026 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议19th International Conference on Document Analysis and Recognition, ICDAR 2025
国家/地区中国
Wuhan
时期16/09/2521/09/25

指纹

探究 'A Unified Framework for Knowledge-Intensive Numerical Reasoning over Financial Document' 的科研主题。它们共同构成独一无二的指纹。

引用此