Financial report chunking for effective retrieval augmented generation

Antonio Jimeno Yepes, Yao You, Jan Milczek, Sebastian Laverde, Renyu Li · March 2024 · arXiv 2402.05131

10 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

IntrAgent: An LLM Agent for Content-Grounded Information Retrieval through Literature Review

cs.IR · 2026-04-23 · unverdicted · novelty 7.0

IntrAgent uses a two-stage pipeline of section ranking and iterative reading to perform content-grounded literature information retrieval, achieving 13.2% higher accuracy than RAG and agent baselines on the new IntraBench benchmark.

SHM-Agents: A Generalist-Specialist Integrated Agent System for Structural Health Monitoring

cs.MA · 2026-05-13 · unverdicted · novelty 5.0

SHM-Agents is an LLM-plus-specialist-agent framework that claims to execute a wide range of SHM tasks end-to-end via natural language on data from a long-span cable-stayed bridge.

Adaptive Query Routing: A Tier-Based Framework for Hybrid Retrieval Across Financial, Legal, and Medical Documents

cs.IR · 2026-04-14 · conditional · novelty 5.0

Tree reasoning outperforms vector search on complex document queries, but a hybrid approach balances results across tiers; validation shows an 11.7-point gap on real finance documents.

Empirical Evaluation of PDF Parsing and Chunking for Financial Question Answering with RAG

cs.CL · 2026-04-13 · unverdicted · novelty 5.0

Systematic tests show that specific PDF parsers combined with overlapping chunking strategies better preserve structure and improve RAG answer correctness on financial QA benchmarks including the new TableQuest dataset.

RefineRAG: Word-Level Poisoning Attacks via Retriever-Guided Text Refinement

cs.CR · 2026-04-08 · unverdicted · novelty 5.0

RefineRAG achieves 90% attack success on NQ by generating toxic seeds then optimizing them via retriever-in-the-loop word refinement, outperforming prior methods on effectiveness and naturalness.

MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering

cs.CL · 2025-06-25 · conditional · novelty 5.0

MultiFinRAG is a multimodal RAG framework that improves accuracy on financial QA tasks involving text, tables, and images by 19 percentage points over ChatGPT-4o while running on commodity hardware.

MimirRAG: A Multi-Agent RAG Framework for Financial Data Retrieval with Metadata Integration

cs.LG · 2026-05-24 · unverdicted · novelty 4.0

MimirRAG, a multi-agent RAG framework with metadata integration and table-aware chunking, reaches 89.3% accuracy on FinanceBench and outperforms prior baselines for financial document retrieval.

Architecture Matters More Than Scale: A Comparative Study of Retrieval and Memory Augmentation for Financial QA Under SME Compute Constraints

cs.IR · 2026-04-20 · unverdicted · novelty 4.0 · 2 refs

Structured memory improves precision on deterministic financial calculations while retrieval-augmented generation outperforms in conversational settings, supporting a hybrid deployment framework for resource-constrained SMEs.

Evaluating Chunking Strategies for Retrieval-Augmented Generation on Academic Texts

cs.IR · 2026-07-02 · unverdicted · novelty 3.0

Cluster-based semantic chunking does not outperform fixed-size or recursive chunking for RAG on academic theses, and RAGAs faithfulness shows limited reliability in this setup.

MetaGraph: A Large-Scale Meta-Analysis of GenAI in Financial NLP (2022-2025)

cs.CL · 2025-09-11
