Financial report chunking for effective retrieval augmented generation

John Smith, Jane Doe, Emily Johnson · 2024 · arXiv 2402.05131

7 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

IntrAgent: An LLM Agent for Content-Grounded Information Retrieval through Literature Review

cs.IR · 2026-04-23 · unverdicted · novelty 7.0

IntrAgent uses a two-stage pipeline of section ranking and iterative reading to perform content-grounded literature information retrieval, achieving 13.2% higher accuracy than RAG and agent baselines on the new IntraBench benchmark.

Adaptive Query Routing: A Tier-Based Framework for Hybrid Retrieval Across Financial, Legal, and Medical Documents

cs.IR · 2026-04-14 · conditional · novelty 5.0

Tree reasoning outperforms vector search on complex document queries but a hybrid approach balances results across tiers, with validation showing an 11.7-point gap on real finance documents.

Empirical Evaluation of PDF Parsing and Chunking for Financial Question Answering with RAG

cs.CL · 2026-04-13 · unverdicted · novelty 5.0

Systematic tests show that specific PDF parsers combined with overlapping chunking strategies better preserve structure and improve RAG answer correctness on financial QA benchmarks including the new TableQuest dataset.

RefineRAG: Word-Level Poisoning Attacks via Retriever-Guided Text Refinement

cs.CR · 2026-04-08 · unverdicted · novelty 5.0

RefineRAG achieves 90% attack success on NQ by generating toxic seeds then optimizing them via retriever-in-the-loop word refinement, outperforming prior methods on effectiveness and naturalness.

MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering

cs.CL · 2025-06-25 · conditional · novelty 5.0

MultiFinRAG is a multimodal RAG framework that improves accuracy on financial QA tasks involving text, tables, and images by 19 percentage points over ChatGPT-4o while running on commodity hardware.

Architecture Matters More Than Scale: A Comparative Study of Retrieval and Memory Augmentation for Financial QA Under SME Compute Constraints

cs.IR · 2026-04-20 · unverdicted · novelty 4.0 · 2 refs

Structured memory improves precision on deterministic financial calculations while retrieval-augmented generation outperforms in conversational settings, supporting a hybrid deployment framework for resource-constrained SMEs.

MetaGraph: A Large-Scale Meta-Analysis of GenAI in Financial NLP (2022-2025)

cs.CL · 2025-09-11

citing papers explorer

IntrAgent: An LLM Agent for Content-Grounded Information Retrieval through Literature Review
cs.IR · 2026-04-23 · unverdicted · none · ref 10
Adaptive Query Routing: A Tier-Based Framework for Hybrid Retrieval Across Financial, Legal, and Medical Documents
cs.IR · 2026-04-14 · conditional · none · ref 52
Empirical Evaluation of PDF Parsing and Chunking for Financial Question Answering with RAG
cs.CL · 2026-04-13 · unverdicted · none · ref 49
RefineRAG: Word-Level Poisoning Attacks via Retriever-Guided Text Refinement
cs.CR · 2026-04-08 · unverdicted · none · ref 27
MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering
cs.CL · 2025-06-25 · conditional · none · ref 22
Architecture Matters More Than Scale: A Comparative Study of Retrieval and Memory Augmentation for Financial QA Under SME Compute Constraints
cs.IR · 2026-04-20 · unverdicted · none · ref 18 · 2 links
MetaGraph: A Large-Scale Meta-Analysis of GenAI in Financial NLP (2022-2025)
cs.CL · 2025-09-11 · unreviewed · ref 59
