FinRetrieval: A benchmark for financial data retrieval by AI agents

Eric Y Kim, Jie Huang · 2026 · arXiv 2603.04403

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents

cs.AI · 2026-06-02 · unverdicted · novelty 7.0

BigFinanceBench is a workflow-grounded benchmark of 928 financial research tasks with point-weighted rubrics, where the best of ten tested agents scores 58.8% on derivation quality.

IPO Finance Agent: Benchmark of LLM Financial Analysts Beyond Finance Agent v2, with Automated Rubric Generation, on the SpaceX (SPCX) IPO

cs.AI · 2026-06-22 · unverdicted · novelty 6.0

IPO Finance Agent benchmarks LLMs on SpaceX S-1 questions with contextual retrieval and auto-generated rubrics, reporting up to 79.8% accuracy and better cost-efficiency than prior Finance Agent v2 entries.

citing papers explorer

Showing 2 of 2 citing papers after filters.

BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents cs.AI · 2026-06-02 · unverdicted · none · ref 11
BigFinanceBench is a workflow-grounded benchmark of 928 financial research tasks with point-weighted rubrics, where the best of ten tested agents scores 58.8% on derivation quality.
IPO Finance Agent: Benchmark of LLM Financial Analysts Beyond Finance Agent v2, with Automated Rubric Generation, on the SpaceX (SPCX) IPO cs.AI · 2026-06-22 · unverdicted · none · ref 14
IPO Finance Agent benchmarks LLMs on SpaceX S-1 questions with contextual retrieval and auto-generated rubrics, reporting up to 79.8% accuracy and better cost-efficiency than prior Finance Agent v2 entries.

FinRetrieval: A benchmark for financial data retrieval by AI agents

fields

years

verdicts

representative citing papers

citing papers explorer