C onv F in QA : Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering

Zhiyu Chen, Shiyang Li, Charese Smiley, Zhiqiang Ma, Sameena Shah, William Yang Wang · 2022 · DOI 10.18653/v1/2022.emnlp-main.421

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

representative citing papers

Fin-Bias: Comprehensive Evaluation for LLM Decision-Making under human bias in Finance Domain

cs.CL · 2026-05-09 · unverdicted · novelty 7.0

LLMs copy biased analyst ratings in investment decisions but a new detection method encourages independent reasoning and can improve stock return predictions beyond human levels.

FrontierFinance: A Long-Horizon Computer-Use Benchmark of Real-World Financial Tasks

cs.CL · 2026-04-07 · unverdicted · novelty 7.0

FrontierFinance benchmark shows human financial experts outperform state-of-the-art LLMs by achieving higher scores and more client-ready outputs on realistic long-horizon tasks.

Query Symbolically or Retrieve Semantically? A Dataset and Method for Semi-Structured Question Answering

cs.AI · 2026-05-26 · unverdicted · novelty 6.0

DualGraph combines semantic textual KGs with symbolic KGs for semi-structured QA and introduces the SpecsQA benchmark, outperforming baselines on both open and specification questions.

OCC-RAG: Optimal Cognitive Core for Faithful Question Answering

cs.CL · 2026-05-30 · unverdicted · novelty 5.0

OCC-RAG develops task-specialized SLMs (0.6B and 1.7B) via a new synthetic data pipeline for multi-hop reasoning and context faithfulness, claiming to match or exceed 2-6x larger general models on HotpotQA, MuSiQue, TAT-QA, ConFiQA, and MuSiQue-Un.

MetaGraph: A Large-Scale Meta-Analysis of GenAI in Financial NLP (2022-2025)

cs.CL · 2025-09-11

citing papers explorer

Showing 4 of 4 citing papers after filters.

Fin-Bias: Comprehensive Evaluation for LLM Decision-Making under human bias in Finance Domain cs.CL · 2026-05-09 · unverdicted · none · ref 46
LLMs copy biased analyst ratings in investment decisions but a new detection method encourages independent reasoning and can improve stock return predictions beyond human levels.
FrontierFinance: A Long-Horizon Computer-Use Benchmark of Real-World Financial Tasks cs.CL · 2026-04-07 · unverdicted · none · ref 6
FrontierFinance benchmark shows human financial experts outperform state-of-the-art LLMs by achieving higher scores and more client-ready outputs on realistic long-horizon tasks.
Query Symbolically or Retrieve Semantically? A Dataset and Method for Semi-Structured Question Answering cs.AI · 2026-05-26 · unverdicted · none · ref 9
DualGraph combines semantic textual KGs with symbolic KGs for semi-structured QA and introduces the SpecsQA benchmark, outperforming baselines on both open and specification questions.
OCC-RAG: Optimal Cognitive Core for Faithful Question Answering cs.CL · 2026-05-30 · unverdicted · none · ref 27
OCC-RAG develops task-specialized SLMs (0.6B and 1.7B) via a new synthetic data pipeline for multi-hop reasoning and context faithfulness, claiming to match or exceed 2-6x larger general models on HotpotQA, MuSiQue, TAT-QA, ConFiQA, and MuSiQue-Un.

C onv F in QA : Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering

fields

years

verdicts

representative citing papers

citing papers explorer