Herculean benchmark shows frontier agents handle trading and market insights better than hedging and auditing workflows that demand state consistency and structured verification.
Association for Computing Machinery, New York, NY, USA, 2025
2 Pith papers cite this work. Polarity classification is still indexing.
Citing papers by year: 2026 (2)
Verdicts: UNVERDICTED (2)
Representative citing papers:
Adding a Bayesian source memory for market-feedback adaptive retrieval to a frozen LLM improves macro-F1 from 0.438 to 0.471 and portfolio Sharpe from 0.52 to 0.84 in point-in-time financial event-impact prediction.
citing papers explorer
- Herculean: An Agentic Benchmark for Financial Intelligence
  Herculean benchmark shows frontier agents handle trading and market insights better than hedging and auditing workflows that demand state consistency and structured verification.
- Point-in-Time Financial RAG with Frozen LLMs and Market-Feedback Adaptive Retrieval
  Adding a Bayesian source memory for market-feedback adaptive retrieval to a frozen LLM improves macro-F1 from 0.438 to 0.471 and portfolio Sharpe from 0.52 to 0.84 in point-in-time financial event-impact prediction.