Moira: Language-driven Hierarchical Reinforcement Learning for Pair Trading

2026 · cs.AI · arXiv 2605.01954

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

Many sequential decision-making problems exhibit hierarchical structure, where high-level semantic choices constrain downstream actions and feedback is delayed and ambiguous. Learning in such settings is challenging due to credit assignment: performance degradation may arise from flawed abstractions, suboptimal execution, or their interaction. We study this challenge through pair trading, a domain that naturally combines long-horizon semantic reasoning for asset pair selection with short-horizon execution under partial observability. We formulate pair trading as a hierarchical reinforcement learning problem and propose a language-driven optimization framework in which both high-level and low-level policies are parameterized by large language models (LLMs) and optimized exclusively through prompt updates. Our approach leverages pretrained LLMs as hierarchical policies and uses trajectory- and episode-level textual feedback to adapt abstractions and execution without gradient-based fine-tuning. By explicitly separating abstraction selection from execution, the framework reduces non-stationarity across hierarchical levels and enables targeted adaptation under delayed feedback. Experiments on real-world market data show consistent improvements over traditional and LLM-based baselines, demonstrating the effectiveness of language-driven hierarchical reinforcement learning.
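The two-level structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `PromptPolicy` and `run_episode`, the toy "LLM" functions, the spread thresholds, and the way feedback is appended to the prompt are all assumptions made for illustration. The sketch only shows the general idea of a high-level policy choosing a pair, a low-level policy trading its spread, and both being adapted through prompt text rather than gradient updates.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical stand-in for an LLM call: maps a prompt string to a text reply.
LLM = Callable[[str], str]


@dataclass
class PromptPolicy:
    """A policy whose only adjustable 'parameter' is its prompt text."""
    llm: LLM
    prompt: str
    feedback_log: List[str] = field(default_factory=list)

    def act(self, observation: str) -> str:
        return self.llm(f"{self.prompt}\n\nObservation: {observation}").strip()

    def update(self, feedback: str) -> None:
        # Assumed update rule: append textual feedback to the prompt
        # instead of taking a gradient step.
        self.feedback_log.append(feedback)
        self.prompt = f"{self.prompt}\nLesson: {feedback}"


def run_episode(high: PromptPolicy, low: PromptPolicy,
                spreads: Dict[str, List[float]]) -> float:
    """One episode: pick a pair (high level), then trade its spread (low level)."""
    candidates = sorted(spreads)
    pair = high.act("candidates: " + ", ".join(candidates))
    if pair not in spreads:  # fall back if the reply is not a known pair
        pair = candidates[0]

    position, pnl = 0, 0.0
    series = spreads[pair]
    for prev, cur in zip(series, series[1:]):
        pnl += position * (cur - prev)  # mark the held position to market
        action = low.act(f"pair={pair} spread={cur:.2f} position={position}")
        position = {"long": 1, "short": -1, "flat": 0}.get(action, position)

    # Separate textual feedback per level (format is an assumption).
    low.update(f"trajectory on {pair} ended with PnL {pnl:.2f}")
    high.update(f"episode with pair {pair} ended with PnL {pnl:.2f}")
    return pnl


# Toy deterministic "LLMs" so the sketch runs without any model access.
def toy_high_llm(prompt: str) -> str:
    return prompt.rsplit("candidates: ", 1)[1].split(", ")[0]


def toy_low_llm(prompt: str) -> str:
    spread = float(prompt.rsplit("spread=", 1)[1].split()[0])
    if spread > 1.0:
        return "short"  # spread is wide: bet on it narrowing
    if spread < -1.0:
        return "long"   # spread is very negative: bet on it rising
    return "flat"


high = PromptPolicy(toy_high_llm, "Pick one pair to trade.")
low = PromptPolicy(toy_low_llm, "Reply with long, short, or flat.")
pnl = run_episode(high, low, {"A/B": [0.0, 1.5, 0.5, -1.5, 0.0],
                              "C/D": [0.0, 0.1, 0.0]})
print(pnl)
print(high.prompt)
```

Keeping the two prompts separate means feedback about pair choice and feedback about execution are never mixed, which is one simple way to mirror the separation of abstraction selection from execution that the abstract emphasizes.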

representative citing papers

Herculean: An Agentic Benchmark for Financial Intelligence

cs.AI · 2026-05-14 · unverdicted · novelty 7.0

Herculean benchmark shows frontier agents handle trading and market insights better than hedging and auditing workflows that demand state consistency and structured verification.
