Finrl: A deep reinforcement learning library for automated stock trading in quantitative finance.CoRR, abs/2011.09607

· 2020 · arXiv 2011.09607

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

read on arXiv browse 10 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Robust Adversarial Policy Optimization Under Dynamics Uncertainty

cs.LG · 2026-04-13 · unverdicted · novelty 7.0

RAPO uses a dual robust RL formulation with trajectory-level adversarial networks and model-level Boltzmann reweighting over dynamics ensembles to improve policy resilience and out-of-distribution generalization while keeping the problem tractable.

Counterfactual Transport Flows for Offline Conservative Trajectory Refinement

cs.LG · 2026-06-08 · unverdicted · novelty 6.0

Counterfactual transport flows enable conservative, instance-specific trajectory refinement in offline RL by constructing local preference pairs in latent space from offline data and learning refinement directions controlled by a strength parameter.

SBCA: Cross-Modal BERT-driven Actor-Critic for Multi-Asset Portfolio Optimization

q-fin.CP · 2026-05-02 · unverdicted · novelty 6.0

SBCA is a reinforcement learning framework using BERT cross-modal fusion and Actor-Critic to integrate price data with sentiment text for multi-asset portfolio optimization with practical trading constraints.

Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations

cs.LG · 2026-06-09 · unverdicted · novelty 5.0

FPQC-SAC adds a bounded parameterized quantum circuit to SAC to constrain representations in low-SNR financial environments, reporting 66.89% higher cumulative returns than standard SAC on real portfolio tasks.

Semantic State Abstraction Interfaces for LLM-Augmented Portfolio Decisions: Multi-Axis News Decomposition and RL Diagnostics

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

SSAI maps news into four factors (sentiment, risk, confidence, volatility) for trading, but factor portfolios, ridge models, and RL agents show no reliable edge over baselines after coverage controls and costs.

From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments

cs.AI · 2026-03-25 · unverdicted · novelty 5.0

An empirical literature analysis reveals a bifurcation in RL environments into Semantic Prior (LLM-dominated) and Domain-Specific Generalization ecosystems with distinct cognitive fingerprints.

Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning

cs.LG · 2026-06-03 · unverdicted · novelty 4.0

A hybrid DRL system for multi-pair crypto trading with deterministic risk shielding outperforms a heuristic baseline at 10% significance on Binance futures data.

EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation

cs.AI · 2026-04-13 · unverdicted · novelty 4.0

EvoNash-MARL achieves 19.6% annualized returns on equity allocation from 2014-2024 versus 11.7% for SPY, with evidence of robustness under constraints but no strong statistical superiority per WRC and SPA-lite tests.

AlphaQuanter: An End-to-End Tool-Augmented Agentic Reinforcement Learning Framework for Stock Trading

cs.CE · 2025-10-16 · unverdicted · novelty 4.0

AlphaQuanter introduces a single-agent tool-augmented RL framework for stock trading that learns dynamic policies over a transparent decision workflow and reports state-of-the-art financial metrics.

AI-Powered Sustainable Finance: An Integrative Taxonomy and Framework of AI Applications for Sustainable Investment Decision-Making

cs.CE · 2026-05-25 · unverdicted · novelty 3.0

A review paper that builds a taxonomy of AI methods (supervised, unsupervised, reinforcement learning, NLP, optimization) and a framework for their use in ESG score prediction, controversy detection, portfolio management, and sustainability report analysis.

citing papers explorer

Showing 9 of 9 citing papers after filters.

Robust Adversarial Policy Optimization Under Dynamics Uncertainty cs.LG · 2026-04-13 · unverdicted · none · ref 8
RAPO uses a dual robust RL formulation with trajectory-level adversarial networks and model-level Boltzmann reweighting over dynamics ensembles to improve policy resilience and out-of-distribution generalization while keeping the problem tractable.
Counterfactual Transport Flows for Offline Conservative Trajectory Refinement cs.LG · 2026-06-08 · unverdicted · none · ref 46
Counterfactual transport flows enable conservative, instance-specific trajectory refinement in offline RL by constructing local preference pairs in latent space from offline data and learning refinement directions controlled by a strength parameter.
SBCA: Cross-Modal BERT-driven Actor-Critic for Multi-Asset Portfolio Optimization q-fin.CP · 2026-05-02 · unverdicted · none · ref 11
SBCA is a reinforcement learning framework using BERT cross-modal fusion and Actor-Critic to integrate price data with sentiment text for multi-asset portfolio optimization with practical trading constraints.
Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations cs.LG · 2026-06-09 · unverdicted · none · ref 29
FPQC-SAC adds a bounded parameterized quantum circuit to SAC to constrain representations in low-SNR financial environments, reporting 66.89% higher cumulative returns than standard SAC on real portfolio tasks.
Semantic State Abstraction Interfaces for LLM-Augmented Portfolio Decisions: Multi-Axis News Decomposition and RL Diagnostics cs.LG · 2026-05-07 · unverdicted · none · ref 4
SSAI maps news into four factors (sentiment, risk, confidence, volatility) for trading, but factor portfolios, ridge models, and RL agents show no reliable edge over baselines after coverage controls and costs.
From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments cs.AI · 2026-03-25 · unverdicted · none · ref 130
An empirical literature analysis reveals a bifurcation in RL environments into Semantic Prior (LLM-dominated) and Domain-Specific Generalization ecosystems with distinct cognitive fingerprints.
Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning cs.LG · 2026-06-03 · unverdicted · none · ref 29
A hybrid DRL system for multi-pair crypto trading with deterministic risk shielding outperforms a heuristic baseline at 10% significance on Binance futures data.
EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation cs.AI · 2026-04-13 · unverdicted · none · ref 8
EvoNash-MARL achieves 19.6% annualized returns on equity allocation from 2014-2024 versus 11.7% for SPY, with evidence of robustness under constraints but no strong statistical superiority per WRC and SPA-lite tests.
AI-Powered Sustainable Finance: An Integrative Taxonomy and Framework of AI Applications for Sustainable Investment Decision-Making cs.CE · 2026-05-25 · unverdicted · none · ref 54
A review paper that builds a taxonomy of AI methods (supervised, unsupervised, reinforcement learning, NLP, optimization) and a framework for their use in ESG score prediction, controversy detection, portfolio management, and sustainability report analysis.

Finrl: A deep reinforcement learning library for automated stock trading in quantitative finance.CoRR, abs/2011.09607

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer