Dual- objective reinforcement learning with novel hamilton-jacobi-bellman formulations,

· 2025 · arXiv 2506.16016

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Bellman Value Decomposition for Task Logic in Safe Optimal Control

cs.RO · 2026-02-23 · unverdicted · novelty 7.0

Bellman values for temporal logic tasks decompose into a graph of reach-avoid, avoid, and reach-avoid-loop equations solved by embedding the graph in a two-layer neural net (VDPPO) for safe high-dimensional control.

Carbon-Aware Intrusion Detection: A Comparative Study of Supervised and Unsupervised DRL for Sustainable IoT Edge Gateways

cs.CR · 2025-11-23 · unverdicted · novelty 5.0

Introduces two carbon-aware DRL-based intrusion detection systems for IoT edge gateways, reporting 94% accuracy for a supervised LSTM-DRL model and 98% for a label-free Autoencoder-DRL hybrid.

citing papers explorer

Showing 2 of 2 citing papers.

Bellman Value Decomposition for Task Logic in Safe Optimal Control cs.RO · 2026-02-23 · unverdicted · none · ref 5
Bellman values for temporal logic tasks decompose into a graph of reach-avoid, avoid, and reach-avoid-loop equations solved by embedding the graph in a two-layer neural net (VDPPO) for safe high-dimensional control.
Carbon-Aware Intrusion Detection: A Comparative Study of Supervised and Unsupervised DRL for Sustainable IoT Edge Gateways cs.CR · 2025-11-23 · unverdicted · none · ref 40
Introduces two carbon-aware DRL-based intrusion detection systems for IoT edge gateways, reporting 94% accuracy for a supervised LSTM-DRL model and 98% for a label-free Autoencoder-DRL hybrid.

Dual- objective reinforcement learning with novel hamilton-jacobi-bellman formulations,

fields

years

verdicts

representative citing papers

citing papers explorer