hub

arXiv preprint arXiv:1908.05659 , year=

Hamed Rahimian, Sanjay Mehrotra · 1908 · arXiv 1908.05659

18 Pith papers cite this work. Polarity classification is still indexing.

18 Pith papers citing it

read on arXiv browse 18 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

other 1

citation-polarity summary

unclear 1

representative citing papers

Taming the Curses of Multiagency in Robust Markov Games with Large State Space through Linear Function Approximation

cs.LG · 2026-05-04 · unverdicted · novelty 8.0

The work gives the first algorithms for general robust Markov games with linear function approximation whose sample complexity breaks the curse of multiagency for large state spaces in both generative and online settings.

Distributionally Robust Safety Under Arbitrary Uncertainties: A Safety Filtering Approach

cs.RO · 2026-05-13 · unverdicted · novelty 7.0 · 2 refs

A distributionally robust safety filter reduces certification for nonlinear systems under arbitrary uncertainties to a one-dimensional switching-time search with Wasserstein-inflated sampling guarantees.

Integrating Feature Correlation in Differential Privacy with Applications in DP-ERM

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

CorrDP relaxes standard differential privacy by incorporating feature correlations, enabling distance-dependent noise in DP-ERM for better privacy-utility tradeoffs.

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

cs.LG · 2024-08-29 · unverdicted · novelty 7.0

Presents the first algorithm to identify an ε-optimal policy in robust constrained MDPs via epigraph form and bisection search with Õ(ε^{-4}) robust policy evaluations.

Minimizing Upper Confidence Bounds: A Data-Driven Framework for Stochastic Programming

math.OC · 2024-03-13 · unverdicted · novelty 7.0

Proposes APUB optimization framework for stochastic programming, proves asymptotic correctness and consistency of the new bound, and develops bootstrap and L-shaped solvers for two-stage linear problems with empirical tests on a product mix example.

Safety-Constrained Reinforcement Learning with Post-Training Reachability Verification for Robot Navigation

cs.RO · 2026-05-13 · unverdicted · novelty 6.0

CVaR-constrained TD3 policies for robot navigation show larger safety margins and higher post-training reachability verification rates than average-cost baselines across simulated scenarios and real-robot tests.

Regret Equals Covariance: A Closed-Form Characterization for Stochastic Optimization

econ.EM · 2026-05-13 · unverdicted · novelty 6.0

Expected regret equals covariance between costs and optimal decisions for linear and quadratic stochastic programs, with explicit bounds on the residual.

Ready from Day 1: Population-Aware Coordination for Large-Scale Constrained Multi-Agent Systems

cs.MA · 2026-05-12 · unverdicted · novelty 6.0 · 2 refs

Learned primal and dual maps conditioned on population summaries enable reliable coordination across composition shifts in large multi-agent systems, cutting forecast error 16-19% and violations 20-51% in a supply-chain case study.

Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge

cs.AI · 2026-05-11 · unverdicted · novelty 6.0

RACER routes between reasoning and non-reasoning LLM judges via constrained distributionally robust optimization to achieve better accuracy-cost trade-offs under distribution shift.

Q-MMR: Off-Policy Evaluation via Recursive Reweighting and Moment Matching

cs.LG · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

Q-MMR introduces recursive reweighting and moment matching for off-policy evaluation, delivering dimension-free error bounds under Q^π realizability alone.

The Distributionally Robust Cyclic Inventory Routing Problem

math.OC · 2026-05-05 · unverdicted · novelty 6.0 · 2 refs

The authors create a distributionally robust formulation for the cyclic inventory routing problem that admits a deterministic reformulation via multi-point worst-case distributions and chance-constraint equivalents, solved by nested branch-and-price and tested on real automotive data.

Wasserstein Distributionally Robust Regret Optimization for Reinforcement Learning from Human Feedback

cs.LG · 2026-04-30 · unverdicted · novelty 6.0

DRRO for RLHF minimizes worst-case regret relative to the best policy under Wasserstein reward perturbations, yielding an exact inner solution and water-filling policy structure for the promptwise simplex model plus a practical policy-gradient algorithm.

Nonsmooth Nonconvex-Concave Minimax Optimization: Convergence Criteria and Algorithms

math.OC · 2026-04-23 · unverdicted · novelty 6.0

The authors introduce (ηx,ηy,δ,ε)-GSSP as a convergence criterion and develop projected gradient-free descent-ascent methods achieving non-asymptotic rates for nonsmooth nonconvex-concave minimax optimization without weak convexity assumptions.

Distributionally Robust Stochastic MPC under Disturbance-Affine Feedback Policies

eess.SY · 2026-04-14 · unverdicted · novelty 6.0

A new disturbance-affine distributionally robust MPC framework for uncertain linear systems that is less conservative than tube-based approaches while guaranteeing recursive feasibility and stability.

Interactive Trajectory Planning with Learning-based Distributionally Robust Model Predictive Control and Markov Systems

eess.SY · 2026-05-08 · unverdicted · novelty 5.0

PAC learning-based DR-MPC framework interpolates between robust MPC and stochastic MPC for interactive trajectory planning under agent decision uncertainty.

Assured autonomy: How operations research powers and orchestrates generative AI systems

cs.LG · 2025-12-30 · unverdicted · novelty 5.0

The authors develop a conceptual framework for assured autonomy in generative AI by using flow-based models for auditable generation and adversarial robustness for operational safety, repositioning operations research as a system architect.

A Data-embedded Solution Paradigm for Nonconvex Probable Event Constrained Optimization

math.OC · 2026-04-21 · unverdicted · novelty 4.0

PECO strengthens chance constraints by mandating feasibility for all high-probability events and is solved via a data-embedded deterministic program that works for nonlinear nonconvex instances when the size of the solution-determining data family can be estimated by machine learning.

Target-based Distributionally Robust Minimum Spanning Tree Problem

math.OC · 2023-11-17 · unverdicted · novelty 4.0

A target-based DRO model for MST under distributional uncertainty is solved exactly via Benders decomposition and a modified Prim algorithm.

citing papers explorer

Showing 18 of 18 citing papers.

Taming the Curses of Multiagency in Robust Markov Games with Large State Space through Linear Function Approximation cs.LG · 2026-05-04 · unverdicted · none · ref 9
The work gives the first algorithms for general robust Markov games with linear function approximation whose sample complexity breaks the curse of multiagency for large state spaces in both generative and online settings.
Distributionally Robust Safety Under Arbitrary Uncertainties: A Safety Filtering Approach cs.RO · 2026-05-13 · unverdicted · none · ref 18 · 2 links
A distributionally robust safety filter reduces certification for nonlinear systems under arbitrary uncertainties to a one-dimensional switching-time search with Wasserstein-inflated sampling guarantees.
Integrating Feature Correlation in Differential Privacy with Applications in DP-ERM cs.LG · 2026-05-05 · unverdicted · none · ref 38
CorrDP relaxes standard differential privacy by incorporating feature correlations, enabling distance-dependent noise in DP-ERM for better privacy-utility tradeoffs.
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form cs.LG · 2024-08-29 · unverdicted · none · ref 63
Presents the first algorithm to identify an ε-optimal policy in robust constrained MDPs via epigraph form and bisection search with Õ(ε^{-4}) robust policy evaluations.
Minimizing Upper Confidence Bounds: A Data-Driven Framework for Stochastic Programming math.OC · 2024-03-13 · unverdicted · none · ref 25
Proposes APUB optimization framework for stochastic programming, proves asymptotic correctness and consistency of the new bound, and develops bootstrap and L-shaped solvers for two-stage linear problems with empirical tests on a product mix example.
Safety-Constrained Reinforcement Learning with Post-Training Reachability Verification for Robot Navigation cs.RO · 2026-05-13 · unverdicted · none · ref 20
CVaR-constrained TD3 policies for robot navigation show larger safety margins and higher post-training reachability verification rates than average-cost baselines across simulated scenarios and real-robot tests.
Regret Equals Covariance: A Closed-Form Characterization for Stochastic Optimization econ.EM · 2026-05-13 · unverdicted · none · ref 20
Expected regret equals covariance between costs and optimal decisions for linear and quadratic stochastic programs, with explicit bounds on the residual.
Ready from Day 1: Population-Aware Coordination for Large-Scale Constrained Multi-Agent Systems cs.MA · 2026-05-12 · unverdicted · none · ref 30 · 2 links
Learned primal and dual maps conditioned on population summaries enable reliable coordination across composition shifts in large multi-agent systems, cutting forecast error 16-19% and violations 20-51% in a supply-chain case study.
Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge cs.AI · 2026-05-11 · unverdicted · none · ref 21
RACER routes between reasoning and non-reasoning LLM judges via constrained distributionally robust optimization to achieve better accuracy-cost trade-offs under distribution shift.
Q-MMR: Off-Policy Evaluation via Recursive Reweighting and Moment Matching cs.LG · 2026-05-07 · unverdicted · none · ref 6 · 2 links
Q-MMR introduces recursive reweighting and moment matching for off-policy evaluation, delivering dimension-free error bounds under Q^π realizability alone.
The Distributionally Robust Cyclic Inventory Routing Problem math.OC · 2026-05-05 · unverdicted · none · ref 70 · 2 links
The authors create a distributionally robust formulation for the cyclic inventory routing problem that admits a deterministic reformulation via multi-point worst-case distributions and chance-constraint equivalents, solved by nested branch-and-price and tested on real automotive data.
Wasserstein Distributionally Robust Regret Optimization for Reinforcement Learning from Human Feedback cs.LG · 2026-04-30 · unverdicted · none · ref 15
DRRO for RLHF minimizes worst-case regret relative to the best policy under Wasserstein reward perturbations, yielding an exact inner solution and water-filling policy structure for the promptwise simplex model plus a practical policy-gradient algorithm.
Nonsmooth Nonconvex-Concave Minimax Optimization: Convergence Criteria and Algorithms math.OC · 2026-04-23 · unverdicted · none · ref 7
The authors introduce (ηx,ηy,δ,ε)-GSSP as a convergence criterion and develop projected gradient-free descent-ascent methods achieving non-asymptotic rates for nonsmooth nonconvex-concave minimax optimization without weak convexity assumptions.
Distributionally Robust Stochastic MPC under Disturbance-Affine Feedback Policies eess.SY · 2026-04-14 · unverdicted · none · ref 32
A new disturbance-affine distributionally robust MPC framework for uncertain linear systems that is less conservative than tube-based approaches while guaranteeing recursive feasibility and stability.
Interactive Trajectory Planning with Learning-based Distributionally Robust Model Predictive Control and Markov Systems eess.SY · 2026-05-08 · unverdicted · none · ref 17
PAC learning-based DR-MPC framework interpolates between robust MPC and stochastic MPC for interactive trajectory planning under agent decision uncertainty.
Assured autonomy: How operations research powers and orchestrates generative AI systems cs.LG · 2025-12-30 · unverdicted · none · ref 5
The authors develop a conceptual framework for assured autonomy in generative AI by using flow-based models for auditable generation and adversarial robustness for operational safety, repositioning operations research as a system architect.
A Data-embedded Solution Paradigm for Nonconvex Probable Event Constrained Optimization math.OC · 2026-04-21 · unverdicted · none · ref 2
PECO strengthens chance constraints by mandating feasibility for all high-probability events and is solved via a data-embedded deterministic program that works for nonlinear nonconvex instances when the size of the solution-determining data family can be estimated by machine learning.
Target-based Distributionally Robust Minimum Spanning Tree Problem math.OC · 2023-11-17 · unverdicted · none · ref 43
A target-based DRO model for MST under distributional uncertainty is solved exactly via Benders decomposition and a modified Prim algorithm.

arXiv preprint arXiv:1908.05659 , year=

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer