Doubly robust policy evaluation and optimization

Lihong Li · 2014 · DOI 10.1214/14-sts500

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

The Partial Testimony of Logs: Evaluation of Language Model Generation under Confounded Model Choice

cs.LG · 2026-05-02 · unverdicted · novelty 7.0

An identification theorem shows that a randomized experiment and simulator together recover causal model values from confounded logs, with logs used only afterward to reduce estimation error.

CASP: Support-Aware Offline Policy Selection for Two-Stage Recommender Systems

cs.IR · 2026-04-24 · unverdicted · novelty 7.0

CASP selects lower-burden two-stage recommender policies by combining doubly robust estimation with a penalty for weak data support and provides theoretical guarantees for conservative selection.

citing papers explorer

Showing 2 of 2 citing papers.

The Partial Testimony of Logs: Evaluation of Language Model Generation under Confounded Model Choice cs.LG · 2026-05-02 · unverdicted · none · ref 10
An identification theorem shows that a randomized experiment and simulator together recover causal model values from confounded logs, with logs used only afterward to reduce estimation error.
CASP: Support-Aware Offline Policy Selection for Two-Stage Recommender Systems cs.IR · 2026-04-24 · unverdicted · none · ref 7
CASP selects lower-burden two-stage recommender policies by combining doubly robust estimation with a penalty for weak data support and provides theoretical guarantees for conservative selection.

Doubly robust policy evaluation and optimization

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer