
Improving predictive inference under covariate shift by weighting the log-likelihood function

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields: cs.LG (1)

years: 2019 (1)

verdicts: UNVERDICTED (1)

representative citing papers

Entropic Regularization of Markov Decision Processes

cs.LG · 2019-07-06 · unverdicted · novelty 6.0

Using alpha-divergences for entropic regularization in MDPs unifies actor-critic architectures via closed-form policy improvement and provides asymptotic analysis on standard RL problems.

citing papers explorer

Showing 1 of 1 citing paper.

  • Entropic Regularization of Markov Decision Processes cs.LG · 2019-07-06 · unverdicted · none · ref 9
