pith. sign in

Martha White

Identifiers

  • name variant Martha White 0.60 · backfill

Papers (33)

  1. Addressing Terminal Constraints in Data-Driven Demand Response Scheduling eess.SY · 2026 · author #2
  2. Revisiting Mixture Policies in Entropy-Regularized Actor-Critic cs.LG · 2026 · author #5
  3. Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning cs.LG · 2026 · author #4
  4. Forager: a lightweight testbed for continual learning with partial observability in RL cs.LG · 2026 · author #9
  5. Gradient Iterated Temporal-Difference Learning cs.LG · 2026 · author #6
  6. Deep Double Q-learning cs.LG · 2025 · author #2
  7. Distributions as Actions: A Unified Framework for Diverse Action Spaces cs.LG · 2025 · author #3
  8. Mitigating Value Hallucination in Dyna Planning via Multistep Predecessor Models cs.LG · 2020 · author #6
  9. Hill Climbing on Value Estimates for Search-control in Dyna cs.LG · 2019 · author #4
  10. Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling cs.LG · 2018 · author #4
  11. An Off-policy Policy Gradient Theorem Using Emphatic Weightings cs.LG · 2018 · author #3
  12. The Barbados 2018 List of Open Issues in Continual Learning cs.AI · 2018 · author #4
  13. The Utility of Sparse Representations for Control in Reinforcement Learning cs.LG · 2018 · author #4
  14. Online Off-policy Prediction cs.LG · 2018 · author #3
  15. High-confidence error estimates for learned value functions stat.ML · 2018 · author #3
  16. Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control cs.LG · 2018 · author #3
  17. Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains cs.AI · 2018 · author #5
  18. Improving Regression Performance with Distributional Losses stat.ML · 2018 · author #2
  19. Directly Estimating the Variance of the {\lambda}-Return Using Temporal-Difference Methods cs.AI · 2018 · author #6
  20. Effective sketching methods for value function approximation cs.LG · 2017 · author #3
  21. Learning Sparse Representations in Reinforcement Learning with Sparse Coding cs.AI · 2017 · author #3
  22. Recovering True Classifier Performance in Positive-Unlabeled Learning stat.ML · 2017 · author #2
  23. Accelerated Gradient Temporal Difference Learning cs.AI · 2016 · author #3
  24. A Greedy Approach to Adapting the Trace Parameter for Temporal Difference Learning cs.AI · 2016 · author #1
  25. Estimating the class prior and posterior from noisy positives and unlabeled data stat.ML · 2016 · author #2
  26. Identifying global optimality for dictionary learning stat.ML · 2016 · author #2
  27. Investigating practical linear temporal difference learning cs.LG · 2016 · author #2
  28. Nonparametric semi-supervised learning of class proportions stat.ML · 2016 · author #2
  29. Incremental Truncated LSTD cs.LG · 2015 · author #3
  30. Emphatic Temporal-Difference Learning cs.LG · 2015 · author #3
  31. An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning cs.LG · 2015 · author #3
  32. Partition Tree Weighting cs.IT · 2012 · author #2
  33. Off-Policy Actor-Critic cs.LG · 2012 · author #2

Mentions

  • 1211.0587 #2 · backfill · confidence 0.70 Martha White
  • 1205.4839 #2 · backfill · confidence 0.70 Martha White
  • 2605.16318 #4 · arxiv_oai · confidence 0.70 Martha White
  • 2507.00275 #2 · arxiv_oai · confidence 0.70 Martha White

Frequent Coauthors