Runzhe Wu

Identifiers

  • name variant Runzhe Wu 0.60 · backfill

Papers (6)

  1. Making RL with Preference-based Feedback Efficient via Randomization cs.LG · 2023 · author #1
  2. Contextual Bandits and Imitation Learning via Preference-Based Active Queries cs.LG · 2023 · author #4
  3. Selective Sampling and Imitation Learning via Online Regression cs.LG · 2023 · author #4
  4. The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning cs.LG · 2023 · author #3
  5. Distributional Offline Policy Evaluation with Predictive Error Guarantees cs.LG · 2023 · author #1
  6. MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning cs.MA · 2021 · author #5

Mentions

  • 2310.14554 #1 · arxiv_oai · confidence 0.70 Runzhe Wu
  • 2302.09456 #1 · arxiv_oai · confidence 0.70 Runzhe Wu
  • 2305.15703 #3 · arxiv_oai · confidence 0.70 Runzhe Wu
  • 2307.12926 #4 · arxiv_oai · confidence 0.70 Runzhe Wu
  • 2307.04998 #4 · arxiv_oai · confidence 0.70 Runzhe Wu
  • 2106.07551 #5 · arxiv_oai · confidence 0.70 Runzhe Wu

Frequent Coauthors