Fan-Ming Luo

Identifiers

  • name variant: Fan-Ming Luo · 0.60 · backfill

Papers (5)

  1. Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate · cs.LG · 2024 · author #1
  2. Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning · cs.LG · 2023 · author #1
  3. Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games · cs.LG · 2022 · author #2
  4. A Survey on Model-based Reinforcement Learning · cs.LG · 2022 · author #1
  5. Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble · cs.LG · 2022 · author #1

Mentions

  • 2206.00238 #1 · arxiv_oai · confidence 0.70 · Fan-Ming Luo
  • 2405.15384 #1 · arxiv_oai · confidence 0.70 · Fan-Ming Luo
  • 2310.05422 #1 · arxiv_oai · confidence 0.70 · Fan-Ming Luo
  • 2208.09452 #2 · arxiv_oai · confidence 0.70 · Fan-Ming Luo
  • 2206.09328 #1 · arxiv_oai · confidence 0.70 · Fan-Ming Luo

Frequent Coauthors