Ashique Rupam Mahmood
Identifiers
No identifiers captured yet.
Papers (1)
- Multi-step Off-policy Learning Without Importance Sampling Ratios cs.LG · 2017 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Huizhen Yu 1 shared papers
- Richard S. Sutton 1 shared papers