pith. sign in

Doina Precup

Identifiers

  • name variant Doina Precup 0.60 · backfill

Papers (54)

  1. Rotation-Preserving Supervised Fine-Tuning cs.LG · 2026 · author #6
  2. RL Fine-Tuning Heals OOD Forgetting in SFT cs.LG · 2025 · author #7
  3. Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #16
  4. Recurrent Value Functions cs.LG · 2019 · author #4
  5. Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks cs.RO · 2019 · author #4
  6. Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials cs.LG · 2019 · author #2
  7. The Termination Critic cs.AI · 2019 · author #6
  8. Clustering-Oriented Representation Learning with Attractive-Repulsive Loss cs.LG · 2018 · author #6
  9. Environments for Lifelong Reinforcement Learning cs.AI · 2018 · author #4
  10. The Barbados 2018 List of Open Issues in Continual Learning cs.AI · 2018 · author #10
  11. Temporal Regularization in Markov Decision Process cs.LG · 2018 · author #4
  12. Combined Reinforcement Learning via Abstract Representations cs.LG · 2018 · author #3
  13. Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants cs.LG · 2018 · author #7
  14. Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing cs.LG · 2018 · author #7
  15. A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants eess.SP · 2018 · author #7
  16. Exploring Uncertainty Measures in Deep Networks for Multiple Sclerosis Lesion Detection and Segmentation cs.CV · 2018 · author #2
  17. Attend Before you Act: Leveraging human visual attention for continual learning cs.AI · 2018 · author #2
  18. Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning cs.LG · 2018 · author #3
  19. Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization cs.CL · 2018 · author #3
  20. Dyna Planning using a Feature Based Generative Model cs.LG · 2018 · author #2
  21. Learning Safe Policies with Expert Guidance cs.LG · 2018 · author #3
  22. Disentangling the independently controllable factors of variation by interacting with the world stat.ML · 2018 · author #8
  23. Learning Robust Options cs.AI · 2018 · author #4
  24. Learnings Options End-to-End for Continuous Action Tasks cs.LG · 2017 · author #4
  25. Ubenwa: Cry-based Diagnosis of Birth Asphyxia stat.ML · 2017 · author #5
  26. Learning with Options that Terminate Off-Policy cs.AI · 2017 · author #4
  27. OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning cs.LG · 2017 · author #6
  28. Deep Reinforcement Learning that Matters cs.LG · 2017 · author #5
  29. When Waiting is not an Option : Learning Options with a Deliberation Cost cs.AI · 2017 · author #4
  30. Neural Network Based Nonlinear Weighted Finite Automata cs.FL · 2017 · author #3
  31. Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control cs.LG · 2017 · author #4
  32. Independently Controllable Factors cs.LG · 2017 · author #8
  33. Variational Generative Stochastic Networks with Collaborative Shaping cs.LG · 2017 · author #2
  34. Convergent Tree Backup and Retrace with Function Approximation cs.LG · 2017 · author #3
  35. Investigating Recurrence and Eligibility Traces in Deep Q-Networks cs.AI · 2017 · author #2
  36. Independently Controllable Features cs.LG · 2017 · author #4
  37. Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options cs.AI · 2017 · author #2
  38. A Matrix Splitting Perspective on Planning with Options cs.AI · 2016 · author #2
  39. The Option-Critic Architecture cs.AI · 2016 · author #3
  40. Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data cs.CL · 2016 · author #4
  41. Differentially Private Policy Evaluation cs.LG · 2016 · author #3
  42. Policy Gradient Methods for Off-policy Control cs.AI · 2015 · author #2
  43. Conditional Computation in Neural Networks for faster models cs.LG · 2015 · author #4
  44. Testing Visual Attention in Dynamic Environments cs.LG · 2015 · author #3
  45. Data Generation as Sequential Decision Making cs.LG · 2015 · author #2
  46. A Canonical Form for Weighted Automata and Applications to Approximate Minimization cs.FL · 2015 · author #3
  47. Learning with Pseudo-Ensembles stat.ML · 2014 · author #3
  48. Practical Kernel-Based Reinforcement Learning cs.LG · 2014 · author #2
  49. Classification-based Approximate Policy Iteration: Experiments and Extended Discussions cs.LG · 2014 · author #2
  50. Algorithms for multi-armed bandit problems cs.AI · 2014 · author #2
  51. Bellman Error Based Feature Generation using Random Projections on Sparse Spaces cs.LG · 2012 · author #5
  52. Metrics for Finite Markov Decision Processes cs.AI · 2012 · author #3
  53. Metrics for Markov Decision Processes with Infinite State Spaces cs.AI · 2012 · author #3
  54. Methods for computing state similarity in Markov Decision Processes cs.AI · 2012 · author #3

Mentions

  • 1207.5554 #5 · backfill · confidence 0.70 Doina Precup
  • 1207.4114 #3 · backfill · confidence 0.70 Doina Precup
  • 1207.1386 #3 · backfill · confidence 0.70 Doina Precup
  • 1206.6836 #3 · backfill · confidence 0.70 Doina Precup
  • 2409.12917 #16 · arxiv_oai · confidence 0.70 Doina Precup

Frequent Coauthors