Doina Precup
Identifiers
- name variant Doina Precup 0.60 · backfill
Papers (54)
- Rotation-Preserving Supervised Fine-Tuning cs.LG · 2026 · author #6
- RL Fine-Tuning Heals OOD Forgetting in SFT cs.LG · 2025 · author #7
- Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #16
- Recurrent Value Functions cs.LG · 2019 · author #4
- Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks cs.RO · 2019 · author #4
- Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials cs.LG · 2019 · author #2
- The Termination Critic cs.AI · 2019 · author #6
- Clustering-Oriented Representation Learning with Attractive-Repulsive Loss cs.LG · 2018 · author #6
- Environments for Lifelong Reinforcement Learning cs.AI · 2018 · author #4
- The Barbados 2018 List of Open Issues in Continual Learning cs.AI · 2018 · author #10
- Temporal Regularization in Markov Decision Process cs.LG · 2018 · author #4
- Combined Reinforcement Learning via Abstract Representations cs.LG · 2018 · author #3
- Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants cs.LG · 2018 · author #7
- Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing cs.LG · 2018 · author #7
- A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants eess.SP · 2018 · author #7
- Exploring Uncertainty Measures in Deep Networks for Multiple Sclerosis Lesion Detection and Segmentation cs.CV · 2018 · author #2
- Attend Before you Act: Leveraging human visual attention for continual learning cs.AI · 2018 · author #2
- Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning cs.LG · 2018 · author #3
- Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization cs.CL · 2018 · author #3
- Dyna Planning using a Feature Based Generative Model cs.LG · 2018 · author #2
- Learning Safe Policies with Expert Guidance cs.LG · 2018 · author #3
- Disentangling the independently controllable factors of variation by interacting with the world stat.ML · 2018 · author #8
- Learning Robust Options cs.AI · 2018 · author #4
- Learnings Options End-to-End for Continuous Action Tasks cs.LG · 2017 · author #4
- Ubenwa: Cry-based Diagnosis of Birth Asphyxia stat.ML · 2017 · author #5
- Learning with Options that Terminate Off-Policy cs.AI · 2017 · author #4
- OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning cs.LG · 2017 · author #6
- Deep Reinforcement Learning that Matters cs.LG · 2017 · author #5
- When Waiting is not an Option : Learning Options with a Deliberation Cost cs.AI · 2017 · author #4
- Neural Network Based Nonlinear Weighted Finite Automata cs.FL · 2017 · author #3
- Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control cs.LG · 2017 · author #4
- Independently Controllable Factors cs.LG · 2017 · author #8
- Variational Generative Stochastic Networks with Collaborative Shaping cs.LG · 2017 · author #2
- Convergent Tree Backup and Retrace with Function Approximation cs.LG · 2017 · author #3
- Investigating Recurrence and Eligibility Traces in Deep Q-Networks cs.AI · 2017 · author #2
- Independently Controllable Features cs.LG · 2017 · author #4
- Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options cs.AI · 2017 · author #2
- A Matrix Splitting Perspective on Planning with Options cs.AI · 2016 · author #2
- The Option-Critic Architecture cs.AI · 2016 · author #3
- Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data cs.CL · 2016 · author #4
- Differentially Private Policy Evaluation cs.LG · 2016 · author #3
- Policy Gradient Methods for Off-policy Control cs.AI · 2015 · author #2
- Conditional Computation in Neural Networks for faster models cs.LG · 2015 · author #4
- Testing Visual Attention in Dynamic Environments cs.LG · 2015 · author #3
- Data Generation as Sequential Decision Making cs.LG · 2015 · author #2
- A Canonical Form for Weighted Automata and Applications to Approximate Minimization cs.FL · 2015 · author #3
- Learning with Pseudo-Ensembles stat.ML · 2014 · author #3
- Practical Kernel-Based Reinforcement Learning cs.LG · 2014 · author #2
- Classification-based Approximate Policy Iteration: Experiments and Extended Discussions cs.LG · 2014 · author #2
- Algorithms for multi-armed bandit problems cs.AI · 2014 · author #2
- Bellman Error Based Feature Generation using Random Projections on Sparse Spaces cs.LG · 2012 · author #5
- Metrics for Finite Markov Decision Processes cs.AI · 2012 · author #3
- Metrics for Markov Decision Processes with Infinite State Spaces cs.AI · 2012 · author #3
- Methods for computing state similarity in Markov Decision Processes cs.AI · 2012 · author #3
Mentions
- 1207.5554 #5 · backfill · confidence 0.70 Doina Precup
- 1207.4114 #3 · backfill · confidence 0.70 Doina Precup
- 1207.1386 #3 · backfill · confidence 0.70 Doina Precup
- 1206.6836 #3 · backfill · confidence 0.70 Doina Precup
- 2409.12917 #16 · arxiv_oai · confidence 0.70 Doina Precup
Frequent Coauthors
- Joelle Pineau 11 shared papers
- Pierre-Luc Bacon 11 shared papers
- Jean Harb 5 shared papers
- Philip Bachman 5 shared papers
- Emmanuel Bengio 4 shared papers
- Guillaume Rabusseau 4 shared papers
- Prakash Panangaden 4 shared papers
- Yoshua Bengio 4 shared papers
- Charles C. Onu 3 shared papers
- David Meger 3 shared papers
- Guilherme M. Sant'Anna 3 shared papers
- Jackie Chi Kit Cheung 3 shared papers
- Karen A. Brown 3 shared papers
- Lara J. Kanbar 3 shared papers
- Norman Ferns 3 shared papers
- Peter Henderson 3 shared papers
- Robert E. Kearney 3 shared papers
- Valentin Thomas 3 shared papers
- Wissam Shalish 3 shared papers
- Amir-massoud Farahmand 2 shared papers