Doina Precup — Pith Author Registry

Identifiers

name variant Doina Precup 0.60 · backfill

Papers (54)

Rotation-Preserving Supervised Fine-Tuning cs.LG · 2026 · author #6
RL Fine-Tuning Heals OOD Forgetting in SFT cs.LG · 2025 · author #7
Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #16
Recurrent Value Functions cs.LG · 2019 · author #4
Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks cs.RO · 2019 · author #4
Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials cs.LG · 2019 · author #2
The Termination Critic cs.AI · 2019 · author #6
Clustering-Oriented Representation Learning with Attractive-Repulsive Loss cs.LG · 2018 · author #6
Environments for Lifelong Reinforcement Learning cs.AI · 2018 · author #4
The Barbados 2018 List of Open Issues in Continual Learning cs.AI · 2018 · author #10
Temporal Regularization in Markov Decision Process cs.LG · 2018 · author #4
Combined Reinforcement Learning via Abstract Representations cs.LG · 2018 · author #3
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants cs.LG · 2018 · author #7
Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing cs.LG · 2018 · author #7
A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants eess.SP · 2018 · author #7
Exploring Uncertainty Measures in Deep Networks for Multiple Sclerosis Lesion Detection and Segmentation cs.CV · 2018 · author #2
Attend Before you Act: Leveraging human visual attention for continual learning cs.AI · 2018 · author #2
Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning cs.LG · 2018 · author #3
Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization cs.CL · 2018 · author #3
Dyna Planning using a Feature Based Generative Model cs.LG · 2018 · author #2
Learning Safe Policies with Expert Guidance cs.LG · 2018 · author #3
Disentangling the independently controllable factors of variation by interacting with the world stat.ML · 2018 · author #8
Learning Robust Options cs.AI · 2018 · author #4
Learnings Options End-to-End for Continuous Action Tasks cs.LG · 2017 · author #4
Ubenwa: Cry-based Diagnosis of Birth Asphyxia stat.ML · 2017 · author #5
Learning with Options that Terminate Off-Policy cs.AI · 2017 · author #4
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning cs.LG · 2017 · author #6
Deep Reinforcement Learning that Matters cs.LG · 2017 · author #5
When Waiting is not an Option : Learning Options with a Deliberation Cost cs.AI · 2017 · author #4
Neural Network Based Nonlinear Weighted Finite Automata cs.FL · 2017 · author #3
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control cs.LG · 2017 · author #4
Independently Controllable Factors cs.LG · 2017 · author #8
Variational Generative Stochastic Networks with Collaborative Shaping cs.LG · 2017 · author #2
Convergent Tree Backup and Retrace with Function Approximation cs.LG · 2017 · author #3
Investigating Recurrence and Eligibility Traces in Deep Q-Networks cs.AI · 2017 · author #2
Independently Controllable Features cs.LG · 2017 · author #4
Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options cs.AI · 2017 · author #2
A Matrix Splitting Perspective on Planning with Options cs.AI · 2016 · author #2
The Option-Critic Architecture cs.AI · 2016 · author #3
Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data cs.CL · 2016 · author #4
Differentially Private Policy Evaluation cs.LG · 2016 · author #3
Policy Gradient Methods for Off-policy Control cs.AI · 2015 · author #2
Conditional Computation in Neural Networks for faster models cs.LG · 2015 · author #4
Testing Visual Attention in Dynamic Environments cs.LG · 2015 · author #3
Data Generation as Sequential Decision Making cs.LG · 2015 · author #2
A Canonical Form for Weighted Automata and Applications to Approximate Minimization cs.FL · 2015 · author #3
Learning with Pseudo-Ensembles stat.ML · 2014 · author #3
Practical Kernel-Based Reinforcement Learning cs.LG · 2014 · author #2
Classification-based Approximate Policy Iteration: Experiments and Extended Discussions cs.LG · 2014 · author #2
Algorithms for multi-armed bandit problems cs.AI · 2014 · author #2
Bellman Error Based Feature Generation using Random Projections on Sparse Spaces cs.LG · 2012 · author #5
Metrics for Finite Markov Decision Processes cs.AI · 2012 · author #3
Metrics for Markov Decision Processes with Infinite State Spaces cs.AI · 2012 · author #3
Methods for computing state similarity in Markov Decision Processes cs.AI · 2012 · author #3

Mentions

1207.5554 #5 · backfill · confidence 0.70 Doina Precup
1207.4114 #3 · backfill · confidence 0.70 Doina Precup
1207.1386 #3 · backfill · confidence 0.70 Doina Precup
1206.6836 #3 · backfill · confidence 0.70 Doina Precup
2409.12917 #16 · arxiv_oai · confidence 0.70 Doina Precup

Frequent Coauthors

Joelle Pineau 11 shared papers
Pierre-Luc Bacon 11 shared papers
Jean Harb 5 shared papers
Philip Bachman 5 shared papers
Emmanuel Bengio 4 shared papers
Guillaume Rabusseau 4 shared papers
Prakash Panangaden 4 shared papers
Yoshua Bengio 4 shared papers
Charles C. Onu 3 shared papers
David Meger 3 shared papers
Guilherme M. Sant'Anna 3 shared papers
Jackie Chi Kit Cheung 3 shared papers
Karen A. Brown 3 shared papers
Lara J. Kanbar 3 shared papers
Norman Ferns 3 shared papers
Peter Henderson 3 shared papers
Robert E. Kearney 3 shared papers
Valentin Thomas 3 shared papers
Wissam Shalish 3 shared papers
Amir-massoud Farahmand 2 shared papers