Shie Mannor
Identifiers
- name variant Shie Mannor 0.60 · backfill
Papers (113)
- Toward Micro-Endoscopy: Distal-Free, Configuration-Agnostic Focusing Through Multimode Fiber physics.optics · 2026 · author #5
- Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels stat.ME · 2026 · author #2
- The Value of Mechanistic Priors in Sequential Decision Making cs.LG · 2026 · author #3
- Simulating clinical interventions with a generative multimodal model of human physiology cs.AI · 2026 · author #7
- Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum cs.LG · 2026 · author #7
- Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces cs.LG · 2025 · author #8
- Representative Action Selection for Large Action Space Bandit Families cs.LG · 2025 · author #3
- Simulus: Combining Improvements in Sample-Efficient World Model Agents cs.LG · 2025 · author #5
- Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization cs.LG · 2024 · author #6
- A Bayesian Approach to Robust Reinforcement Learning cs.LG · 2019 · author #4
- The Natural Language of Actions cs.AI · 2019 · author #2
- Action Robust Reinforcement Learning and Applications in Continuous Control cs.LG · 2019 · author #3
- Trust Region Value Optimization using Kalman Filtering cs.LG · 2019 · author #2
- Multi Instance Learning For Unbalanced Data cs.LG · 2018 · author #6
- Inspiration Learning through Preferences cs.LG · 2018 · author #2
- Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning cs.LG · 2018 · author #5
- How to Combine Tree-Search Methods in Reinforcement Learning cs.LG · 2018 · author #4
- Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach cs.LG · 2018 · author #2
- Reward Constrained Policy Optimization cs.LG · 2018 · author #3
- Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning cs.LG · 2018 · author #4
- Nonlinear Distributional Gradient Temporal-Difference Learning cs.LG · 2018 · author #2
- Interpreting Electrical-Resistivity Tomography measurements using Neural Network physics.geo-ph · 2018 · author #4
- Interdependent Gibbs Samplers stat.ML · 2018 · author #2
- Deep Learning Reconstruction of Ultra-Short Pulses physics.optics · 2018 · author #4
- Soft-Robust Actor-Critic Policy-Gradient cs.LG · 2018 · author #4
- Train on Validation: Squeezing the Data Lemon stat.ML · 2018 · author #3
- Beyond the One Step Greedy Approach in Reinforcement Learning cs.AI · 2018 · author #4
- Learning Robust Options cs.AI · 2018 · author #5
- Chance-Constrained Outage Scheduling using a Machine Learning Proxy cs.CE · 2018 · author #3
- The Stochastic Firefighter Problem cs.SY · 2017 · author #3
- Situationally Aware Options cs.AI · 2017 · author #3
- Multi-objective Bandits: Optimizing the Generalized Gini Index cs.LG · 2017 · author #4
- Shallow Updates for Deep Reinforcement Learning cs.AI · 2017 · author #5
- Finite Sample Analyses for TD(0) with Function Approximation cs.AI · 2017 · author #4
- Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning cs.AI · 2017 · author #4
- Deep Robust Kalman Filter cs.AI · 2017 · author #2
- Online Learning with Many Experts cs.LG · 2017 · author #2
- Rotting Bandits stat.ML · 2017 · author #3
- Consistent On-Line Off-Policy Evaluation stat.ML · 2017 · author #2
- Outlier Robust Online Learning cs.LG · 2017 · author #3
- Adaptive Lambda Least-Squares Temporal Difference Learning cs.LG · 2016 · author #3
- Supervised Learning for Optimal Power Flow as a Real-Time Proxy cs.LG · 2016 · author #3
- Model-based Adversarial Imitation Learning stat.ML · 2016 · author #3
- Unit Commitment using Nearest Neighbor as a Short-Term Proxy cs.LG · 2016 · author #3
- Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce cs.CV · 2016 · author #4
- Situational Awareness by Risk-Conscious Skills cs.AI · 2016 · author #3
- A nonparametric sequential test for online randomized experiments stat.ML · 2016 · author #2
- Bayesian Reinforcement Learning: A Survey cs.AI · 2016 · author #2
- How to Allocate Resources For Features Acquisition? cs.AI · 2016 · author #2
- Visualizing Dynamics: from t-SNE to SEMI-MDPs stat.ML · 2016 · author #3
- Deep Reinforcement Learning Discovers Internal Models cs.AI · 2016 · author #3
- Bending the Curve: Improving the ROC Curve Through Error Redistribution cs.LG · 2016 · author #2
- A Reinforcement Learning System to Encourage Physical Activity in Diabetes Patients cs.CY · 2016 · author #4
- Clustering Time Series and the Surprising Robustness of HMMs cs.IT · 2016 · author #2
- Strategic Formation of Heterogeneous Networks cs.GT · 2016 · author #2
- A Deep Hierarchical Approach to Lifelong Learning in Minecraft cs.AI · 2016 · author #5
- Hierarchical Decision Making In Electricity Grid Management cs.AI · 2016 · author #3
- Adaptive Skills, Adaptive Partitions (ASAP) cs.LG · 2016 · author #3
- Iterative Hierarchical Optimization for Misspecified Problems (IHOMP) cs.LG · 2016 · author #3
- Graying the black box: Understanding DQNs cs.LG · 2016 · author #3
- Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms cs.LG · 2016 · author #6
- Distributed Scenario-Based Optimization for Asset Management in a Hierarchical Decision Making Environment cs.SY · 2016 · author #3
- Learn on Source, Refine on Target: A Model Transfer Learning Framework with Random Forests cs.LG · 2015 · author #3
- Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis stat.ML · 2015 · author #4
- Emphatic TD Bellman Operator is a Contraction stat.ML · 2015 · author #3
- Reinforcement Learning for the Unit Commitment Problem cs.AI · 2015 · author #2
- Bootstrapping Skills cs.AI · 2015 · author #3
- Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach cs.AI · 2015 · author #3
- Multi-user lax communications: a multi-armed bandit approach cs.LG · 2015 · author #2
- Overlapping Community Detection by Online Cluster Aggregation cs.LG · 2015 · author #2
- Overlapping Communities Detection via Measure Space Embedding cs.LG · 2015 · author #2
- Actively Learning to Attract Followers on Twitter stat.ML · 2015 · author #3
- Policy Gradient for Coherent Risk Measures cs.AI · 2015 · author #4
- Off-policy evaluation for MDPs with unknown structure stat.ML · 2015 · author #4
- Contextual Markov Decision Processes stat.ML · 2015 · author #3
- Formation Games of Reliable Networks cs.GT · 2014 · author #2
- Implicit Temporal Differences stat.ML · 2014 · author #3
- Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback cs.LG · 2014 · author #4
- Distributed Robust Learning stat.ML · 2014 · author #3
- Thompson Sampling for Learning Parameterized Markov Decision Processes stat.ML · 2014 · author #2
- Concurrent bandits and cognitive radio networks cs.LG · 2014 · author #2
- Optimizing the CVaR via Sampling stat.ML · 2014 · author #3
- Oracle-Based Robust Optimization via Online Learning math.OC · 2014 · author #4
- Localized epidemic detection in networks with overwhelming noise cs.SI · 2014 · author #4
- Thompson Sampling for Complex Bandit Problems stat.ML · 2013 · author #2
- Variance Adjusted Actor Critic Algorithms stat.ML · 2013 · author #2
- Distinguishing Infections on Different Graph Topologies cs.SI · 2013 · author #3
- Network Formation Games with Heterogeneous Players and the Internet Structure cs.GT · 2013 · author #2
- Scaling Up Robust MDPs by Reinforcement Learning cs.LG · 2013 · author #3
- Online Convex Optimization Against Adversaries with Memory and Application to Statistical Arbitrage cs.LG · 2013 · author #3
- Online Learning for Time Series Prediction cs.LG · 2013 · author #3
- Robust High Dimensional Sparse Regression and Matching Pursuit stat.ML · 2013 · author #3
- Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes cs.LG · 2013 · author #3
- The Perturbed Variation cs.LG · 2012 · author #2
- More Is Better: Large Scale Partially-supervised Sentiment Classification - Appendix cs.LG · 2012 · author #3
- How to sample if you must: on optimal functional sampling stat.ML · 2012 · author #2
- Clustered Bandits cs.LG · 2012 · author #3
- Decoupling Exploration and Exploitation in Multi-Armed Bandits cs.LG · 2012 · author #2
- Relaxed Half-Stochastic Belief Propagation cs.AR · 2012 · author #3
- Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes math.OC · 2012 · author #1
- Regulation, Volatility and Efficiency in Continuous-Time Markets cs.SY · 2011 · author #2
- Bandits with an Edge cs.LG · 2011 · author #3
- From Bandits to Experts: On the Value of Side-Observations cs.LG · 2011 · author #1
- A Maximal Large Deviation Inequality for Sub-Gaussian Variables cs.LG · 2011 · author #3
- Mean-Variance Optimization in Markov Decision Processes cs.LG · 2011 · author #1
- The Sample Complexity of Dictionary Learning stat.ML · 2010 · author #2
- Robustness and Generalization cs.LG · 2010 · author #2
- Adaptive Bases for Reinforcement Learning cs.LG · 2010 · author #2
- Learning from Multiple Outlooks cs.LG · 2010 · author #2
- Principal Component Analysis with Contaminated Data: The High Dimensional Case stat.ML · 2010 · author #3
- Robust Regression and Lasso cs.IT · 2008 · author #3
- Robustness and Regularization of Support Vector Machines cs.LG · 2008 · author #3
- Strategies for prediction under imperfect monitoring math.ST · 2007 · author #2
Mentions
- 1404.5421 #2 · backfill · confidence 0.70 Shie Mannor
- 1404.3862 #3 · backfill · confidence 0.70 Shie Mannor
- 1402.6361 #4 · backfill · confidence 0.70 Shie Mannor
- 1402.1263 #4 · backfill · confidence 0.70 Shie Mannor
- 1311.0466 #2 · backfill · confidence 0.70 Shie Mannor
- 1310.3697 #2 · backfill · confidence 0.70 Shie Mannor
- 1309.6545 #3 · backfill · confidence 0.70 Shie Mannor
- 2605.28506 #5 · arxiv_oai · confidence 0.70 Shie Mannor
- 1307.4102 #2 · backfill · confidence 0.70 Shie Mannor
- 1306.6189 #3 · backfill · confidence 0.70 Shie Mannor
- 1302.6937 #3 · backfill · confidence 0.70 Shie Mannor
- 1302.6927 #3 · backfill · confidence 0.70 Shie Mannor
- 1301.2725 #3 · backfill · confidence 0.70 Shie Mannor
- 1301.0104 #3 · backfill · confidence 0.70 Shie Mannor
- 1210.4006 #2 · backfill · confidence 0.70 Shie Mannor
- 1209.6329 #3 · backfill · confidence 0.70 Shie Mannor
- 1208.2417 #2 · backfill · confidence 0.70 Shie Mannor
- 1206.4169 #3 · backfill · confidence 0.70 Shie Mannor
- 1205.2874 #2 · backfill · confidence 0.70 Shie Mannor
- 1205.2428 #3 · backfill · confidence 0.70 Shie Mannor
- 1203.1072 #1 · backfill · confidence 0.70 Shie Mannor
- 1109.3151 #2 · backfill · confidence 0.70 Shie Mannor
- 1109.2296 #3 · backfill · confidence 0.70 Shie Mannor
- 2509.22963 #8 · arxiv_oai · confidence 0.70 Shie Mannor
- 1106.2436 #1 · backfill · confidence 0.70 Shie Mannor
- 1105.2550 #3 · backfill · confidence 0.70 Shie Mannor
- 1104.5601 #1 · backfill · confidence 0.70 Shie Mannor
- 2605.17559 #2 · arxiv_oai · confidence 0.70 Shie Mannor
- 1011.5395 #2 · backfill · confidence 0.70 Shie Mannor
- 1005.2243 #2 · backfill · confidence 0.70 Shie Mannor
- 1005.0125 #2 · backfill · confidence 0.70 Shie Mannor
- 1005.0027 #2 · backfill · confidence 0.70 Shie Mannor
- 1002.4658 #3 · backfill · confidence 0.70 Shie Mannor
- 0811.1790 #3 · backfill · confidence 0.70 Shie Mannor
- 0803.3490 #3 · backfill · confidence 0.70 Shie Mannor
Frequent Coauthors
- Aviv Tamar 13 shared papers
- Daniel J. Mankowitz 11 shared papers
- Gal Dalal 11 shared papers
- Tom Zahavy 10 shared papers
- Huan Xu 9 shared papers
- Constantine Caramanis 7 shared papers
- Mark Kozdoba 7 shared papers
- Timothy A. Mann 7 shared papers
- Assaf Hallak 6 shared papers
- Dotan Di Castro 5 shared papers
- Ariel Orda 4 shared papers
- Chen Tessler 4 shared papers
- Elad Gilboa 4 shared papers
- Koby Crammer 4 shared papers
- Ohad Shamir 4 shared papers
- Orly Avner 4 shared papers
- Yonathan Efroni 4 shared papers
- Bruno Scherrer 3 shared papers
- Claudio Gentile 3 shared papers
- Elad Hazan 3 shared papers