Shie Mannor

Identifiers

name variant Shie Mannor 0.60 · backfill

Papers (113)

Toward Micro-Endoscopy: Distal-Free, Configuration-Agnostic Focusing Through Multimode Fiber physics.optics · 2026 · author #5
Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels stat.ME · 2026 · author #2
The Value of Mechanistic Priors in Sequential Decision Making cs.LG · 2026 · author #3
Simulating clinical interventions with a generative multimodal model of human physiology cs.AI · 2026 · author #7
Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum cs.LG · 2026 · author #7
Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces cs.LG · 2025 · author #8
Representative Action Selection for Large Action Space Bandit Families cs.LG · 2025 · author #3
Simulus: Combining Improvements in Sample-Efficient World Model Agents cs.LG · 2025 · author #5
Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization cs.LG · 2024 · author #6
A Bayesian Approach to Robust Reinforcement Learning cs.LG · 2019 · author #4
The Natural Language of Actions cs.AI · 2019 · author #2
Action Robust Reinforcement Learning and Applications in Continuous Control cs.LG · 2019 · author #3
Trust Region Value Optimization using Kalman Filtering cs.LG · 2019 · author #2
Multi Instance Learning For Unbalanced Data cs.LG · 2018 · author #6
Inspiration Learning through Preferences cs.LG · 2018 · author #2
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning cs.LG · 2018 · author #5
How to Combine Tree-Search Methods in Reinforcement Learning cs.LG · 2018 · author #4
Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach cs.LG · 2018 · author #2
Reward Constrained Policy Optimization cs.LG · 2018 · author #3
Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning cs.LG · 2018 · author #4
Nonlinear Distributional Gradient Temporal-Difference Learning cs.LG · 2018 · author #2
Interpreting Electrical-Resistivity Tomography measurements using Neural Network physics.geo-ph · 2018 · author #4
Interdependent Gibbs Samplers stat.ML · 2018 · author #2
Deep Learning Reconstruction of Ultra-Short Pulses physics.optics · 2018 · author #4
Soft-Robust Actor-Critic Policy-Gradient cs.LG · 2018 · author #4
Train on Validation: Squeezing the Data Lemon stat.ML · 2018 · author #3
Beyond the One Step Greedy Approach in Reinforcement Learning cs.AI · 2018 · author #4
Learning Robust Options cs.AI · 2018 · author #5
Chance-Constrained Outage Scheduling using a Machine Learning Proxy cs.CE · 2018 · author #3
The Stochastic Firefighter Problem cs.SY · 2017 · author #3
Situationally Aware Options cs.AI · 2017 · author #3
Multi-objective Bandits: Optimizing the Generalized Gini Index cs.LG · 2017 · author #4
Shallow Updates for Deep Reinforcement Learning cs.AI · 2017 · author #5
Finite Sample Analyses for TD(0) with Function Approximation cs.AI · 2017 · author #4
Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning cs.AI · 2017 · author #4
Deep Robust Kalman Filter cs.AI · 2017 · author #2
Online Learning with Many Experts cs.LG · 2017 · author #2
Rotting Bandits stat.ML · 2017 · author #3
Consistent On-Line Off-Policy Evaluation stat.ML · 2017 · author #2
Outlier Robust Online Learning cs.LG · 2017 · author #3
Adaptive Lambda Least-Squares Temporal Difference Learning cs.LG · 2016 · author #3
Supervised Learning for Optimal Power Flow as a Real-Time Proxy cs.LG · 2016 · author #3
Model-based Adversarial Imitation Learning stat.ML · 2016 · author #3
Unit Commitment using Nearest Neighbor as a Short-Term Proxy cs.LG · 2016 · author #3
Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce cs.CV · 2016 · author #4
Situational Awareness by Risk-Conscious Skills cs.AI · 2016 · author #3
A nonparametric sequential test for online randomized experiments stat.ML · 2016 · author #2
Bayesian Reinforcement Learning: A Survey cs.AI · 2016 · author #2
How to Allocate Resources For Features Acquisition? cs.AI · 2016 · author #2
Visualizing Dynamics: from t-SNE to SEMI-MDPs stat.ML · 2016 · author #3
Deep Reinforcement Learning Discovers Internal Models cs.AI · 2016 · author #3
Bending the Curve: Improving the ROC Curve Through Error Redistribution cs.LG · 2016 · author #2
A Reinforcement Learning System to Encourage Physical Activity in Diabetes Patients cs.CY · 2016 · author #4
Clustering Time Series and the Surprising Robustness of HMMs cs.IT · 2016 · author #2
Strategic Formation of Heterogeneous Networks cs.GT · 2016 · author #2
A Deep Hierarchical Approach to Lifelong Learning in Minecraft cs.AI · 2016 · author #5
Hierarchical Decision Making In Electricity Grid Management cs.AI · 2016 · author #3
Adaptive Skills, Adaptive Partitions (ASAP) cs.LG · 2016 · author #3
Iterative Hierarchical Optimization for Misspecified Problems (IHOMP) cs.LG · 2016 · author #3
Graying the black box: Understanding DQNs cs.LG · 2016 · author #3
Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms cs.LG · 2016 · author #6
Distributed Scenario-Based Optimization for Asset Management in a Hierarchical Decision Making Environment cs.SY · 2016 · author #3
Learn on Source, Refine on Target:A Model Transfer Learning Framework with Random Forests cs.LG · 2015 · author #3
Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis stat.ML · 2015 · author #4
Emphatic TD Bellman Operator is a Contraction stat.ML · 2015 · author #3
Reinforcement Learning for the Unit Commitment Problem cs.AI · 2015 · author #2
Bootstrapping Skills cs.AI · 2015 · author #3
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach cs.AI · 2015 · author #3
Multi-user lax communications: a multi-armed bandit approach cs.LG · 2015 · author #2
Overlapping Community Detection by Online Cluster Aggregation cs.LG · 2015 · author #2
Overlapping Communities Detection via Measure Space Embedding cs.LG · 2015 · author #2
Actively Learning to Attract Followers on Twitter stat.ML · 2015 · author #3
Policy Gradient for Coherent Risk Measures cs.AI · 2015 · author #4
Off-policy evaluation for MDPs with unknown structure stat.ML · 2015 · author #4
Contextual Markov Decision Processes stat.ML · 2015 · author #3
Formation Games of Reliable Networks cs.GT · 2014 · author #2
Implicit Temporal Differences stat.ML · 2014 · author #3
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback cs.LG · 2014 · author #4
Distributed Robust Learning stat.ML · 2014 · author #3
Thompson Sampling for Learning Parameterized Markov Decision Processes stat.ML · 2014 · author #2
Concurrent bandits and cognitive radio networks cs.LG · 2014 · author #2
Optimizing the CVaR via Sampling stat.ML · 2014 · author #3
Oracle-Based Robust Optimization via Online Learning math.OC · 2014 · author #4
Localized epidemic detection in networks with overwhelming noise cs.SI · 2014 · author #4
Thompson Sampling for Complex Bandit Problems stat.ML · 2013 · author #2
Variance Adjusted Actor Critic Algorithms stat.ML · 2013 · author #2
Distinguishing Infections on Different Graph Topologies cs.SI · 2013 · author #3
Network Formation Games with Heterogeneous Players and the Internet Structure cs.GT · 2013 · author #2
Scaling Up Robust MDPs by Reinforcement Learning cs.LG · 2013 · author #3
Online Convex Optimization Against Adversaries with Memory and Application to Statistical Arbitrage cs.LG · 2013 · author #3
Online Learning for Time Series Prediction cs.LG · 2013 · author #3
Robust High Dimensional Sparse Regression and Matching Pursuit stat.ML · 2013 · author #3
Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes cs.LG · 2013 · author #3
The Perturbed Variation cs.LG · 2012 · author #2
More Is Better: Large Scale Partially-supervised Sentiment Classification - Appendix cs.LG · 2012 · author #3
How to sample if you must: on optimal functional sampling stat.ML · 2012 · author #2
Clustered Bandits cs.LG · 2012 · author #3
Decoupling Exploration and Exploitation in Multi-Armed Bandits cs.LG · 2012 · author #2
Relaxed Half-Stochastic Belief Propagation cs.AR · 2012 · author #3
Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes math.OC · 2012 · author #1
Regulation, Volatility and Efficiency in Continuous-Time Markets cs.SY · 2011 · author #2
Bandits with an Edge cs.LG · 2011 · author #3
From Bandits to Experts: On the Value of Side-Observations cs.LG · 2011 · author #1
A Maximal Large Deviation Inequality for Sub-Gaussian Variables cs.LG · 2011 · author #3
Mean-Variance Optimization in Markov Decision Processes cs.LG · 2011 · author #1
The Sample Complexity of Dictionary Learning stat.ML · 2010 · author #2
Robustness and Generalization cs.LG · 2010 · author #2
Adaptive Bases for Reinforcement Learning cs.LG · 2010 · author #2
Learning from Multiple Outlooks cs.LG · 2010 · author #2
Principal Component Analysis with Contaminated Data: The High Dimensional Case stat.ML · 2010 · author #3
Robust Regression and Lasso cs.IT · 2008 · author #3
Robustness and Regularization of Support Vector Machines cs.LG · 2008 · author #3
Strategies for prediction under imperfect monitoring math.ST · 2007 · author #2

Mentions

1404.5421 #2 · backfill · confidence 0.70 Shie Mannor
1404.3862 #3 · backfill · confidence 0.70 Shie Mannor
1402.6361 #4 · backfill · confidence 0.70 Shie Mannor
1402.1263 #4 · backfill · confidence 0.70 Shie Mannor
1311.0466 #2 · backfill · confidence 0.70 Shie Mannor
1310.3697 #2 · backfill · confidence 0.70 Shie Mannor
1309.6545 #3 · backfill · confidence 0.70 Shie Mannor
2605.28506 #5 · arxiv_oai · confidence 0.70 Shie Mannor
1307.4102 #2 · backfill · confidence 0.70 Shie Mannor
1306.6189 #3 · backfill · confidence 0.70 Shie Mannor
1302.6937 #3 · backfill · confidence 0.70 Shie Mannor
1302.6927 #3 · backfill · confidence 0.70 Shie Mannor
1301.2725 #3 · backfill · confidence 0.70 Shie Mannor
1301.0104 #3 · backfill · confidence 0.70 Shie Mannor
1210.4006 #2 · backfill · confidence 0.70 Shie Mannor
1209.6329 #3 · backfill · confidence 0.70 Shie Mannor
1208.2417 #2 · backfill · confidence 0.70 Shie Mannor
1206.4169 #3 · backfill · confidence 0.70 Shie Mannor
1205.2874 #2 · backfill · confidence 0.70 Shie Mannor
1205.2428 #3 · backfill · confidence 0.70 Shie Mannor
1203.1072 #1 · backfill · confidence 0.70 Shie Mannor
1109.3151 #2 · backfill · confidence 0.70 Shie Mannor
1109.2296 #3 · backfill · confidence 0.70 Shie Mannor
2509.22963 #8 · arxiv_oai · confidence 0.70 Shie Mannor
1106.2436 #1 · backfill · confidence 0.70 Shie Mannor
1105.2550 #3 · backfill · confidence 0.70 Shie Mannor
1104.5601 #1 · backfill · confidence 0.70 Shie Mannor
2605.17559 #2 · arxiv_oai · confidence 0.70 Shie Mannor
1011.5395 #2 · backfill · confidence 0.70 Shie Mannor
1005.2243 #2 · backfill · confidence 0.70 Shie Mannor
1005.0125 #2 · backfill · confidence 0.70 Shie Mannor
1005.0027 #2 · backfill · confidence 0.70 Shie Mannor
1002.4658 #3 · backfill · confidence 0.70 Shie Mannor
0811.1790 #3 · backfill · confidence 0.70 Shie Mannor
0803.3490 #3 · backfill · confidence 0.70 Shie Mannor

Frequent Coauthors

Aviv Tamar 13 shared papers
Daniel J. Mankowitz 11 shared papers
Gal Dalal 11 shared papers
Tom Zahavy 10 shared papers
Huan Xu 9 shared papers
Constantine Caramanis 7 shared papers
Mark Kozdoba 7 shared papers
Timothy A. Mann 7 shared papers
Assaf Hallak 6 shared papers
Dotan Di Castro 5 shared papers
Ariel Orda 4 shared papers
Chen Tessler 4 shared papers
Elad Gilboa 4 shared papers
Koby Crammer 4 shared papers
Ohad Shamir 4 shared papers
Orly Avner 4 shared papers
Yonathan Efroni 4 shared papers
Bruno Scherrer 3 shared papers
Claudio Gentile 3 shared papers
Elad Hazan 3 shared papers