Shie Mannor

Identifiers

  • name variant Shie Mannor 0.60 · backfill

Papers (113)

  1. Toward Micro-Endoscopy: Distal-Free, Configuration-Agnostic Focusing Through Multimode Fiber physics.optics · 2026 · author #5
  2. Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels stat.ME · 2026 · author #2
  3. The Value of Mechanistic Priors in Sequential Decision Making cs.LG · 2026 · author #3
  4. Simulating clinical interventions with a generative multimodal model of human physiology cs.AI · 2026 · author #7
  5. Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum cs.LG · 2026 · author #7
  6. Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces cs.LG · 2025 · author #8
  7. Representative Action Selection for Large Action Space Bandit Families cs.LG · 2025 · author #3
  8. Simulus: Combining Improvements in Sample-Efficient World Model Agents cs.LG · 2025 · author #5
  9. Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization cs.LG · 2024 · author #6
  10. A Bayesian Approach to Robust Reinforcement Learning cs.LG · 2019 · author #4
  11. The Natural Language of Actions cs.AI · 2019 · author #2
  12. Action Robust Reinforcement Learning and Applications in Continuous Control cs.LG · 2019 · author #3
  13. Trust Region Value Optimization using Kalman Filtering cs.LG · 2019 · author #2
  14. Multi Instance Learning For Unbalanced Data cs.LG · 2018 · author #6
  15. Inspiration Learning through Preferences cs.LG · 2018 · author #2
  16. Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning cs.LG · 2018 · author #5
  17. How to Combine Tree-Search Methods in Reinforcement Learning cs.LG · 2018 · author #4
  18. Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach cs.LG · 2018 · author #2
  19. Reward Constrained Policy Optimization cs.LG · 2018 · author #3
  20. Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning cs.LG · 2018 · author #4
  21. Nonlinear Distributional Gradient Temporal-Difference Learning cs.LG · 2018 · author #2
  22. Interpreting Electrical-Resistivity Tomography measurements using Neural Network physics.geo-ph · 2018 · author #4
  23. Interdependent Gibbs Samplers stat.ML · 2018 · author #2
  24. Deep Learning Reconstruction of Ultra-Short Pulses physics.optics · 2018 · author #4
  25. Soft-Robust Actor-Critic Policy-Gradient cs.LG · 2018 · author #4
  26. Train on Validation: Squeezing the Data Lemon stat.ML · 2018 · author #3
  27. Beyond the One Step Greedy Approach in Reinforcement Learning cs.AI · 2018 · author #4
  28. Learning Robust Options cs.AI · 2018 · author #5
  29. Chance-Constrained Outage Scheduling using a Machine Learning Proxy cs.CE · 2018 · author #3
  30. The Stochastic Firefighter Problem cs.SY · 2017 · author #3
  31. Situationally Aware Options cs.AI · 2017 · author #3
  32. Multi-objective Bandits: Optimizing the Generalized Gini Index cs.LG · 2017 · author #4
  33. Shallow Updates for Deep Reinforcement Learning cs.AI · 2017 · author #5
  34. Finite Sample Analyses for TD(0) with Function Approximation cs.AI · 2017 · author #4
  35. Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning cs.AI · 2017 · author #4
  36. Deep Robust Kalman Filter cs.AI · 2017 · author #2
  37. Online Learning with Many Experts cs.LG · 2017 · author #2
  38. Rotting Bandits stat.ML · 2017 · author #3
  39. Consistent On-Line Off-Policy Evaluation stat.ML · 2017 · author #2
  40. Outlier Robust Online Learning cs.LG · 2017 · author #3
  41. Adaptive Lambda Least-Squares Temporal Difference Learning cs.LG · 2016 · author #3
  42. Supervised Learning for Optimal Power Flow as a Real-Time Proxy cs.LG · 2016 · author #3
  43. Model-based Adversarial Imitation Learning stat.ML · 2016 · author #3
  44. Unit Commitment using Nearest Neighbor as a Short-Term Proxy cs.LG · 2016 · author #3
  45. Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce cs.CV · 2016 · author #4
  46. Situational Awareness by Risk-Conscious Skills cs.AI · 2016 · author #3
  47. A nonparametric sequential test for online randomized experiments stat.ML · 2016 · author #2
  48. Bayesian Reinforcement Learning: A Survey cs.AI · 2016 · author #2
  49. How to Allocate Resources For Features Acquisition? cs.AI · 2016 · author #2
  50. Visualizing Dynamics: from t-SNE to SEMI-MDPs stat.ML · 2016 · author #3
  51. Deep Reinforcement Learning Discovers Internal Models cs.AI · 2016 · author #3
  52. Bending the Curve: Improving the ROC Curve Through Error Redistribution cs.LG · 2016 · author #2
  53. A Reinforcement Learning System to Encourage Physical Activity in Diabetes Patients cs.CY · 2016 · author #4
  54. Clustering Time Series and the Surprising Robustness of HMMs cs.IT · 2016 · author #2
  55. Strategic Formation of Heterogeneous Networks cs.GT · 2016 · author #2
  56. A Deep Hierarchical Approach to Lifelong Learning in Minecraft cs.AI · 2016 · author #5
  57. Hierarchical Decision Making In Electricity Grid Management cs.AI · 2016 · author #3
  58. Adaptive Skills, Adaptive Partitions (ASAP) cs.LG · 2016 · author #3
  59. Iterative Hierarchical Optimization for Misspecified Problems (IHOMP) cs.LG · 2016 · author #3
  60. Graying the black box: Understanding DQNs cs.LG · 2016 · author #3
  61. Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms cs.LG · 2016 · author #6
  62. Distributed Scenario-Based Optimization for Asset Management in a Hierarchical Decision Making Environment cs.SY · 2016 · author #3
  63. Learn on Source, Refine on Target: A Model Transfer Learning Framework with Random Forests cs.LG · 2015 · author #3
  64. Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis stat.ML · 2015 · author #4
  65. Emphatic TD Bellman Operator is a Contraction stat.ML · 2015 · author #3
  66. Reinforcement Learning for the Unit Commitment Problem cs.AI · 2015 · author #2
  67. Bootstrapping Skills cs.AI · 2015 · author #3
  68. Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach cs.AI · 2015 · author #3
  69. Multi-user lax communications: a multi-armed bandit approach cs.LG · 2015 · author #2
  70. Overlapping Community Detection by Online Cluster Aggregation cs.LG · 2015 · author #2
  71. Overlapping Communities Detection via Measure Space Embedding cs.LG · 2015 · author #2
  72. Actively Learning to Attract Followers on Twitter stat.ML · 2015 · author #3
  73. Policy Gradient for Coherent Risk Measures cs.AI · 2015 · author #4
  74. Off-policy evaluation for MDPs with unknown structure stat.ML · 2015 · author #4
  75. Contextual Markov Decision Processes stat.ML · 2015 · author #3
  76. Formation Games of Reliable Networks cs.GT · 2014 · author #2
  77. Implicit Temporal Differences stat.ML · 2014 · author #3
  78. Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback cs.LG · 2014 · author #4
  79. Distributed Robust Learning stat.ML · 2014 · author #3
  80. Thompson Sampling for Learning Parameterized Markov Decision Processes stat.ML · 2014 · author #2
  81. Concurrent bandits and cognitive radio networks cs.LG · 2014 · author #2
  82. Optimizing the CVaR via Sampling stat.ML · 2014 · author #3
  83. Oracle-Based Robust Optimization via Online Learning math.OC · 2014 · author #4
  84. Localized epidemic detection in networks with overwhelming noise cs.SI · 2014 · author #4
  85. Thompson Sampling for Complex Bandit Problems stat.ML · 2013 · author #2
  86. Variance Adjusted Actor Critic Algorithms stat.ML · 2013 · author #2
  87. Distinguishing Infections on Different Graph Topologies cs.SI · 2013 · author #3
  88. Network Formation Games with Heterogeneous Players and the Internet Structure cs.GT · 2013 · author #2
  89. Scaling Up Robust MDPs by Reinforcement Learning cs.LG · 2013 · author #3
  90. Online Convex Optimization Against Adversaries with Memory and Application to Statistical Arbitrage cs.LG · 2013 · author #3
  91. Online Learning for Time Series Prediction cs.LG · 2013 · author #3
  92. Robust High Dimensional Sparse Regression and Matching Pursuit stat.ML · 2013 · author #3
  93. Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes cs.LG · 2013 · author #3
  94. The Perturbed Variation cs.LG · 2012 · author #2
  95. More Is Better: Large Scale Partially-supervised Sentiment Classification - Appendix cs.LG · 2012 · author #3
  96. How to sample if you must: on optimal functional sampling stat.ML · 2012 · author #2
  97. Clustered Bandits cs.LG · 2012 · author #3
  98. Decoupling Exploration and Exploitation in Multi-Armed Bandits cs.LG · 2012 · author #2
  99. Relaxed Half-Stochastic Belief Propagation cs.AR · 2012 · author #3
  100. Go Viral, or Not: Rate-Optimal Control for Resource-Constrained Branching Processes math.OC · 2012 · author #1
  101. Regulation, Volatility and Efficiency in Continuous-Time Markets cs.SY · 2011 · author #2
  102. Bandits with an Edge cs.LG · 2011 · author #3
  103. From Bandits to Experts: On the Value of Side-Observations cs.LG · 2011 · author #1
  104. A Maximal Large Deviation Inequality for Sub-Gaussian Variables cs.LG · 2011 · author #3
  105. Mean-Variance Optimization in Markov Decision Processes cs.LG · 2011 · author #1
  106. The Sample Complexity of Dictionary Learning stat.ML · 2010 · author #2
  107. Robustness and Generalization cs.LG · 2010 · author #2
  108. Adaptive Bases for Reinforcement Learning cs.LG · 2010 · author #2
  109. Learning from Multiple Outlooks cs.LG · 2010 · author #2
  110. Principal Component Analysis with Contaminated Data: The High Dimensional Case stat.ML · 2010 · author #3
  111. Robust Regression and Lasso cs.IT · 2008 · author #3
  112. Robustness and Regularization of Support Vector Machines cs.LG · 2008 · author #3
  113. Strategies for prediction under imperfect monitoring math.ST · 2007 · author #2

Mentions

  • 1404.5421 #2 · backfill · confidence 0.70 Shie Mannor
  • 1404.3862 #3 · backfill · confidence 0.70 Shie Mannor
  • 1402.6361 #4 · backfill · confidence 0.70 Shie Mannor
  • 1402.1263 #4 · backfill · confidence 0.70 Shie Mannor
  • 1311.0466 #2 · backfill · confidence 0.70 Shie Mannor
  • 1310.3697 #2 · backfill · confidence 0.70 Shie Mannor
  • 1309.6545 #3 · backfill · confidence 0.70 Shie Mannor
  • 2605.28506 #5 · arxiv_oai · confidence 0.70 Shie Mannor
  • 1307.4102 #2 · backfill · confidence 0.70 Shie Mannor
  • 1306.6189 #3 · backfill · confidence 0.70 Shie Mannor
  • 1302.6937 #3 · backfill · confidence 0.70 Shie Mannor
  • 1302.6927 #3 · backfill · confidence 0.70 Shie Mannor
  • 1301.2725 #3 · backfill · confidence 0.70 Shie Mannor
  • 1301.0104 #3 · backfill · confidence 0.70 Shie Mannor
  • 1210.4006 #2 · backfill · confidence 0.70 Shie Mannor
  • 1209.6329 #3 · backfill · confidence 0.70 Shie Mannor
  • 1208.2417 #2 · backfill · confidence 0.70 Shie Mannor
  • 1206.4169 #3 · backfill · confidence 0.70 Shie Mannor
  • 1205.2874 #2 · backfill · confidence 0.70 Shie Mannor
  • 1205.2428 #3 · backfill · confidence 0.70 Shie Mannor
  • 1203.1072 #1 · backfill · confidence 0.70 Shie Mannor
  • 1109.3151 #2 · backfill · confidence 0.70 Shie Mannor
  • 1109.2296 #3 · backfill · confidence 0.70 Shie Mannor
  • 2509.22963 #8 · arxiv_oai · confidence 0.70 Shie Mannor
  • 1106.2436 #1 · backfill · confidence 0.70 Shie Mannor
  • 1105.2550 #3 · backfill · confidence 0.70 Shie Mannor
  • 1104.5601 #1 · backfill · confidence 0.70 Shie Mannor
  • 2605.17559 #2 · arxiv_oai · confidence 0.70 Shie Mannor
  • 1011.5395 #2 · backfill · confidence 0.70 Shie Mannor
  • 1005.2243 #2 · backfill · confidence 0.70 Shie Mannor
  • 1005.0125 #2 · backfill · confidence 0.70 Shie Mannor
  • 1005.0027 #2 · backfill · confidence 0.70 Shie Mannor
  • 1002.4658 #3 · backfill · confidence 0.70 Shie Mannor
  • 0811.1790 #3 · backfill · confidence 0.70 Shie Mannor
  • 0803.3490 #3 · backfill · confidence 0.70 Shie Mannor

Frequent Coauthors