Search the dblp DataBase
Shie Mannor :
[Publications ]
[Author Rank by year ]
[Co-authors ]
[Prefers ]
[Cites ]
[Cited by ]
Publications of Author
Constantine Caramanis , Shie Mannor An Inequality for Nearly Log-Concave Distributions with Applications to Learning. [Citation Graph (0, 0)][DBLP ] COLT, 2004, pp:534-548 [Conf ] Eyal Even-Dar , Shie Mannor , Yishay Mansour PAC Bounds for Multi-armed Bandit and Markov Decision Processes. [Citation Graph (0, 0)][DBLP ] COLT, 2002, pp:255-270 [Conf ] Shie Mannor Reinforcement Learning for Average Reward Zero-Sum Games. [Citation Graph (0, 0)][DBLP ] COLT, 2004, pp:49-63 [Conf ] Shie Mannor , Ron Meir Geometric Bounds for Generalization in Boosting. [Citation Graph (0, 0)][DBLP ] COLT/EuroCOLT, 2001, pp:461-472 [Conf ] Shie Mannor , Ron Meir , Tong Zhang The Consistency of Greedy Algorithms for Classification. [Citation Graph (0, 0)][DBLP ] COLT, 2002, pp:319-333 [Conf ] Shie Mannor , Nahum Shimkin Adaptive Strategies and Regret Minimization in Arbitrarily Varying Markov Environments. [Citation Graph (0, 0)][DBLP ] COLT/EuroCOLT, 2001, pp:128-142 [Conf ] Shie Mannor , Nahum Shimkin On-Line Learning with Imperfect Monitoring. [Citation Graph (0, 0)][DBLP ] COLT, 2003, pp:552-566 [Conf ] Shie Mannor , Nahum Shimkin Online Learning with Variable Stage Duration. [Citation Graph (0, 0)][DBLP ] COLT, 2006, pp:408-422 [Conf ] Shie Mannor , John N. Tsitsiklis Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem. [Citation Graph (0, 0)][DBLP ] COLT, 2003, pp:418-432 [Conf ] Shie Mannor , John N. Tsitsiklis Online Learning with Constraints. [Citation Graph (0, 0)][DBLP ] COLT, 2006, pp:529-543 [Conf ] Yaakov Engel , Shie Mannor , Ron Meir Sparse Online Greedy Support Vector Regression. [Citation Graph (0, 0)][DBLP ] ECML, 2002, pp:84-96 [Conf ] Ishai Menache , Shie Mannor , Nahum Shimkin Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning. [Citation Graph (0, 0)][DBLP ] ECML, 2002, pp:295-306 [Conf ] Yaakov Engel , Shie Mannor Learning Embedded Maps of Markov Processes. [Citation Graph (0, 0)][DBLP ] ICML, 2001, pp:138-145 [Conf ] Yaakov Engel , Shie Mannor , Ron Meir Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning. [Citation Graph (0, 0)][DBLP ] ICML, 2003, pp:154-161 [Conf ] Yaakov Engel , Shie Mannor , Ron Meir Reinforcement learning with Gaussian processes. [Citation Graph (0, 0)][DBLP ] ICML, 2005, pp:201-208 [Conf ] Eyal Even-Dar , Shie Mannor , Yishay Mansour Action Elimination and Stopping Conditions for Reinforcement Learning. [Citation Graph (0, 0)][DBLP ] ICML, 2003, pp:162-169 [Conf ] Philipp W. Keller , Shie Mannor , Doina Precup Automatic basis function construction for approximate dynamic programming and reinforcement learning. [Citation Graph (0, 0)][DBLP ] ICML, 2006, pp:449-456 [Conf ] Shie Mannor , Ishai Menache , Amit Hoze , Uri Klein Dynamic abstraction in reinforcement learning via clustering. [Citation Graph (0, 0)][DBLP ] ICML, 2004, pp:- [Conf ] Shie Mannor , Dori Peleg , Reuven Y. Rubinstein The cross entropy method for classification. [Citation Graph (0, 0)][DBLP ] ICML, 2005, pp:561-568 [Conf ] Shie Mannor , Reuven Y. Rubinstein , Yohai Gat The Cross Entropy Method for Fast Policy Search. [Citation Graph (0, 0)][DBLP ] ICML, 2003, pp:512-519 [Conf ] Shie Mannor , Duncan Simester , Peng Sun , John N. Tsitsiklis Bias and variance in value function estimation. [Citation Graph (0, 0)][DBLP ] ICML, 2004, pp:- [Conf ] Shie Mannor , Ron Meir Weak Learners and Improved Rates of Convergence in Boosting. [Citation Graph (0, 0)][DBLP ] NIPS, 2000, pp:280-286 [Conf ] Shie Mannor , Nahum Shimkin The Steering Approach for Multi-Criteria Reinforcement Learning. [Citation Graph (0, 0)][DBLP ] NIPS, 2001, pp:1563-1570 [Conf ] Ion Muslea , Virginia Dignum , Daniel D. Corkill , Catholijn M. Jonker , Frank Dignum , Silvia Coradeschi , Alessandro Saffiotti , Dan Fu , Jeff Orkin , William Cheetham , Kai Goebel , Piero P. Bonissone , Leen-Kiat Soh , Randolph M. Jones , Robert E. Wray III , Matthias Scheutz , Daniela Pucci de Farias , Shie Mannor , Georgios Theocharous , Doina Precup , Bamshad Mobasher , Sarabjot S. Anand , Bettina Berendt , Andreas Hotho , Hans W. Guesgen , Michael T. Rosenstein , Mohammad Ghavamzadeh The Workshop Program at the Nineteenth National Conference on Artificial Intelligence. [Citation Graph (0, 0)][DBLP ] AI Magazine, 2005, v:26, n:1, pp:103-108 [Journal ] Shie Mannor , Ron Meir , Tong Zhang Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity. [Citation Graph (0, 0)][DBLP ] Journal of Machine Learning Research, 2003, v:4, n:, pp:713-741 [Journal ] Shie Mannor , Nahum Shimkin A Geometric Approach to Multi-Criterion Reinforcement Learning. [Citation Graph (0, 0)][DBLP ] Journal of Machine Learning Research, 2004, v:5, n:, pp:325-360 [Journal ] Shie Mannor , John N. Tsitsiklis The Sample Complexity of Exploration in the Multi-Armed Bandit Problem. [Citation Graph (0, 0)][DBLP ] Journal of Machine Learning Research, 2004, v:5, n:, pp:623-648 [Journal ] Eyal Even-Dar , Shie Mannor , Yishay Mansour Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems. [Citation Graph (0, 0)][DBLP ] Journal of Machine Learning Research, 2006, v:7, n:, pp:1079-1105 [Journal ] Shie Mannor , Ron Meir On the Existence of Linear Weak Learners and Applications to Boosting. [Citation Graph (0, 0)][DBLP ] Machine Learning, 2002, v:48, n:1-3, pp:219-251 [Journal ] Shie Mannor , Nahum Shimkin The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes. [Citation Graph (0, 0)][DBLP ] Math. Oper. Res., 2003, v:28, n:2, pp:327-345 [Journal ] Constantine Caramanis , Shie Mannor An Inequality for Nearly Log-Concave Distributions With Applications to Learning. [Citation Graph (0, 0)][DBLP ] IEEE Transactions on Information Theory, 2007, v:53, n:3, pp:1043-1057 [Journal ] Chih-Han Yu , Shie Mannor , Georgios Theocharous , Avi Pfeffer User Model and Utility Based Power Management. [Citation Graph (0, 0)][DBLP ] AAAI, 2007, pp:1918-1919 [Conf ] Branislav Kveton , Prashant Gandhi , Georgios Theocharous , Shie Mannor , Barbara Rosario , Nilesh Shah Adaptive Timeout Policies for Fast Fine-Grained Power Management. [Citation Graph (0, 0)][DBLP ] AAAI, 2007, pp:1795-1800 [Conf ] Gábor Lugosi , Shie Mannor , Gilles Stoltz Strategies for Prediction Under Imperfect Monitoring. [Citation Graph (0, 0)][DBLP ] COLT, 2007, pp:248-262 [Conf ] Erick Delage , Shie Mannor Percentile optimization in uncertain Markov decision processes with application to efficient exploration. [Citation Graph (0, 0)][DBLP ] ICML, 2007, pp:225-232 [Conf ] Jia Yuan Yu , Shie Mannor Asymptotics of Efficiency Loss in Competitive Market Mechanisms. [Citation Graph (0, 0)][DBLP ] INFOCOM, 2006, pp:- [Conf ] Huan Xu , Shie Mannor The Robustness-Performance Tradeoff in Markov Decision Processes. [Citation Graph (0, 0)][DBLP ] NIPS, 2006, pp:1537-1544 [Conf ] Shie Mannor , Jeff S. Shamma Multi-agent learning for engineers. [Citation Graph (0, 0)][DBLP ] Artif. Intell., 2007, v:171, n:7, pp:417-422 [Journal ] Gábor Lugosi , Shie Mannor , Gilles Stoltz Strategies for prediction under imperfect monitoring [Citation Graph (0, 0)][DBLP ] CoRR, 2007, v:0, n:, pp:- [Journal ] Shie Mannor , Jeff S. Shamma , Gürdal Arslan Online calibrated forecasts: Memory efficiency versus universality for learning in games. [Citation Graph (0, 0)][DBLP ] Machine Learning, 2007, v:67, n:1-2, pp:77-115 [Journal ] Online Learning with Expert Advice and Finite-Horizon Constraints. [Citation Graph (, )][DBLP ] Activity and Gait Recognition with Time-Delay Embeddings. [Citation Graph (, )][DBLP ] Learning in the Limit with Adversarial Disturbances. [Citation Graph (, )][DBLP ] Stochastic Decoding of LDPC Codes over GF(q). [Citation Graph (, )][DBLP ] Resource Allocation with Supply Adjustment in Distributed Computing Systems. [Citation Graph (, )][DBLP ] Reinforcement learning in the presence of rare events. [Citation Graph (, )][DBLP ] Piecewise-stationary bandit problems with side observations. [Citation Graph (, )][DBLP ] Survey of Stochastic Computation on Factor Graphs. [Citation Graph (, )][DBLP ] Reinforcement Learning-Based Load Shared Sequential Routing. [Citation Graph (, )][DBLP ] Regularized Policy Iteration. [Citation Graph (, )][DBLP ] Robust Regression and Lasso. [Citation Graph (, )][DBLP ] Adaptive Bases for Reinforcement Learning. [Citation Graph (, )][DBLP ] Local Two-Stage Myopic Dynamics for Network Formation Games. [Citation Graph (, )][DBLP ] Network Formation: Bilateral Contracting and Myopic Dynamics. [Citation Graph (, )][DBLP ] Non-Cooperative Design of Translucent Networks. [Citation Graph (, )][DBLP ] A Relaxed Half-Stochastic Iterative Decoder for LDPC Codes. [Citation Graph (, )][DBLP ] Regularized Fitted Q-Iteration: Application to Planning. [Citation Graph (, )][DBLP ] Markov Decision Processes with Arbitrary Reward Processes. [Citation Graph (, )][DBLP ] Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. [Citation Graph (, )][DBLP ] Arbitrarily modulated Markov decision processes. [Citation Graph (, )][DBLP ] Risk sensitive robust support vector machines. [Citation Graph (, )][DBLP ] Parametric regret in uncertain Markov decision processes. [Citation Graph (, )][DBLP ] An Area-Efficient FPGA-Based Architecture for Fully-Parallel Stochastic LDPC Decoding. [Citation Graph (, )][DBLP ] Bidirectional interleavers for LDPC decoders using transmission gates. [Citation Graph (, )][DBLP ] Tracking Forecast Memories in stochastic decoders. [Citation Graph (, )][DBLP ] Efficiency Loss in a Network Resource Allocation Game: The Case of Elastic Supply [Citation Graph (, )][DBLP ] Robustness, Risk, and Regularization in Support Vector Machines [Citation Graph (, )][DBLP ] Robust Regression and Lasso [Citation Graph (, )][DBLP ] Learning from Multiple Outlooks [Citation Graph (, )][DBLP ] Adaptive Bases for Reinforcement Learning [Citation Graph (, )][DBLP ] Robustness and Generalization [Citation Graph (, )][DBLP ] Search in 0.805secs, Finished in 0.810secs