The SCEAS System
Navigation Menu

Search the dblp DataBase

Title:
Author:

Shie Mannor: [Publications] [Author Rank by year] [Co-authors] [Prefers] [Cites] [Cited by]

Publications of Author

  1. Constantine Caramanis, Shie Mannor
    An Inequality for Nearly Log-Concave Distributions with Applications to Learning. [Citation Graph (0, 0)][DBLP]
    COLT, 2004, pp:534-548 [Conf]
  2. Eyal Even-Dar, Shie Mannor, Yishay Mansour
    PAC Bounds for Multi-armed Bandit and Markov Decision Processes. [Citation Graph (0, 0)][DBLP]
    COLT, 2002, pp:255-270 [Conf]
  3. Shie Mannor
    Reinforcement Learning for Average Reward Zero-Sum Games. [Citation Graph (0, 0)][DBLP]
    COLT, 2004, pp:49-63 [Conf]
  4. Shie Mannor, Ron Meir
    Geometric Bounds for Generalization in Boosting. [Citation Graph (0, 0)][DBLP]
    COLT/EuroCOLT, 2001, pp:461-472 [Conf]
  5. Shie Mannor, Ron Meir, Tong Zhang
    The Consistency of Greedy Algorithms for Classification. [Citation Graph (0, 0)][DBLP]
    COLT, 2002, pp:319-333 [Conf]
  6. Shie Mannor, Nahum Shimkin
    Adaptive Strategies and Regret Minimization in Arbitrarily Varying Markov Environments. [Citation Graph (0, 0)][DBLP]
    COLT/EuroCOLT, 2001, pp:128-142 [Conf]
  7. Shie Mannor, Nahum Shimkin
    On-Line Learning with Imperfect Monitoring. [Citation Graph (0, 0)][DBLP]
    COLT, 2003, pp:552-566 [Conf]
  8. Shie Mannor, Nahum Shimkin
    Online Learning with Variable Stage Duration. [Citation Graph (0, 0)][DBLP]
    COLT, 2006, pp:408-422 [Conf]
  9. Shie Mannor, John N. Tsitsiklis
    Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem. [Citation Graph (0, 0)][DBLP]
    COLT, 2003, pp:418-432 [Conf]
  10. Shie Mannor, John N. Tsitsiklis
    Online Learning with Constraints. [Citation Graph (0, 0)][DBLP]
    COLT, 2006, pp:529-543 [Conf]
  11. Yaakov Engel, Shie Mannor, Ron Meir
    Sparse Online Greedy Support Vector Regression. [Citation Graph (0, 0)][DBLP]
    ECML, 2002, pp:84-96 [Conf]
  12. Ishai Menache, Shie Mannor, Nahum Shimkin
    Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    ECML, 2002, pp:295-306 [Conf]
  13. Yaakov Engel, Shie Mannor
    Learning Embedded Maps of Markov Processes. [Citation Graph (0, 0)][DBLP]
    ICML, 2001, pp:138-145 [Conf]
  14. Yaakov Engel, Shie Mannor, Ron Meir
    Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning. [Citation Graph (0, 0)][DBLP]
    ICML, 2003, pp:154-161 [Conf]
  15. Yaakov Engel, Shie Mannor, Ron Meir
    Reinforcement learning with Gaussian processes. [Citation Graph (0, 0)][DBLP]
    ICML, 2005, pp:201-208 [Conf]
  16. Eyal Even-Dar, Shie Mannor, Yishay Mansour
    Action Elimination and Stopping Conditions for Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    ICML, 2003, pp:162-169 [Conf]
  17. Philipp W. Keller, Shie Mannor, Doina Precup
    Automatic basis function construction for approximate dynamic programming and reinforcement learning. [Citation Graph (0, 0)][DBLP]
    ICML, 2006, pp:449-456 [Conf]
  18. Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein
    Dynamic abstraction in reinforcement learning via clustering. [Citation Graph (0, 0)][DBLP]
    ICML, 2004, pp:- [Conf]
  19. Shie Mannor, Dori Peleg, Reuven Y. Rubinstein
    The cross entropy method for classification. [Citation Graph (0, 0)][DBLP]
    ICML, 2005, pp:561-568 [Conf]
  20. Shie Mannor, Reuven Y. Rubinstein, Yohai Gat
    The Cross Entropy Method for Fast Policy Search. [Citation Graph (0, 0)][DBLP]
    ICML, 2003, pp:512-519 [Conf]
  21. Shie Mannor, Duncan Simester, Peng Sun, John N. Tsitsiklis
    Bias and variance in value function estimation. [Citation Graph (0, 0)][DBLP]
    ICML, 2004, pp:- [Conf]
  22. Shie Mannor, Ron Meir
    Weak Learners and Improved Rates of Convergence in Boosting. [Citation Graph (0, 0)][DBLP]
    NIPS, 2000, pp:280-286 [Conf]
  23. Shie Mannor, Nahum Shimkin
    The Steering Approach for Multi-Criteria Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    NIPS, 2001, pp:1563-1570 [Conf]
  24. Ion Muslea, Virginia Dignum, Daniel D. Corkill, Catholijn M. Jonker, Frank Dignum, Silvia Coradeschi, Alessandro Saffiotti, Dan Fu, Jeff Orkin, William Cheetham, Kai Goebel, Piero P. Bonissone, Leen-Kiat Soh, Randolph M. Jones, Robert E. Wray III, Matthias Scheutz, Daniela Pucci de Farias, Shie Mannor, Georgios Theocharous, Doina Precup, Bamshad Mobasher, Sarabjot S. Anand, Bettina Berendt, Andreas Hotho, Hans W. Guesgen, Michael T. Rosenstein, Mohammad Ghavamzadeh
    The Workshop Program at the Nineteenth National Conference on Artificial Intelligence. [Citation Graph (0, 0)][DBLP]
    AI Magazine, 2005, v:26, n:1, pp:103-108 [Journal]
  25. Shie Mannor, Ron Meir, Tong Zhang
    Greedy Algorithms for Classification -- Consistency, Convergence Rates, and Adaptivity. [Citation Graph (0, 0)][DBLP]
    Journal of Machine Learning Research, 2003, v:4, n:, pp:713-741 [Journal]
  26. Shie Mannor, Nahum Shimkin
    A Geometric Approach to Multi-Criterion Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    Journal of Machine Learning Research, 2004, v:5, n:, pp:325-360 [Journal]
  27. Shie Mannor, John N. Tsitsiklis
    The Sample Complexity of Exploration in the Multi-Armed Bandit Problem. [Citation Graph (0, 0)][DBLP]
    Journal of Machine Learning Research, 2004, v:5, n:, pp:623-648 [Journal]
  28. Eyal Even-Dar, Shie Mannor, Yishay Mansour
    Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems. [Citation Graph (0, 0)][DBLP]
    Journal of Machine Learning Research, 2006, v:7, n:, pp:1079-1105 [Journal]
  29. Shie Mannor, Ron Meir
    On the Existence of Linear Weak Learners and Applications to Boosting. [Citation Graph (0, 0)][DBLP]
    Machine Learning, 2002, v:48, n:1-3, pp:219-251 [Journal]
  30. Shie Mannor, Nahum Shimkin
    The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes. [Citation Graph (0, 0)][DBLP]
    Math. Oper. Res., 2003, v:28, n:2, pp:327-345 [Journal]
  31. Constantine Caramanis, Shie Mannor
    An Inequality for Nearly Log-Concave Distributions With Applications to Learning. [Citation Graph (0, 0)][DBLP]
    IEEE Transactions on Information Theory, 2007, v:53, n:3, pp:1043-1057 [Journal]
  32. Chih-Han Yu, Shie Mannor, Georgios Theocharous, Avi Pfeffer
    User Model and Utility Based Power Management. [Citation Graph (0, 0)][DBLP]
    AAAI, 2007, pp:1918-1919 [Conf]
  33. Branislav Kveton, Prashant Gandhi, Georgios Theocharous, Shie Mannor, Barbara Rosario, Nilesh Shah
    Adaptive Timeout Policies for Fast Fine-Grained Power Management. [Citation Graph (0, 0)][DBLP]
    AAAI, 2007, pp:1795-1800 [Conf]
  34. Gábor Lugosi, Shie Mannor, Gilles Stoltz
    Strategies for Prediction Under Imperfect Monitoring. [Citation Graph (0, 0)][DBLP]
    COLT, 2007, pp:248-262 [Conf]
  35. Erick Delage, Shie Mannor
    Percentile optimization in uncertain Markov decision processes with application to efficient exploration. [Citation Graph (0, 0)][DBLP]
    ICML, 2007, pp:225-232 [Conf]
  36. Jia Yuan Yu, Shie Mannor
    Asymptotics of Efficiency Loss in Competitive Market Mechanisms. [Citation Graph (0, 0)][DBLP]
    INFOCOM, 2006, pp:- [Conf]
  37. Huan Xu, Shie Mannor
    The Robustness-Performance Tradeoff in Markov Decision Processes. [Citation Graph (0, 0)][DBLP]
    NIPS, 2006, pp:1537-1544 [Conf]
  38. Shie Mannor, Jeff S. Shamma
    Multi-agent learning for engineers. [Citation Graph (0, 0)][DBLP]
    Artif. Intell., 2007, v:171, n:7, pp:417-422 [Journal]
  39. Gábor Lugosi, Shie Mannor, Gilles Stoltz
    Strategies for prediction under imperfect monitoring [Citation Graph (0, 0)][DBLP]
    CoRR, 2007, v:0, n:, pp:- [Journal]
  40. Shie Mannor, Jeff S. Shamma, Gürdal Arslan
    Online calibrated forecasts: Memory efficiency versus universality for learning in games. [Citation Graph (0, 0)][DBLP]
    Machine Learning, 2007, v:67, n:1-2, pp:77-115 [Journal]

  41. Online Learning with Expert Advice and Finite-Horizon Constraints. [Citation Graph (, )][DBLP]


  42. Activity and Gait Recognition with Time-Delay Embeddings. [Citation Graph (, )][DBLP]


  43. Learning in the Limit with Adversarial Disturbances. [Citation Graph (, )][DBLP]


  44. Stochastic Decoding of LDPC Codes over GF(q). [Citation Graph (, )][DBLP]


  45. Resource Allocation with Supply Adjustment in Distributed Computing Systems. [Citation Graph (, )][DBLP]


  46. Reinforcement learning in the presence of rare events. [Citation Graph (, )][DBLP]


  47. Piecewise-stationary bandit problems with side observations. [Citation Graph (, )][DBLP]


  48. Survey of Stochastic Computation on Factor Graphs. [Citation Graph (, )][DBLP]


  49. Reinforcement Learning-Based Load Shared Sequential Routing. [Citation Graph (, )][DBLP]


  50. Regularized Policy Iteration. [Citation Graph (, )][DBLP]


  51. Robust Regression and Lasso. [Citation Graph (, )][DBLP]


  52. Adaptive Bases for Reinforcement Learning. [Citation Graph (, )][DBLP]


  53. Local Two-Stage Myopic Dynamics for Network Formation Games. [Citation Graph (, )][DBLP]


  54. Network Formation: Bilateral Contracting and Myopic Dynamics. [Citation Graph (, )][DBLP]


  55. Non-Cooperative Design of Translucent Networks. [Citation Graph (, )][DBLP]


  56. A Relaxed Half-Stochastic Iterative Decoder for LDPC Codes. [Citation Graph (, )][DBLP]


  57. Regularized Fitted Q-Iteration: Application to Planning. [Citation Graph (, )][DBLP]


  58. Markov Decision Processes with Arbitrary Reward Processes. [Citation Graph (, )][DBLP]


  59. Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. [Citation Graph (, )][DBLP]


  60. Arbitrarily modulated Markov decision processes. [Citation Graph (, )][DBLP]


  61. Risk sensitive robust support vector machines. [Citation Graph (, )][DBLP]


  62. Parametric regret in uncertain Markov decision processes. [Citation Graph (, )][DBLP]


  63. An Area-Efficient FPGA-Based Architecture for Fully-Parallel Stochastic LDPC Decoding. [Citation Graph (, )][DBLP]


  64. Bidirectional interleavers for LDPC decoders using transmission gates. [Citation Graph (, )][DBLP]


  65. Tracking Forecast Memories in stochastic decoders. [Citation Graph (, )][DBLP]


  66. Efficiency Loss in a Network Resource Allocation Game: The Case of Elastic Supply [Citation Graph (, )][DBLP]


  67. Robustness, Risk, and Regularization in Support Vector Machines [Citation Graph (, )][DBLP]


  68. Robust Regression and Lasso [Citation Graph (, )][DBLP]


  69. Learning from Multiple Outlooks [Citation Graph (, )][DBLP]


  70. Adaptive Bases for Reinforcement Learning [Citation Graph (, )][DBLP]


  71. Robustness and Generalization [Citation Graph (, )][DBLP]


Search in 0.008secs, Finished in 0.421secs
NOTICE1
System may not be available sometimes or not working properly, since it is still in development with continuous upgrades
NOTICE2
The rankings that are presented on this page should NOT be considered as formal since the citation info is incomplete in DBLP
 
System created by asidirop@csd.auth.gr [http://users.auth.gr/~asidirop/] © 2002
for Data Engineering Laboratory, Department of Informatics, Aristotle University © 2002