The SCEAS System

Rémi Munos

Publications of Author

  1. Rémi Munos
    Error Bounds for Approximate Value Iteration. [Citation Graph (0, 0)][DBLP]
    AAAI, 2005, pp:1006-1011 [Conf]
  2. Rémi Munos
    Geometric Variance Reduction in Markov Chains. Application to Value Function and Gradient Estimation. [Citation Graph (0, 0)][DBLP]
    AAAI, 2005, pp:1012-1017 [Conf]
  3. Rémi Munos
    Policy gradient in continuous time. [Citation Graph (0, 0)][DBLP]
    CAP, 2005, pp:201-216 [Conf]
  4. András Antos, Csaba Szepesvári, Rémi Munos
    Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path. [Citation Graph (0, 0)][DBLP]
    COLT, 2006, pp:574-588 [Conf]
  5. Rémi Munos
    Finite-Element Methods with Local Triangulation Refinement for Continuous Reinforcement Learning Problems. [Citation Graph (0, 0)][DBLP]
    ECML, 1997, pp:170-182 [Conf]
  6. Rémi Munos
    A General Convergence Method for Reinforcement Learning in the Continuous Case. [Citation Graph (0, 0)][DBLP]
    ECML, 1998, pp:394-405 [Conf]
  7. Rémi Munos
    Error Bounds for Approximate Policy Iteration. [Citation Graph (0, 0)][DBLP]
    ICML, 2003, pp:560-567 [Conf]
  8. Rémi Munos
    A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    ICML, 1996, pp:337-345 [Conf]
  9. Rémi Munos, Andrew W. Moore
    Rates of Convergence for Variable Resolution Schemes in Optimal Control. [Citation Graph (0, 0)][DBLP]
    ICML, 2000, pp:647-654 [Conf]
  10. Csaba Szepesvári, Rémi Munos
    Finite time bounds for sampling based fitted value iteration. [Citation Graph (0, 0)][DBLP]
    ICML, 2005, pp:880-887 [Conf]
  11. Rémi Munos
    A Convergent Reinforcement Learning Algorithm in the Continuous Case Based on a Finite Difference Method. [Citation Graph (0, 0)][DBLP]
    IJCAI (2), 1997, pp:826-831 [Conf]
  12. Rémi Munos, Andrew W. Moore
    Variable Resolution Discretization for High-Accuracy Solutions of Optimal Control Problems. [Citation Graph (0, 0)][DBLP]
    IJCAI, 1999, pp:1348-1355 [Conf]
  13. Rémi Munos
    Efficient Resources Allocation for Markov Decision Processes. [Citation Graph (0, 0)][DBLP]
    NIPS, 2001, pp:1571-1578 [Conf]
  14. Rémi Munos, Paul Bourgine
    Reinforcement Learning for Continuous Stochastic Control Problems. [Citation Graph (0, 0)][DBLP]
    NIPS, 1997, pp:- [Conf]
  15. Rémi Munos, Andrew W. Moore
    Barycentric Interpolators for Continuous Space and Time Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    NIPS, 1998, pp:1024-1030 [Conf]
  16. Rémi Munos
    Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation. [Citation Graph (0, 0)][DBLP]
    Journal of Machine Learning Research, 2006, v:7, n:, pp:413-427 [Journal]
  17. Rémi Munos
    Policy Gradient in Continuous Time. [Citation Graph (0, 0)][DBLP]
    Journal of Machine Learning Research, 2006, v:7, n:, pp:771-791 [Journal]
  18. Rémi Munos
    A Study of Reinforcement Learning in the Continuous Case by the Means of Viscosity Solutions. [Citation Graph (0, 0)][DBLP]
    Machine Learning, 2000, v:40, n:3, pp:265-299 [Journal]
  19. Rémi Munos, Andrew W. Moore
    Variable Resolution Discretization in Optimal Control. [Citation Graph (0, 0)][DBLP]
    Machine Learning, 2002, v:49, n:2-3, pp:291-323 [Journal]
  20. Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári
    Tuning Bandit Algorithms in Stochastic Environments. [Citation Graph (0, 0)][DBLP]
    ALT, 2007, pp:150-165 [Conf]
  21. Pierre-Arnaud Coquelin, Rémi Munos
    Bandit Algorithms for Tree Search [Citation Graph (0, 0)][DBLP]
    CoRR, 2007, v:0, n:, pp:- [Journal]
  22. Pure Exploration in Multi-armed Bandits Problems. [DBLP]
  23. Adaptive play in Texas Hold'em Poker. [DBLP]
  24. Workshop summary: On-line learning with limited feedback. [DBLP]
  25. Analysis of a Classification-based Policy Iteration Algorithm. [DBLP]
  26. Finite-Sample Analysis of LSTD. [DBLP]
  27. Fitted Q-iteration in continuous action-space MDPs. [DBLP]
  28. Algorithms for Infinitely Many-Armed Bandits. [DBLP]
  29. Particle Filter-based Policy Gradient in POMDPs. [DBLP]
  30. Online Optimization in X-Armed Bandits. [DBLP]
  31. Online Learning in Adversarial Lipschitz Environments. [DBLP]
  32. Optimistic Planning of Deterministic Systems. [DBLP]
  33. Pure Exploration for Multi-Armed Bandit Problems [DBLP]
  34. X-Armed Bandits [DBLP]


Note: The rankings presented on this page should not be considered formal, since the citation information in DBLP is incomplete.
 
System created by asidirop@csd.auth.gr [http://users.auth.gr/~asidirop/] © 2002
for Data Engineering Laboratory, Department of Informatics, Aristotle University © 2002