The SCEAS System
Navigation Menu

Search the dblp DataBase

Title:
Author:

Mohammad Ghavamzadeh: [Publications] [Author Rank by year] [Co-authors] [Prefers] [Cites] [Cited by]

Publications of Author

  1. Rajbala Makar, Sridhar Mahadevan, Mohammad Ghavamzadeh
    Hierarchical multi-agent reinforcement learning. [Citation Graph (0, 0)][DBLP]
    Agents, 2001, pp:246-253 [Conf]
  2. Mohammad Ghavamzadeh, Sridhar Mahadevan
    A multiagent reinforcement learning algorithm by dynamically merging markov decision processes. [Citation Graph (0, 0)][DBLP]
    AAMAS, 2002, pp:845-846 [Conf]
  3. Mohammad Ghavamzadeh, Sridhar Mahadevan
    Learning to Communicate and Act Using Hierarchical Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    AAMAS, 2004, pp:1114-1121 [Conf]
  4. Mohammad Ghavamzadeh, Sridhar Mahadevan
    Continuous-Time Hierarchical Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    ICML, 2001, pp:186-193 [Conf]
  5. Mohammad Ghavamzadeh, Sridhar Mahadevan
    Hierarchically Optimal Average Reward Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
    ICML, 2002, pp:195-202 [Conf]
  6. Mohammad Ghavamzadeh, Sridhar Mahadevan
    Hierarchical Policy Gradient Algorithms. [Citation Graph (0, 0)][DBLP]
    ICML, 2003, pp:226-233 [Conf]
  7. Mohammad Ghavamzadeh, Sridhar Mahadevan, Rajbala Makar
    Hierarchical multi-agent reinforcement learning. [Citation Graph (0, 0)][DBLP]
    Autonomous Agents and Multi-Agent Systems, 2006, v:13, n:2, pp:197-229 [Journal]
  8. Ion Muslea, Virginia Dignum, Daniel D. Corkill, Catholijn M. Jonker, Frank Dignum, Silvia Coradeschi, Alessandro Saffiotti, Dan Fu, Jeff Orkin, William Cheetham, Kai Goebel, Piero P. Bonissone, Leen-Kiat Soh, Randolph M. Jones, Robert E. Wray III, Matthias Scheutz, Daniela Pucci de Farias, Shie Mannor, Georgios Theocharous, Doina Precup, Bamshad Mobasher, Sarabjot S. Anand, Bettina Berendt, Andreas Hotho, Hans W. Guesgen, Michael T. Rosenstein, Mohammad Ghavamzadeh
    The Workshop Program at the Nineteenth National Conference on Artificial Intelligence. [Citation Graph (0, 0)][DBLP]
    AI Magazine, 2005, v:26, n:1, pp:103-108 [Journal]
  9. Mohammad Ghavamzadeh, Yaakov Engel
    Bayesian actor-critic algorithms. [Citation Graph (0, 0)][DBLP]
    ICML, 2007, pp:297-304 [Conf]
  10. Mohammad Ghavamzadeh, Yaakov Engel
    Bayesian Policy Gradient Algorithms. [Citation Graph (0, 0)][DBLP]
    NIPS, 2006, pp:457-464 [Conf]

  11. Bayesian Multi-Task Reinforcement Learning. [Citation Graph (, )][DBLP]


  12. Analysis of a Classification-based Policy Iteration Algorithm. [Citation Graph (, )][DBLP]


  13. Finite-Sample Analysis of LSTD. [Citation Graph (, )][DBLP]


  14. Incremental Natural Actor-Critic Algorithms. [Citation Graph (, )][DBLP]


  15. Regularized Policy Iteration. [Citation Graph (, )][DBLP]


  16. Regularized Fitted Q-Iteration: Application to Planning. [Citation Graph (, )][DBLP]


Search in 0.002secs, Finished in 0.002secs
NOTICE1
System may not be available sometimes or not working properly, since it is still in development with continuous upgrades
NOTICE2
The rankings that are presented on this page should NOT be considered as formal since the citation info is incomplete in DBLP
 
System created by asidirop@csd.auth.gr [http://users.auth.gr/~asidirop/] © 2002
for Data Engineering Laboratory, Department of Informatics, Aristotle University © 2002