SCEAS

The SCEAS System

Search the dblp DataBase

Title:
Author:

Andrew G. Barto: [Publications] [Author Rank by year] [Co-authors] [Prefers] [Cites] [Cited by]

Publications of Author

Alicia P. Wolfe, Andrew G. Barto
Decision Tree Methods for Finding Reusable MDP Homomorphisms. [Citation Graph (0, 0)][DBLP]
AAAI, 2006, pp:- [Conf]
Richard C. Yee, Sharad Saxena, Paul E. Utgoff, Andrew G. Barto
Explaining Temporal Differences to Create Useful Concepts for Evaluating States. [Citation Graph (0, 0)][DBLP]
AAAI, 1990, pp:882-888 [Conf]
Anders Jonsson, Andrew G. Barto
A causal approach to hierarchical decomposition of factored MDPs. [Citation Graph (0, 0)][DBLP]
ICML, 2005, pp:401-408 [Conf]
George Konidaris, Andrew G. Barto
Autonomous shaping: knowledge transfer in reinforcement learning. [Citation Graph (0, 0)][DBLP]
ICML, 2006, pp:489-496 [Conf]
Amy McGovern, Andrew G. Barto
Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density. [Citation Graph (0, 0)][DBLP]
ICML, 2001, pp:361-368 [Conf]
Robert Moll, Theodore J. Perkins, Andrew G. Barto
Machine Learning for Subproblem Selection. [Citation Graph (0, 0)][DBLP]
ICML, 2000, pp:615-622 [Conf]
Theodore J. Perkins, Andrew G. Barto
Lyapunov-Constrained Action Sets for Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
ICML, 2001, pp:409-416 [Conf]
Marc Pickett, Andrew G. Barto
PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
ICML, 2002, pp:506-513 [Conf]
Jette Randløv, Andrew G. Barto, Michael T. Rosenstein
Combining Reinforcement Learning with a Local Control Algorithm. [Citation Graph (0, 0)][DBLP]
ICML, 2000, pp:775-782 [Conf]
Balaraman Ravindran, Andrew G. Barto
Relativized Options: Choosing the Right Transformation. [Citation Graph (0, 0)][DBLP]
ICML, 2003, pp:608-615 [Conf]
Özgür Simsek, Alicia P. Wolfe, Andrew G. Barto
Identifying useful subgoals in reinforcement learning by local graph partitioning. [Citation Graph (0, 0)][DBLP]
ICML, 2005, pp:816-823 [Conf]
Özgür Simsek, Andrew G. Barto
Using relative novelty to identify useful temporal abstractions in reinforcement learning. [Citation Graph (0, 0)][DBLP]
ICML, 2004, pp:- [Conf]
Özgür Simsek, Andrew G. Barto
An intrinsic reward mechanism for efficient exploration. [Citation Graph (0, 0)][DBLP]
ICML, 2006, pp:833-840 [Conf]
Vijaykumar Gullapalli, Andrew G. Barto, Roderic A. Grupen
Learning Admittance Mappings for Force-Guided Assembly. [Citation Graph (0, 0)][DBLP]
ICRA, 1994, pp:2633-2638 [Conf]
Theodore J. Perkins, Andrew G. Barto
Heuristic Search in Infinite State Spaces Guided by Lyapunov Analysis. [Citation Graph (0, 0)][DBLP]
IJCAI, 2001, pp:242-247 [Conf]
Balaraman Ravindran, Andrew G. Barto
SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes. [Citation Graph (0, 0)][DBLP]
IJCAI, 2003, pp:1011-1018 [Conf]
Michael T. Rosenstein, Andrew G. Barto
Robot Weightlifting By Direct Policy Search. [Citation Graph (0, 0)][DBLP]
IJCAI, 2001, pp:839-846 [Conf]
Oliver G. Selfridge, Richard S. Sutton, Andrew G. Barto
Training and Tracking in Robotics. [Citation Graph (0, 0)][DBLP]
IJCAI, 1985, pp:670-672 [Conf]
Balaraman Ravindran, Andrew G. Barto, Vimal Mathew
Deictic Option Schemas. [Citation Graph (0, 0)][DBLP]
IJCAI, 2007, pp:1023-1028 [Conf]
George Konidaris, Andrew G. Barto
Building Portable Options: Skill Transfer in Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
IJCAI, 2007, pp:895-900 [Conf]
Kimberly Ferguson, Ivon Arroyo, Sridhar Mahadevan, Beverly Park Woolf, Andrew G. Barto
Improving Intelligent Tutoring Systems: Using Expectation Maximization to Learn Student Skill Levels. [Citation Graph (0, 0)][DBLP]
Intelligent Tutoring Systems, 2006, pp:453-462 [Conf]
N. E. Berthier, Satinder P. Singh, Andrew G. Barto, James C. Houk
A Cortico-Cerebellar Model that Learns to Generate Distributed Motor Commands to Control a Kinematic Arm. [Citation Graph (0, 0)][DBLP]
NIPS, 1991, pp:611-618 [Conf]
Andrew G. Barto, Michael O. Duff
Monte Carlo Matrix Inversion and Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
NIPS, 1993, pp:687-694 [Conf]
Andrew G. Barto, James C. Houk
A Predictive Switching Model of Cerebellar Movement Control. [Citation Graph (0, 0)][DBLP]
NIPS, 1995, pp:138-144 [Conf]
Andrew G. Barto, Richard S. Sutton, Christopher J. C. H. Watkins
Sequential Decision Probelms and Neural Networks. [Citation Graph (0, 0)][DBLP]
NIPS, 1989, pp:686-693 [Conf]
Robert H. Crites, Andrew G. Barto
An Actor/Critic Algorithm that is Equivalent to Q-Learning. [Citation Graph (0, 0)][DBLP]
NIPS, 1994, pp:401-408 [Conf]
Robert H. Crites, Andrew G. Barto
Improving Elevator Performance Using Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
NIPS, 1995, pp:1017-1023 [Conf]
Michael O. Duff, Andrew G. Barto
Local Bandit Approximation for Optimal Learning Problems. [Citation Graph (0, 0)][DBLP]
NIPS, 1996, pp:1019-1025 [Conf]
Vijaykumar Gullapalli, Andrew G. Barto
Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms. [Citation Graph (0, 0)][DBLP]
NIPS, 1993, pp:695-702 [Conf]
Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstein
Reinforcement Learning for Mixed Open-loop and Closed-loop Control. [Citation Graph (0, 0)][DBLP]
NIPS, 1996, pp:1026-1032 [Conf]
Anders Jonsson, Andrew G. Barto
Automated State Abstraction for Options using the U-Tree Algorithm. [Citation Graph (0, 0)][DBLP]
NIPS, 2000, pp:1054-1060 [Conf]
Michael Kositsky, Andrew G. Barto
The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay. [Citation Graph (0, 0)][DBLP]
NIPS, 2001, pp:43-50 [Conf]
Robert Moll, Andrew G. Barto, Theodore J. Perkins, Richard S. Sutton
Learning Instance-Independent Value Functions to Enhance Local Search. [Citation Graph (0, 0)][DBLP]
NIPS, 1998, pp:1017-1023 [Conf]
Jeffrey F. Monaco, David G. Ward, Andrew G. Barto
Automated Aircraft Recovery via Reinforcement Learning: Initial Experiments. [Citation Graph (0, 0)][DBLP]
NIPS, 1997, pp:- [Conf]
Ron Papka, James P. Callan, Andrew G. Barto
Text-Based Information Retrieval Using Exponentiated Gradient Descent. [Citation Graph (0, 0)][DBLP]
NIPS, 1996, pp:3-9 [Conf]
Satinder P. Singh, Andrew G. Barto, Nuttapong Chentanez
Intrinsically Motivated Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
NIPS, 2004, pp:- [Conf]
Satinder P. Singh, Andrew G. Barto, Roderic A. Grupen, Christopher I. Connolly
Robust Reinforcement Learning in Motion Planning. [Citation Graph (0, 0)][DBLP]
NIPS, 1993, pp:655-662 [Conf]
George Konidaris, Andrew G. Barto
An Adaptive Robot Motivational System. [Citation Graph (0, 0)][DBLP]
SAB, 2006, pp:346-356 [Conf]
Balaraman Ravindran, Andrew G. Barto
Model Minimization in Hierarchical Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
SARA, 2002, pp:196-211 [Conf]
Özgür Simsek, Andrew G. Barto
Learning Skills in Reinforcement Learning Using Relative Novelty. [Citation Graph (0, 0)][DBLP]
SARA, 2005, pp:367-374 [Conf]
Robert A. Jacobs, Michael I. Jordan, Andrew G. Barto
Task Decompostiion Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks. [Citation Graph (0, 0)][DBLP]
Machine Learning: From Theory to Applications, 1993, pp:175-202 [Conf]
Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh
Learning to Act Using Real-Time Dynamic Programming. [Citation Graph (0, 0)][DBLP]
Artif. Intell., 1995, v:72, n:1-2, pp:81-138 [Journal]
Robert A. Jacobs, Michael I. Jordan, Andrew G. Barto
Task Decomposition Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks. [Citation Graph (0, 0)][DBLP]
Cognitive Science, 1991, v:15, n:2, pp:219-250 [Journal]
Michael Kositsky, Andrew G. Barto
The emergence of movement units through learning with noisy efferent signals and delayed sensory feedback. [Citation Graph (0, 0)][DBLP]
Neurocomputing, 2002, v:44, n:, pp:889-895 [Journal]
Andrew G. Barto
A Note on Pattern Reproduction in Tessellation Structures. [Citation Graph (0, 0)][DBLP]
J. Comput. Syst. Sci., 1978, v:16, n:3, pp:445-455 [Journal]
Theodore J. Perkins, Andrew G. Barto
Lyapunov Design for Safe Reinforcement Learning. [Citation Graph (0, 0)][DBLP]
Journal of Machine Learning Research, 2002, v:3, n:, pp:803-832 [Journal]
Anders Jonsson, Andrew G. Barto
Causal Graph Based Decomposition of Factored MDPs. [Citation Graph (0, 0)][DBLP]
Journal of Machine Learning Research, 2006, v:7, n:, pp:2259-2301 [Journal]
Steven J. Bradtke, Andrew G. Barto
Linear Least-Squares Algorithms for Temporal Difference Learning. [Citation Graph (0, 0)][DBLP]
Machine Learning, 1996, v:22, n:1-3, pp:33-57 [Journal]
Robert H. Crites, Andrew G. Barto
Elevator Group Control Using Multiple Reinforcement Learning Agents. [Citation Graph (0, 0)][DBLP]
Machine Learning, 1998, v:33, n:2-3, pp:235-262 [Journal]
Amy McGovern, J. Eliot B. Moss, Andrew G. Barto
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts. [Citation Graph (0, 0)][DBLP]
Machine Learning, 2002, v:49, n:2-3, pp:141-160 [Journal]
Andrew G. Barto, Andrew H. Fagg, Nathan Sitkoff, James C. Houk
A Cerebellar Model of Timing and Prediction in the Control of Reaching. [Citation Graph (0, 0)][DBLP]
Neural Computation, 1999, v:11, n:3, pp:565-594 [Journal]
Michael T. Rosenstein, Andrew G. Barto, Richard E. A. Van Emmerik
Learning at the level of synergies for a robot weightlifter. [Citation Graph (0, 0)][DBLP]
Robotics and Autonomous Systems, 2006, v:54, n:8, pp:706-717 [Journal]
Anders Jonsson, Andrew G. Barto
Active Learning of Dynamic Bayesian Networks in Markov Decision Processes. [Citation Graph (0, 0)][DBLP]
SARA, 2007, pp:273-284 [Conf]
Repairing Disengagement With Non-Invasive Interventions. [Citation Graph (, )][DBLP]
Efficient Skill Learning using Abstraction Selection. [Citation Graph (, )][DBLP]
Skill Characterization Based on Betweenness. [Citation Graph (, )][DBLP]
Adaptive Control of Duty Cycling in Energy-Harvesting Wireless Sensor Networks. [Citation Graph (, )][DBLP]
Linear systems analysis of the relationship between firing of deep cerebellar neurons and the classically conditioned nictitating membrane response in rabbits. [Citation Graph (, )][DBLP]

Search in 0.112secs, Finished in 0.113secs

NOTICE1
System may not be available sometimes or not working properly, since it is still in development with continuous upgrades

NOTICE2
The rankings that are presented on this page should NOT be considered as formal since the citation info is incomplete in DBLP