The SCEAS System
Navigation Menu

Conferences in DBLP

(ewrl)
2008 (conf/ewrl/2008)


  1. Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees. [Citation Graph (, )][DBLP]


  2. Exploiting Additive Structure in Factored MDPs for Reinforcement Learning. [Citation Graph (, )][DBLP]


  3. Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration. [Citation Graph (, )][DBLP]


  4. Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. [Citation Graph (, )][DBLP]


  5. Regularized Fitted Q-Iteration: Application to Planning. [Citation Graph (, )][DBLP]


  6. A Near Optimal Policy for Channel Allocation in Cognitive Radio. [Citation Graph (, )][DBLP]


  7. Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets. [Citation Graph (, )][DBLP]


  8. Bayesian Reward Filtering. [Citation Graph (, )][DBLP]


  9. Basis Expansion in Natural Actor Critic Methods. [Citation Graph (, )][DBLP]


  10. Reinforcement Learning with the Use of Costly Features. [Citation Graph (, )][DBLP]


  11. Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem. [Citation Graph (, )][DBLP]


  12. Optimistic Planning of Deterministic Systems. [Citation Graph (, )][DBLP]


  13. Policy Iteration for Learning an Exercise Policy for American Options. [Citation Graph (, )][DBLP]


  14. Tile Coding Based on Hyperplane Tiles. [Citation Graph (, )][DBLP]


  15. Use of Reinforcement Learning in Two Real Applications. [Citation Graph (, )][DBLP]


  16. Applications of Reinforcement Learning to Structured Prediction. [Citation Graph (, )][DBLP]


  17. Policy Learning - A Unified Perspective with Applications in Robotics. [Citation Graph (, )][DBLP]


  18. Probabilistic Inference for Fast Learning in Control. [Citation Graph (, )][DBLP]


  19. United We Stand: Population Based Methods for Solving Unknown POMDPs. [Citation Graph (, )][DBLP]


  20. New Error Bounds for Approximations from Projected Linear Equations. [Citation Graph (, )][DBLP]


  21. Markov Decision Processes with Arbitrary Reward Processes. [Citation Graph (, )][DBLP]

NOTICE1
System may not be available sometimes or not working properly, since it is still in development with continuous upgrades
NOTICE2
The rankings that are presented on this page should NOT be considered as formal since the citation info is incomplete in DBLP
 
System created by asidirop@csd.auth.gr [http://users.auth.gr/~asidirop/] © 2002
for Data Engineering Laboratory, Department of Informatics, Aristotle University © 2002