Bruno Scherrer Parallel asynchronous distributed computations of optimal control in large state space Markov Decision processes. [Citation Graph (0, 0)][DBLP] ESANN, 2003, pp:325-330 [Conf]
Bruno Scherrer Asynchronous neurocomputing for optimal control and reinforcement learning with large state spaces. [Citation Graph (0, 0)][DBLP] Neurocomputing, 2005, v:63, n:, pp:229-251 [Journal]
Convergence and rate of convergence of a simple ant model. [Citation Graph (, )][DBLP]
Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view. [Citation Graph (, )][DBLP]
Least-Squares Policy Iteration: Bias-Variance Trade-off in Control Problems. [Citation Graph (, )][DBLP]
Biasing Approximate Dynamic Programming with a Lower Discount Factor. [Citation Graph (, )][DBLP]
Convergence and rate of convergence of a foraging ant model. [Citation Graph (, )][DBLP]
Embedded Harmonic Control for Trajectory Planning in Large Environments. [Citation Graph (, )][DBLP]
Performance Bounds for Lambda Policy Iteration [Citation Graph (, )][DBLP]
Search in 0.001secs, Finished in 0.002secs
NOTICE1
System may not be available sometimes or not working properly, since it is still in development with continuous upgrades
NOTICE2
The rankings that are presented on this page should NOT be considered as formal since the citation info is incomplete in DBLP