Harukazu Igarashi. Path Planning of a Mobile Robot as a Discrete Optimization Problem and Adjustment of Weight Parameters in the Objective Function by Reinforcement Learning. RoboCup, 2000, pp:315-320 [Conf]
Harukazu Igarashi, Kiyoshi Ioi. Navigation of a mobile robot formulated in terms of discrete optimization problems. Systems and Computers in Japan, 2003, v:34, n:6, pp:59-68 [Journal]
Seiji Ishihara, Harukazu Igarashi. Applying the policy gradient method to behavior learning in multiagent systems: The pursuit problem. Systems and Computers in Japan, 2006, v:37, n:10, pp:101-109 [Journal]
Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks.
Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies.