[ros-users] [Discourse.ros.org] [general] A toolkit for Reinforcement Learning using ROS and Gazebo

Ross Story ros.discourse at gmail.com
Thu Dec 8 02:29:16 UTC 2016




Sure, here are some references and reading material.

A benchmark that sadly doesn't include DQN, but does include TRPO and DDPG:
https://arxiv.org/pdf/1604.06778v3.pdf

DDPG:
https://arxiv.org/pdf/1509.02971v5.pdf

Asynchronous RL, showing improved performance of asynchronous actor-critic over asynchronous Q-learning:
https://arxiv.org/pdf/1602.01783v2.pdf

@spk921 Apologies, it's NAF (Normalized Advantage Functions), not FAN. It was designed for robotic manipulation and outperforms DDPG. Paper is here:
https://arxiv.org/pdf/1610.00633v1.pdf









