Good work! Liked the moving obstacles scenario, specially.

@Roser, if I'm not mistaken, the simulation has been (at least partially) inspired by https://github.com/erlerobot/gym-gazebo environments. Is that right @Gilbert?   

A very cool project would be to extend this and try a few policy gradient methods. You could even go ahead an compare it with the value iteration one you just tried (DQN). I did a while ago [a tutorial](https://github.com/vmayoral/basic_reinforcement_learning/blob/master/tutorial14/README.md) comparing different methods for a simple environment but yours is indeed much cooler.


---
[Visit Topic](https://discourse.ros.org/t/tb3-reinforcement-learning-with-tb3/4842/4) or reply to this email to respond.


If you do not want to receive messages from ros-users please use the unsubscribe link below. If you use the one above, you will stop all of ros-users from receiving updates.
______________________________________________________________________________
ros-users mailing list
ros-users@lists.ros.org
http://lists.ros.org/mailman/listinfo/ros-users
Unsubscribe: <http://lists.ros.org/mailman//options/ros-users>