Good work! Liked the moving obstacles scenario, specially. @Roser, if I'm not mistaken, the simulation has been (at least partially) inspired by https://github.com/erlerobot/gym-gazebo environments. Is that right @Gilbert? A very cool project would be to extend this and try a few policy gradient methods. You could even go ahead an compare it with the value iteration one you just tried (DQN). I did a while ago [a tutorial](https://github.com/vmayoral/basic_reinforcement_learning/blob/master/tutorial14/README.md) comparing different methods for a simple environment but yours is indeed much cooler. --- [Visit Topic](https://discourse.ros.org/t/tb3-reinforcement-learning-with-tb3/4842/4) or reply to this email to respond. If you do not want to receive messages from ros-users please use the unsubscribe link below. If you use the one above, you will stop all of ros-users from receiving updates. ______________________________________________________________________________ ros-users mailing list ros-users@lists.ros.org http://lists.ros.org/mailman/listinfo/ros-users Unsubscribe: