Implementation of the Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO) algorithms, applied to a robot walking task.
reinforcement-learning robotics motion deep-reinforcement-learning openai-gym pytorch reinforcement-learning-algorithms trpo robotics-simulation pybullet reinforcement-learning-analysis gym-environment ppo reinforcement-learning-agent gym-environments reinforcement-learning-environments robot-walking pybullet-environments pybullet-physics
Updated Mar 9, 2021 - Python