OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
reinforcement-learning
deep-learning
algorithms
openai-gym
policy
policy-gradient
machine-learning-engineering
trpo
proximal-policy-optimization
ppo
self-play
dota2-bot
openai-five
Updated Jun 20, 2018 - Python