I work on robust and aligned AI at Anthropic.
Previously at @openai and @brain-research
- Anthropic
- San Francisco
- anthropic.com
- @NotTomBrown
Pinned
- rl-teacher (Public)
  Code for Deep RL from Human Preferences [Christiano et al.]. Plus a webapp for collecting human feedback.
- openphilanthropy/unrestricted-adversarial-examples (Public)
  Contest Proposal and infrastructure for the Unrestricted Adversarial Examples Challenge
- cleverhans-lab/cleverhans (Public)
  An adversarial example library for constructing attacks, building defenses, and benchmarking both
- openai/gym (Public)
  A toolkit for developing and comparing reinforcement learning algorithms.
- openai/baselines (Public)
  OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
- openai/universe (Public archive)
  Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.