Safety Gym
OpenAI releases Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints during training.
3 articles
OpenAI releases Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints during training.
OpenAI releases a benchmark for evaluating safe exploration in deep reinforcement learning. The study measures agents' ability to explore efficiently while respecting safety constraints, a key criterion for real-world applications.
OpenAI releases the 1.5B parameter version of GPT-2 with code and model weights, completing its staged release plan. Goal: test a responsible publication process and provide detection tools for GPT-2 outputs.