August 2019

2 articles

Testing robustness against unforeseen adversaries

OpenAI introduces a method for assessing whether a neural network classifier can reliably defend against adversarial attacks not seen during training. The resulting metric, UAR (Unforeseen Attack Robustness), measures a single model's robustness against an unanticipated attack and highlights the need to evaluate performance across a more diverse range of unforeseen attacks.

OpenAI · AI safety · Evals


OpenAI Blog·Aug 20

GPT-2: 6-month follow-up

OpenAI releases the 774M-parameter GPT-2 model, the third stage of a staged release that began with the 124M model in February and continued with the 355M model in May. The release also includes an open-source legal agreement to make model-sharing partnerships easier to set up, and a technical report on coordinating publication norms with the AI research community.

GPT · OpenAI · Open source
