OpenAI researchers want to predict how often AI models will fail before launch
Problem — This paper addresses the inadequacies in existing safety testing methodologies for AI models, particularly the inability to predict failure rates after deployment. The authors highlight that traditional evaluation...