Estimating worst case frontier risks of open weight LLMs
Signal
72
Hype
25
In three linesOpenAI studies worst-case frontier risks of releasing open-weight models through malicious fine-tuning (MFT) on gpt-oss. Experiment tests maximum capabilities after adversarial fine-tuning in biology and cybersecurity domains. Risk boundary assessment for open-source LLMs.Read source
Your take?
Summary generated by Claude — human-verified