We evaluate whether GPT-5 poses significant catastrophic risks via AI self-improvement, rogue replication, or sabotage of AI labs. We conclude that this seems unlikely. However, capability trends continue rapidly, and models display increasing eval awareness.| METR’s Autonomy Evaluation Resources
Resources for testing dangerous autonomous capabilities in frontier models| METR’s Autonomy Evaluation Resources
Resources for testing dangerous autonomous capabilities in frontier models| METR’s Autonomy Evaluation Resources
Resources for testing dangerous autonomous capabilities in frontier models| METR’s Autonomy Evaluation Resources
Resources for testing dangerous autonomous capabilities in frontier models| METR’s Autonomy Evaluation Resources