Resources for testing dangerous autonomous capabilities in frontier models| METR’s Autonomy Evaluation Resources
If you are not redirected automatically, please click the link above.| metr.github.io