Wild times

Posted on January 5, 2026

From “Measuring AI Ability to Complete Long Tasks” via Model Evaluation & Threat Research (METR)