3 min read
OpenAI's Chain-of-Thought Monitorability: Trust, But Verify (Especially the Trust Part)
OpenAI just published research on "chain-of-thought monitorability"—the ability to monitor AI models' internal reasoning to detect misbehavior before...