Stay up to date with our latest posts.
What changed in the API, what failed during agent testing, and how to design tool-using systems that stay inside the rails.
Feb 11, 2026
Anthropic's randomized trial with 52 developers. But some interaction patterns beat hand-coding.
Feb 9, 2026
GPT-5.3-Codex debugged its own training. It also triggered OpenAI's first "High" cybersecurity rating.
Feb 7, 2026
Anthropic's new flagship outperforms GPT-5.2 by 144 Elo on knowledge work. Here's the technical breakdown.
Feb 6, 2026
285 billion in value erased. Thomson Reuters hit its largest single-day drop ever. Here is the full technical breakdown.
Feb 5, 2026