This has already been a crazy week in the world of AI. The Wall Street Journal reported that OpenAI missed its monthly revenue targets multiple times this year. The company also missed its internal goal of one billion weekly active ChatGPT users by year-end 2025, and OpenAI CFO Sarah Friar reportedly told colleagues she’s worried Continue Reading →
Anthropic released Claude Opus 4.7 on Wednesday with impressive numbers: 10.9 percentage points higher on SWE-bench Pro (the gold-standard coding test), 3x more production tasks resolved on Rakuten’s benchmark, 98.5% on visual acuity up from 54.5%, and state-of-the-art scores on finance evaluations. For devs, this is a genuine step forward. For consumers, the story is a bit different. Continue Reading →

Managed Agents Are Here

Anthropic launched Claude Managed Agents in public beta at eight cents per agent runtime hour plus model usage fees. Developers get sandboxed containers, authentication, checkpointing, error recovery, session persistence, and end-to-end execution tracing: every piece of infrastructure that separates a demo from a production deployment, available as a set of composable APIs. Continue Reading →

Claude Code Auto Mode

Anthropic just shipped "auto mode" for Claude Code. According to the company, users approve 93% of permission prompts without reading them. The guardrails had become a rubber stamp, and everyone knew it. Continue Reading →
OpenAI CEO Sam Altman told employees Tuesday they don't get to make "operational decisions" about how the military uses their AI technology: "Maybe you think the Iran strike was good and the Venezuela invasion was bad," Altman said in an all-hands meeting. "You don't get to weigh in on that." Continue Reading →