Both OpenAI and Google DeepMind announced gold-medal performances at the 2025 International Math Olympiad (IMO) this week, correctly answering five out of six questions in one of the world's most challenging high school math competitions. The achievement marks a significant leap from Google's silver-medal performance last year and represents the first time AI systems competed using "informal" methods (processing questions in natural language) rather than requiring human translation into machine-readable code. Continue Reading →
Meta scored a decisive victory in federal court this week. A lawsuit filed by thirteen authors – including Sarah Silverman, Richard Kadrey, and Christopher Golden – accusing Meta of copyright infringement was largely dismissed by U.S. District Judge Vince Chhabria. Continue Reading →
Yesterday at Anthropic’s first “Code with Claude” conference in San Francisco, the company introduced Claude Opus 4 and its companion, Claude Sonnet 4. The headline is clear: Opus 4 can pursue a complex coding task for about seven consecutive hours without losing context. That leap takes us from last year’s five-minute attention span to what feels like a full work shift at silicon speed. Continue Reading →
Claude
Most heavy LLM users will tell you that ChatGPT is the GOAT, but they prefer Claude for writing. Why wasn't Claude the GOAT? For people who use off-the-shelf chat interfaces, convenience always wins. ChatGPT had web access; Claude didn't. New day, new tech. Anthropic has finally given Claude what it's been missing: the ability to search the web. This may seem like a minor feature update, but it's not—it's huge. Continue Reading →
Anthropic announced Claude 3.7 Sonnet, its latest AI model they say is designed for practical use in business and development. The company describes it as a hybrid system, blending fast responses with detailed reasoning, adjustable for tasks like quick answers or complex problem-solving. Anthropic claims this makes it versatile, avoiding the need for separate models. They say it is particularly strong in coding, with a high success rate on real-world software tasks. Continue Reading →
Anthropic, the AI research company behind the Claude family of LLMs, has launched a public test of its new Constitutional Classifier, a system designed to block jailbreaks that circumvent content restrictions. The test follows an extensive internal bug bounty program, where 183 security researchers spent more than 3,000 hours attempting to bypass the system—with limited success. Continue Reading →