GPT-5.4 Debuts

OpenAI just released GPT-5.4, which adds native computer control to its most capable reasoning model. The system can operate desktops through screenshots and mouse commands, achieving 75% accuracy on OSWorld desktop tasks compared to 72% for humans. This is OpenAI's first general-purpose model with built-in computer use capabilities.
OpenAI CEO Sam Altman told employees Tuesday they don't get to make "operational decisions" about how the military uses their AI technology: "Maybe you think the Iran strike was good and the Venezuela invasion was bad," Altman said in an all-hands meeting. "You don't get to weigh in on that."
The U.S. Supreme Court declined to hear Stephen Thaler's appeal over whether AI-generated art can receive copyright protection. Thaler applied for a copyright in 2018 covering a visual work his AI system DABUS created autonomously. The Copyright Office rejected it. A federal judge upheld that decision. The D.C. Circuit affirmed it. Now, the Supreme Court has let the ruling stand. The legal chain is complete: no human author, no copyright.
Google yesterday launched Nano Banana 2, the consumer brand for Gemini 3.1 Flash Image. The upgrade includes sub-second 4K image synthesis across multiple aspect ratios, character consistency for up to five characters, fidelity for up to 14 objects in a single workflow, and precise text rendering accurate enough for marketing mockups and greeting cards.
Burger King is rolling out an AI platform called "BK Assistant" with a voice assistant named Patty. Patty takes drive-thru orders, monitors restaurant operations, and notifies managers when equipment needs maintenance or products run low. Every U.S. Burger King will have one by the end of 2026. Sounds reasonable, except…

How Much Does An LLM Remember?

Stanford and Yale researchers extracted 95.8% of a copyrighted novel, word for word, from Claude 3.7 Sonnet. Gemini 2.5 Pro gave up 76.8% of Harry Potter without even requiring a jailbreak. Grok 3 handed over 70.3%. GPT-4.1 was the most resistant at 4.0%, but it still coughed up text after enough attempts. Thirteen books were tested. The words came out.

Google’s Free Photo Studio

Google Labs just launched Pomelli Photoshoot, a free tool that turns any product photo into a professional studio or lifestyle shot. Pick a product image, choose a template, generate, and refine. The tool applies your brand's visual identity (what Google calls "Business DNA") to keep everything on-brand across campaigns. The target audience is small and medium-sized businesses that cannot afford professional product photography. I tested it. It works as described.
Google DeepMind just launched Lyria 3 in the Gemini app. Type a text prompt (or upload a photo) and you will get a 30-second track with auto-generated lyrics, vocals, and custom cover art. The model is available in eight languages to anyone 18+. YouTube creators worldwide can also access it through Dream Track for Shorts soundtracks. Every track carries a SynthID watermark.
An early preview of WebMCP, a standard co-authored by Google and Microsoft that makes it easy for agents to navigate websites, is now available in Chrome 146. The feature was added so quietly last week that many people missed it.