Opus 4.8
NVIDIA and Microsoft birthed a new computer
Hey folks,
I’m spending as much time as possible off Twitter, and any other general doomscroll distractions. I’ve got work to do (the course/manual/whatever it’ll be called).
Aiming to get at least the preview ‘lessons’ out this month.
I’m not going to force any wisdom out in today’s intro because it’s not my style.
On to the condensed version of the AI madness…
Ben’s Bites is brought to you by Smallest AI
Pulse is world’s fastest speech-to-text model (#1 on Sierra’s μ-Bench for P95 latency) with top performing accuracy (under 5% WER on Artificial Analysis leaderboard), works across 39 languages and 100+ accents.
Get $25 free credits, use code BYTE25-N3CX3UKV (valid till 6/6)
Headlines
Claude Opus 4.8 is out, with dynamic workflows in Claude Code. Claude now writes an orchestration script, then spins up subagents in parallel to work through complex tasks.
Dex’s take: this doesn’t prove loose multi-agent systems work. Deterministic workflows around small agent loops are more reliable.
Claude Opus 4.8 - Simon Willison calls it a modest but useful upgrade, mostly because it’s more honest about uncertainty and less likely to miss flaws in its own code. Every’s vibe check is more bullish: they found it a big jump from 4.7, strong at coding/writing/knowledge work, and competitive with GPT-5.5 on their internal senior-engineer benchmark. The catch is the harness: the model is back, but Claude’s app still feels messier than Codex.
Scores top on ARC-AGI-3, tripling 5.5’s score
Datacurve’s new benchmark places it below gpt 5.5 and only marginally better than 5.4. Using a lot more tokens…$$$.
Anthropic filed a confidential S-1 and raised a round $65B Series H at $965B post-money. IPO this year?
NVIDIA and Microsoft are reinventing Windows PCs for personal AI agents — RTX Spark is a 1-petaflop Windows superchip with up to 128GB unified memory, full CUDA/RTX support, local 120B model support, and new Windows agent security primitives + NVIDIA OpenShell. Microsoft’s Surface Laptop Ultra is the flagship device.
Stacker's AI Accelerator is offering $500k in inference credits to businesses ready to go AI-first. Selected companies get credits and hands-on mentoring to deploy AI agents across their operations. Applications close June 9th. Apply now.*
My feed
Stacker is an AI coworker that joins the dots in your business. Stacker lives in Slack, connects your tools, and does the work.*
ChatGPT has a new table of contents UI for long chats, and a full-screen long-form writing mode that can save drafts to your Library.
Codex got computer use + mobile remote control on Windows and a Python SDK.
Replit added a Canvas to create variants, annotate, compare design directions and apply changes back into your app.
Figma Make now works on your local code - visually edit the app, annotate, chat and open PRs.
Linear’s new product Diffs adds PR review into Linear, with guided reviews and agent iteration.
SkillSpector - a new security scanner for skills by NVIDIA
Google AI Studio now lets you build apps that connect to Gmail, Drive, Sheets and more without jumping through other Google Cloud screens.
Agent Cookie - syncs cookies, CLI tokens and API keys from your laptop to a Mac mini running OpenClaw/Hermes.
Agent Handler for Employees: secure AI access for every employee.
Impeccable 3.5 - design skill for coding agents with model-specific anti-pattern rules.
Four tips to help agents understand your codebase.
Sandboxes are becoming the OS for agents.
30 mins epsiode on /goal and how to use it in Codex.
Building Grok Imagine in 3 months and Video Agents.
The most rational take on AI you’ll hear this year
Afters
Read about me and Ben’s Bites
📷 thumbnail by @keshavatearth
* sponsors who make this newsletter possible :)
Wanna partner with us for the next quarter?
Email us at shanice@bensbites.com or k@bensbites.com








