The newsletter for ai builders of all levels. Mini-tutorials, tool reviews, and lay of the land from an exited founder turned investor and forever tinkerer.
Hey folks,
I’ll be jetting off to Sardinia with my wife Sunday 🍕🍝☀️ (no kids! 🫢) - any recs, hit me up 🙏. On to AI land…
ChatGPT has a developer mode now, and you can use it too. Enable developer mode in Settings → Connectors → Advanced → Developer mode. This allows you to bring any custom MCP server with write actions into ChatGPT. For example, you can use Stripe’s MCP to get your account’s data or even create an invoice. Or use Vercel’s MCP server to take care of deployment.
Let’s talk about two new features: one each from Claude and Gemini, and the different ways both companies marketed their launch.
Claude can now create and edit files from spreadsheets and documents to PDFs and slide decks. It’s essentially a code interpreter, i.e. Claude can run code in the background to do these file creation/editing tasks. But normal people don’t care about the tech; they care about what the tool can do for you.
Now, in complete contrast, Gemini now supports adding audio files to your chats in the app. NO OTHER MAJOR AI APP HAS THIS, and they announced it as “papercut fixed” 🤦♂️ . They are doing well onboarding people with nano-banana hype, but the excitement for these (not so) little features also adds up. Who’s gonna tell them?
Replit released v3 of their Agent. Key upgrades: 1) It can go up to 200 minutes of working autonomously. 2) Agent tests your apps in the browser periodically, like clicking a button, trying to log in, etc. 3) In beta, but Replit Agent can build other agents and automations (powered by Mastra), not just web apps (like build an agent to ping me in Slack 20 minutes before every meeting with research info). Replit also has a design-only mode that builds the frontend and mockups for features (which is much faster) if you’re just prototyping. And it raised $250M at a $3B valuation.
Why is running an agent for 200 minutes a thing to celebrate? Isn’t that slow? TLDR; long agent runtimes mean deeper, more complex tasks get done. It’s about capability and autonomy.
Brightwave's state-of-the-art multi-agent research system is the most powerful synthesis engine on the planet. 10k+ documents, long-running asynchronous background agents, fine-grained context control and a flexible, intuitive UI that thinks like you do. API access available. Try Brightwave today.*
*sponsored
🌐 What I’m consuming
What not to do when monetising a newsletter from our (yes, Ben’s Bites’) firsthand experience.
Inside the Man vs Machine hackathon - 100+ participants, 6 final projects for a $12,500 top prize. Can you guess which ones used AI to build and which ones didn’t? (non-paywalled article)
How Factory builds agents that help across the entire software development life-cycle.
Shawn Wang (aka swyx)’s thesis for joining Cognition (which just raised at a $10B valuation)
20-minute crash course for AI SDK v5.
Defeating nondeterminism in LLM inference - blog by Thinking Machines (ex-OpenAI CTO, Mira Murati’s new company)
⚙️ Tools to tinker with
Oboe - Use AI to become smarter, not stupid. Course with long reads, audio lectures, quizzes and more.
Voice Remixing by ElevenLabs - Change any aspect of a voice (real or generated) like gender, age or accent.
Scheduled Runs in Julius - Schedule any analysis to be run with just a single click, and have the results delivered straight to your Slack or email.
Cofounder - AI agent that runs your business with you, remembers things, and knows everything about your business.
Google AI Edge Gallery - Official app from Google (on Play Store) to run a local model (gemma 3n) on your mobile. (repo)
Design systems in v0 - Define colour schemes and preview light/dark modes for your apps.
Napkin AI - Create diagrams/mindmaps that you can actually use from just prompts.
*sponsored
🥣 Dev dish
Fartscroll - Makes a fart noise as you open/close your macbook 😂
Modal Notebooks - Cloud-hosted GPU notebook with collaborative editing and GPU swaps.
Veo 3 and Veo 3 Fast are now generally available in the Gemini API. The models are roughly 50% cheaper now and support vertical videos and 1080p.
vt - the CLI for Val Town. Deploy software instantly as you develop it.
Chroma package search - Enable your AI agents to search the source code of your package dependencies.
Web fetch tool in Anthropic API - Pre-built tool to get data from any webpage with no extra infra or cost.
I haven’t tried this, but this demo looks really cool.
📊 Charts I saw this week
Simon Willison used ChatGPT to recreate the chart below using US census data.

Parallel (the new AI search company by ex-Twitter CEO Parag Agarwal) shared some Deep Research benchmark results, claiming their search is the best and cheapest.
🍦 Afters
Daniel from BB community is hosting Kieran (from Every) in Toronto to demo his workflow for shipping like a team of 5 but solo with Claude Code.
Oracle’s stock price jumped by ~35% yesterday, making Larry Ellison the richest man in the world. They revealed new revenue majority of which is coming from OpenAI, according to this scoop from WSJ.
That’s it for today. Feel free to comment and share your thoughts. 👋
📷 thumbnail creds: @keshavatearth