Cheap intelligence, expensive AI

3 years in, ChatGPT is playing catchup to Google

Dec 18, 2025

The newsletter for the technically curious. Updates, tool reviews, and lay of the land from an exited founder turned investor and forever tinkerer.

Hey folks,

Gemini 3 Flash is out. It’s a good enough model to now shift any use-case that was running fine on the previous generation of “expensive models” like 2.5 Pro, and get a boost in performance while reducing your costs. For example, it scores 78% on SWE Bench Verified, higher than Sonnet 4.5 and even 3 Pro.

Its pricing, though, again highlights the trend for 2025 - “Intelligence is getting cheaper, but AI is becoming more expensive than ever.” Gemini 3 Flash and other recent models (3 Pro, GPT 5.2) have had a significant pricing jump from the baseline established earlier this year. Combine that with an increasing number of $200/mo plans, “the best AI you can get” keeps costing you more, even if the cost of intelligence keeps going down.

ChatGPT has a standalone UI for images now, powered by its new image generation models GPT-Image-1.5. It beats Google’s Nano Banana Pro on leaderboards by just a margin. OpenAI is also, once again, attempting to make an App Store for ChatGPT - they have now opened submissions for review and a path to monetise them.

Meta has been focusing on its SAM (Segment Anything Model) series of models recently. The latest addition to it is Sam Audio, which isolates and edits sound from complex audio mixtures. Also, SAM3 now powers Instagram’s Edit app - making it easier to blur an object, tag an outfit, outline, and more.

Claude Code now suggest follow-up prompts automatically (press Tab to accept), highlights syntax in diffs, and has a first-party plugins marketplace. Pro tip: if you installed Claude Code a long time ago, you should switch to their native install (vs the old NPM install). Run “claude install” in your terminal to update.

Register now for MongoDB’s Agentic Hackathon on Jan 10. Build your idea in 1 day, pitch it to top industry leaders, and compete for $30k in cash prizes. Finalists receive free access to MongoDB.local SF tech conf on Jan 15. Not competing? Use code MDBBuilder for 50% off .local tickets.*

🌐 What I’m consuming

Frontier of the Year 2025 - New milestones reached in 2025 (beyond just AI) and their potential impact.
Cursor CEO interviews John Schulman, the co-founder of Thinking Machines and previously OpenAI.
Prompt caching - 10x cheaper LLM tokens, but how? This is a very good and beginner-friendly read, even if you don’t know anything about LLMs
Prototypes are the new PRDs.
What actually is Claude Code’s plan mode?
The jagged AI frontier is a data frontier.
Writing the prompts for Granola’s Crunched 2025.
This week, Adobe added three of its most popular apps, Photoshop, Adobe Express and Acrobat, into ChatGPT. So now you can edit photos, create designs and edit PDFs directly in your ChatGPT conversations. This handy tutorial shows you how to get started for free.*

⚙️ Tools and demos

These new DEEP WORK AI Agents by a Swedish startup called “Incredible“ are going viral. Don’t miss this one! Only 300 spots left. Join here.*
People Search by Exa AI - Search over 1 billion people profiles semantically for sales, recruiting, market research and more.
Resemble AI - Deepfake detection with the answer to why some content is flagged, not just that it is.
Letta Code - A memory-first coding agent. Long-lived agents that persist across sessions and improve with use. (read more)
ngrok.ai - Route, secure, and manage traffic to any LLM—cloud or local—with one unified platform.
Review.fast - Speed up human review of AI-generated code 3X.
Quorum - Get AI to write text that doesn’t feel like slop.

🥣 Dev Dish

Mistral has a new API only model called Mistral Small Creative with $0.1 and $0.3 input/output costs for 1M tokens. (available on OpenRouter too)
port-killer - Native macOS menu bar app for finding and killing processes on open ports.
prettylog.net - Parser & formatter with search and syntax highlighting for your raw logs, debug output, etc.
google/mcp - The GitHub repo for all MCP servers at Google.
mini-SGLang - Distilled version of SGLang from 300K into 5,000 lines for anyone to understand how inference really works.

🍦 Afters

Replit Learn - Free beginner-friendly lessons to build real apps.
FrontierScience - A new benchmark from OpenAI to measure PhD-level scientific reasoning across physics, chemistry, and biology. GPT-5.2 tops the chart in both structured and open-ended problems.
YouTube is testing Playables Builders with creators to build mini games with AI and share them with their audience.

Enjoy this newsletter? Forward it to a friend.

That’s it for today. Feel free to comment and share your thoughts. 👋

Find me on X, Linkedin, or Instagram
Read about me and Ben’s Bites
📷 thumbnail creds: @keshavatearth

Thanks to today’s sponsors who made this newsletter possible :)
Wanna partner with us for Q1?

Discussion about this post

Ready for more?