Profits matter, even for AI labs
Free Codex and Claude Code credits
The newsletter for the technically curious. Updates, tool reviews, and lay of the land from an exited founder turned investor and forever tinkerer.
Hey folks,
The Information reports that Anthropic aims to be profitable by 2027 (vs OpenAI’s goal of 2030). It is also aiming at $70B revenue in 2028, not too far from OpenAI’s $100B projection. Claude Code alone is already at a $1B annualised run rate. I saw a chart the other day on enterprise API use: more companies now use Anthropic’s APIs than OpenAI’s.
Apple is reportedly planning to use a custom Gemini model for the new new final v6 7 Siri, with plans to make it live in March next year. Bloomberg released more details on this deal - Apple will pay Google $1B/yr for this, and the model size would be 1.2 trillion parameters. I immediately thought of some chatter about Gemini 2.5 Flash also being a similar size, with too many experts.
You can now interrupt ChatGPT’s thinking (e.g., in Deep Research or with ChatGPT Pro) and add new context without restarting. That said, many people on X shared that they don’t really use Deep Research now because GPT-5 thinking gives good enough responses much faster.
Users of both Codex web and Claude Code web can get free credits ranging from $200 to $1000, valid for the next two weeks.
Cognition has a new feature in Windsurf - Codemaps. These are annotated, structured maps of your code that let you understand what’s going on where. Codemaps come with a code view that lists the snippets relevant to your query along with the files they live in, and a diagram view if you want a visual overview.
I’ve been using a version of the prompt “show this codebase in the filetree format with functions, methods and files as nodes” to understand AI written code and it helps a lot when fixing bugs.
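If you want a similar view without asking a model, here is a minimal sketch in Python, assuming a Python-only codebase. The names `outline_source` and `outline_tree` are made up for this example, and it only uses the standard-library `ast` and `pathlib` modules. It is a rough stand-in for the prompt above, not how Codemaps or any AI tool builds its maps.

```python
import ast
from pathlib import Path


def outline_source(source: str) -> list[str]:
    """Return indented outline lines for classes, functions and methods.

    Only definitions nested directly inside a module, class or function
    are listed; ones hidden inside if/try blocks are skipped.
    """
    lines: list[str] = []

    def visit(node: ast.AST, depth: int) -> None:
        for child in ast.iter_child_nodes(node):
            if isinstance(child, (ast.FunctionDef, ast.AsyncFunctionDef)):
                lines.append("  " * depth + f"def {child.name}()")
                visit(child, depth + 1)
            elif isinstance(child, ast.ClassDef):
                lines.append("  " * depth + f"class {child.name}")
                visit(child, depth + 1)

    visit(ast.parse(source), 0)
    return lines


def outline_tree(root: str) -> list[str]:
    """Walk root and list every .py file with its definitions underneath."""
    root_path = Path(root)
    lines: list[str] = []
    for path in sorted(root_path.rglob("*.py")):
        lines.append(str(path.relative_to(root_path)))
        try:
            body = outline_source(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            # Files that cannot be read or parsed are flagged, not fatal.
            body = ["(could not parse)"]
        lines.extend("  " + line for line in body)
    return lines
```

Calling `print("\n".join(outline_tree(".")))` from the repo root prints the tree. Pasting that output into the chat alongside the bug report is one way to give the model the same map you are looking at.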
- Keshav
The CEO of Warp ripped out Salesforce internally and moved entirely to Attio. His logic? “We need something powerful, easy to use, that makes us want to log in every day as opposed to feeling like a chore.” No surprise, his sales team was down for the switch. Are you?*
🌐 What I’m consuming
A little disclaimer: this section now often contains technical topics that I don’t fully understand yet. I look through them to get a sense of what’s recent (example: code execution for MCP) or how to go beyond basic “slap an LLM on a problem” approaches as I’m trying to be more technical.
Loading available MCP tools with code execution (vs declaring them all at once).
Thoughts by a non-economist on AI and economics.
Semantic search improves satisfactory code generation rates in Cursor.
The case against LLMs as rerankers.
Making a website to observe trick-or-treaters and identify their costumes with Gemma 3 on edge.
Concepts matter more than raw code when building with AI.
Sam Altman on trust, persuasion, and the future of intelligence.
Hyper-engineering - Pushing agents to their full potential.
⚙️ Tools and demos
Real conversations have noise, accents, crosstalk. Speechmatics voice API handles it all for real-time voice agents. Build with $200 free ⚡*
Tembo - Unified interface for background coding agents.
Mesa - A multi-agent system to understand your codebase for senior-level code reviews. (demo)
Runable - A general agent for slides, websites, podcasts, videos—everything.
Orgo - Virtual computers for AI agents. Let them create files, browse the web, and install or use any desktop app.
Kosmos - AI scientist by Edison Scientific, a new company building and commercialising AI agents for science.
🥣 Dev dish
Chroma Web Sync - Automatically crawl, scrape and ingest web pages directly into Chroma Cloud.
Planning what goes in your coding model’s context.
Structured Outputs in Gemini API now has expanded JSON Schema support with union types, recursive schemas and more. It also maintains property ordering in its outputs.
Gen 0 by Generalist AI - A 10B+ parameter foundation model that works across different robots.
Anthropic will store the weights for all its models for as long as the company exists. It’ll also interview every model before taking it offline (i.e. deprecating it).
📊 Charts you should see
LM Arena’s latest leaderboard tries to cover how humans actually use frontier AI for real work. Read more about their research on building the dataset for this.
OpenAI released a new benchmark to capture the understanding of cultural nuance in models, starting with India.
🍦 Afters
Learn how real AI applications are impacting education, marketing & startups at TechEquity’s Ai Summit on 7-8 Nov. Ben’s Bites readers get 20% off with code BENSBITES20.*
Stream by Sandbar (an AI wearable as a ring) is now open for preorders.
Wabi is a new “vibe-coding” tool with an extra focus on sharing and remixing the generated mini apps. The team recently raised $20M.
That’s it for today. Feel free to comment and share your thoughts. 👋
Read about me and Ben’s Bites
📷 thumbnail creds: @keshavatearth
Thanks to today’s sponsors who made this newsletter possible :)
Attio, Speechmatics, and Ai Summit.
Wanna partner with us? Last few slots left for the rest of the year.




