The newsletter for the technically curious. Updates, tool reviews, and lay of the land from an exited founder turned investor and forever tinkerer. Hey folks, Gemini 3 Pro is out beating GPT-5.1, Sonnet 4.5 on all benchmarks but one (SWE-Bench Verified). It’s only slightly smarter than other models, but it is much better at vision. It scores 72.7% on a benchmark for screenshot understanding; the second best is 36.2%. Faster and slightly expensive than 2.5 Pro. My vibe check: it’s a good model, it’s eager to commit changes (!!), isn’t as good as following instructions as Codex (maybe some new prompting quirks needed?) and definitely good at frontend. This launch comes with a new IDE from Google: Antigravity (courtesy of acquiring the Windsurf founders). It also has an Agent Manager and a browser for the agent to see/test what it has built. I tried it for a few hours: Tab complete feels very slow, and the agent is over-eager to implement a plan (while in planning mode). The Agent Manager treats extra documents like plans, task lists as separate “artifacts” and doesn’t clutter your codebase. I liked that. It took me a while to get the browser integration working, but once it was set up, it was really nice and fast too vs Atlas. In other stuff, they released Gemini Agent (for ultra subs only), introduced Dynamic UI in chat, teased Gemini 3 Deep Think and rolled it out in Search (via AI mode) on day 1. They are not done (potentially nano banana 2 today), but neither are OpenAI and xAI. OpenAI released two models: GPT-5.1 Pro and GPT-5.1-Codex-Max to follow up on Gemini 3, and xAI has released Grok 4.1 Fast. These three models all look like they’d do better than Gemini 3 Pro, but only on specific tasks (hard academic problems, code generation and tool calls). An underrated release from Meta: SAM 3. SAM (Segment Anything Model) family of models can take an image/video and create an overlay of any individual or group of objects in it. Meta is partnering with Roboflow to let people fine-tune SAM on their use cases, and it’ll use SAM in Instagram’s video editing app called “Edits”. One API for All Your Voice AI Workflows. Stop wasting time juggling voice AI vendors. AssemblyAI combines multilingual Speech-to-Text, speaker diarization, speech understanding & LLMs in one developer-friendly API. Trusted by Granola, Dovetail & Ashby. Free to try, pay-as-you-go. Start building voice AI today.* I’ve been talking with TELUS (one of Canada’s largest telecom companies) this year. They built a platform that allows 70k+ employees pick from over 30 LLMs to build copilots. I chatted with them and wrote about their story here. 🌐 What I’m consuming
⚙️ Tools and demos
🍦 Afters
That’s it for today. Feel free to comment and share your thoughts. 👋
You're currently a free subscriber to Ben's Bites. For the full experience, upgrade your subscription. |


0 Comments
VHAVENDA IT SOLUTIONS AND SERVICES WOULD LIKE TO HEAR FROM YOU🫵🏼🫵🏼🫵🏼🫵🏼