Fireship
May 22, 2026
1. Google's AI Scale & Infrastructure
Google's token serving capacity has grown from 9.7 trillion to 3.2 quadrillion tokens per month. The company is splitting TPU chips into TPU-T (training) and TPU-I (inference) to optimize for different workloads, with massive capex investments driving infrastructure growth.
2. Gemini Omni & The Agentic Era
Google announced Gemini Omni, a multimodal model accepting any input format and producing any output. The company is positioning AI agents across all products (search, Gmail, Android, glasses), marking a shift from hyperlink-based information organization to becoming the interface to reality itself.
3. Neural Expressive Design System
The new Gemini app design system optimizes for on-demand UI generation, creating diagrams, timelines, and mini apps dynamically based on user prompts. The visual refresh includes new icons and gradients alongside functional improvements.
4. Gemini Flash 3.5 & Model Updates
Gemini Flash 3.5 delivers near-flagship performance (comparable to Opus 4.7 and GPT-5.5) with significantly faster inference speeds. However, pricing increased 3-30x relative to previous versions. Gemini 3.5 Pro remains unreleased until summer 2026.
5. Anti-gravity IDE & AI Coding
Anti-gravity (formerly Windserve) demonstrated building a complete OS in 12 hours with AI agents handling design, testing, and deployment. The tool pivoted from code writing to agent management, with a live demo showing real-time driver generation for Doom gameplay.
6. Web Developer Tools: HTML on Canvas API
Chrome introduced the HTML on Canvas API, allowing native HTML elements to be rendered directly in canvas. This enables developers to build interactive UIs with pixel-level control via WebGL/WebGPU while leveraging HTML for basic UI elements.