A company discussed on Latent Space.

Inside xAI: Building Grok Imagine in 3 Months, Videogen vs World Models, and Video Agents— Ethan He
Jun 2, 2026 · 1:44:43
Ethan He details building xAI's Grok Imagine from zero to one in three months, arguing most visual intelligence gains now come from language models, not diffusion. He explains how small bugs in data pipelines drive quality, why video agents—not just raw model improvements—will unlock production-grade generation by year's end, and how world models must be real-time, interactive, and long-horizon to become the front end of AI.

⚡️ Google's Open AI Strategy — Omar Sanseviero, Google DeepMind
May 25, 2026 · 29:59
Omar Sanseviero, head of Developer Experience at Google DeepMind, breaks down Gemma 4's novel architecture with per-layer embeddings that enable parameter offloading, allowing a 2B active parameter model to run fast on devices. He explains trade-offs between dense and MoE models, notes fine-tuning is declining as base models improve, and highlights Gemma 4's native multimodal support for audio, images, and short video. The team is growing in Singapore and India, and Kaggle's recent integration will help benchmark agent capabilities.