GPT

Product

A product discussed on Latent Space.

4 episodes

⚡️Making DeepSeek v4 outperform Opus 4.7 with Taste — @Ahmad Awais , CommandCode.ai
Jun 6, 2026 · 40:41
Ahmad Awais explains how his open-source CLI CommandCode uses a 'validate-then-repair' layer to fix tool-calling errors in open models like DeepSeek, allowing them to outperform premium models like Opus 4.7 in 6 of 10 evaluations. He argues that perceived weaknesses are harness/contract issues, not capability gaps, and extends the same repair logic to combat 'design slop' by encoding designer frameworks. Awais also shares plans to open-source CommandCode while keeping it focused on the best models.
Scaling Past Informal AI - Carina Hong, Axiom Math
Jun 4, 2026 · 1:33:04
Carina Hong, CEO of Axiom Math, argues that the path to superintelligence runs through formal verification, not informal RL, and that Lean-based systems can compound brilliance rather than just patch hallucinations. She explains how Axiom’s seven-month-old company achieved a perfect Putnam score and a $200M Series A by using verified generation to give better training signal, and lays out a vision where verification becomes the default infrastructure for all AI-generated code and reasoning.
Satya Nadella on AI: @NoPriorsPodcast x Latent Space Crossover Special at Microsoft Build 2026
Jun 3, 2026 · 41:27
Satya Nadella joins Swyx, Sarah Guo, and Elad Gil at Microsoft Build 2026 to argue that AI is an ecosystem platform where any company can build frontier intelligence using models, tools, data, and a harness—not just consume one model. He details Microsoft's MAI training strategy emphasizing clean data lineage, private evals as core IP, and multi-model harnesses with strong context layers. Nadella discusses real-world value from coding agents driving new IDE needs, long-running enterprise autopilots, and Work IQ turning M365 data into a usable database. He also covers evolving pricing models, SaaS unbundling, changing engineering roles, and the need for tangible societal benefits in healthcare and education.
Devin’s 80% Moment: Background Agents, 7x PRs, & End of Hand-Held Coding — Walden Yan & Cole Murray
Jun 1, 2026 · 1:09:33
Walden Yan (Cognition CPO) and Cole Murray (OpenInspect creator) join Swyx to unpack the rise of background agents: why a December 2025 model inflection made spec-to-PR workflows viable, how Devin's brain-outside-machine architecture handles security and scaling, and the unsolved challenges of repo setup, memory, and multi-agent orchestration. They argue uncontrolled vibe coding regresses codebases to the worst engineer, explain Devin's 7x merged PR growth to 80% of commits, and stress that local testing infra is the key to agent adoption.