Podcast Guide

Results for "SWE-bench"

2 results

Episodes

  • Latent Space: The AI Engineer Podcast
    StandardSummaries only

    ⚡️The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals & Human Data

    Latent Space: The AI Engineer Podcast· Feb 23, 2026

    Olivia Watkins (Frontier Evals team) and Mia Glaese (VP of Research at OpenAI, leading the Codex, human data, and alignment teams) discuss a new blog post (https://openai.com/index/why-we-no-longer-evaluate-swe-bench-ver

    openaievals
  • Latent Space: The AI Engineer Podcast
    StandardSummaries only

    [LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka

    Latent Space: The AI Engineer Podcast· Feb 26, 2026

    Swyx joined SAIL! Thank you SAIL Media, Prof. Tom Yeh, 8Lee, Hamid Bagheri, c9n, and many others for tuning into SAIL Live #6 with Nathan Lambert and Sebastian Raschka, PhD. Sharing here for the LS paid subscribers.We co

    anthropic