AI Explained Official Podcast

By: Philip - Host of AI Explained YT
About this audio

Covering the biggest news of the century - the arrival of smarter-than-human AI. From the author of Simple Bench, which reveals the remaining gap between LLM and human reasoning. Hype-free, and the British accent is a freebie bonus.

© 2026 AI Explained Official Podcast
Personal Development, Politics & Government, Personal Success, Social Sciences
    Episodes
    • Deadline Day for Autonomous AI Weapons & Mass Surveillance
      Feb 27 2026

      Will Anthropic be forced to make a version of Claude for war? And does a new paper expose the risks of Claude agents, in both OpenClaw and the field of war? Plus, 5 more twists in the story of the Pentagon versus Anthropic + some AI lab employees, and a petition that could change everything, or nothing...


      Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai

      AI Insiders ($9!): https://www.patreon.com/AIExplained

      Chapters:
      00:00 - Introduction
      00:44 - Deadline Day + Petition
      02:42 - Twist 1: Existing Deal
      03:26 - Twist 2: Existing Policy
      04:21 - Twist 3: Twin Threats
      05:54 - Twist 4: Interesting Objections
      11:32 - Twist 5: Anthropic’s Dropped Policy


      Dario Statement: https://www.anthropic.com/news/statement-department-of-war

      Google/OpenAI Petition: https://notdivided.org/

      Axios on Amodei Rejection: https://www.axios.com/2026/02/26/anthropic-rejects-pentagon-ai-terms

      FT on US Threat: https://www.ft.com/content/11d27612-d6c5-4cf7-94dd-f65603549b7f

      Politico on Latest: https://archive.ph/20260227013117/https://www.politico.com/news/2026/02/26/incoherent-hegseths-anthropic-ultimatum-confounds-ai-policymakers-00800135

      The Verge on Current Deal: https://www.theverge.com/ai-artificial-intelligence/883456/anthropic-pentagon-department-of-defense-negotiations

      Anthropic RSP change: https://www.anthropic.com/news/responsible-scaling-policy-v3

      Time Magazine on RSP: https://time.com/7380854/exclusive-anthropic-drops-flagship-safety-pledge/

      Agent of Chaos Paper: https://x.com/NatalieShapira/status/2026062499599319526

      AI Agent Reliability Paper: https://arxiv.org/pdf/2602.16666

      My Patreon Video: https://www.patreon.com/posts/real-mystery-ai-151647211

      Patreon Documentary: https://www.patreon.com/posts/our-new-age-of-133960279



      Non-hype Newsletter: https://signaltonoise.beehiiv.com/

      Podcast: https://aiexplainedopodcast.buzzsprout.com/

      14 min
    • Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI
      Feb 20 2026

      Do we have a new best AI model, or do we have the downfall of benchmarks in general, as a way of capturing machine intelligence? Full breakdown of Gemini 3.1 Pro, guest-starring the new Sonnet 4.6, plus analysis from 7 papers/posts that will give you much needed context. Oh, and a new record on Simple Bench!

      https://epoch.ai/ai-explained-datacenters


      Check out my fast-growing (!) app, free to use, and code INSIDER15 for Pro: https://lmcouncil.ai

      AI Insiders ($9!): https://www.patreon.com/AIExplained


      Chapters:
      00:00 - Introduction
      00:30 - Post-training Dominance
      04:00 - ARC-AGI 2 Caveat
      05:54 - Simple Bench Record
      08:22 - Hallucination Caveat
      10:05 - Model Card
      11:12 - Exponential Coming
      12:20 - Amodei on Generalizing
      15:10 - One True Benchmark?
      17:02 - Other Metrics…

      Gemini 3.1 Model Card: https://storage.googleapis.com/deepmind-media/Model-Cards/Gemini-3-1-Pro-Model-Card.pdf

      Release: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/

      Where are Agents deployed?: https://www.anthropic.com/research/measuring-agent-autonomy

      Newsletter Post: https://signaltonoise.beehiiv.com/p/4-ai-numbers-that-surprised-me-this-week

      Hallucination AA: https://artificialanalysis.ai/evaluations/omniscience

      Melanie Mitchell: https://x.com/MelMitchell1/status/2022738363548340526
      ARC-AGI-2: https://x.com/arcprize/status/2024522812728496470/photo/1

      Chollet on Agentic Coding and ML: https://x.com/fchollet/status/2024519439140737442

      METR Caveat: https://metr.org/notes/2026-01-22-time-horizon-limitations/

      Taalas Fast: https://chatjimmy.ai/

      Amodei Interview Continual learning: https://www.dwarkesh.com/p/dario-amodei-2?open=false#%C2%A7002942-is-continual-learning-necessary-how-will-it-be-solved

      Metaculus FutureEval: https://www.metaculus.com/futureeval/

      Next Vid to Watch: https://www.patreon.com/posts/what-you-need-to-150647292



      Non-hype Newsletter: https://signaltonoise.beehiiv.com/

      Podcast: https://aiexplainedopodcast.buzzsprout.com/

      19 min
    • The Two Best AI Models/Enemies Just Got Released Simultaneously
      Feb 6 2026

      The two models that you will hear discussed for at least the next two months - Claude Opus 4.6 and GPT 5.3 Codex - just got released within 26 mins of each other. The full breakdown of around 250 pages of reports, with just the most interesting moments, from the battle of which is best, Claude personhood, the surprising misbehaviour of Opus 4.6, and much more.

      https://assemblyai.com/aiexplained

      Check out my fast-growing (!) app, free to use, and code INSIDER15 for Pro: https://lmcouncil.ai

      AI Insiders ($9): https://www.patreon.com/AIExplained

      Chapters:
      00:00 - Introduction
      00:54 - Self-improvement?
      02:44 - Knowledge Work
      05:30 - Overly agentic behaviour
      09:12 - Who Shouldn’t Use Claude Opus
      11:39 - Step-change?
      15:09 - Claude’s ‘Personhood’

      Hassabis Roadmap: https://www.patreon.com/posts/hassabis-roadmap-149750869

      Release of Opus 4.6: https://www.anthropic.com/news/claude-opus-4-6
      212 Page System Card: https://www-cdn.anthropic.com/0dd865075ad3132672ee0ab40b05a53f14cf5288.pdf
      Claude Code Tip: https://x.com/bcherny/status/2019475897691124107


      GPT 5.3 Codex: https://openai.com/index/introducing-gpt-5-3-codex/

      System Card: https://openai.com/index/gpt-5-3-codex-system-card/

      Browse Comp: https://arxiv.org/pdf/2504.12516v1
      Finance Agent: https://www.vals.ai/benchmarks/finance_agent
      Terminal Bench 2: https://arxiv.org/pdf/2601.11868
      Vending Bench: https://andonlabs.com/blog/opus-4-6-vending-bench

      My X post: https://x.com/AIExplainedYT/status/2016851303436095647

      Anthropic Apology: https://x.com/ch402/status/2014066134194995256/photo/1

      Altman rebuttal: https://x.com/sama/status/2019139174339928189
      https://x.com/sama/status/2019140276246442089

      4% of GitHub: https://x.com/dylan522p/status/2019490550911766763



      Non-hype Newsletter: https://signaltonoise.beehiiv.com/

      Podcast: https://aiexplainedopodcast.buzzsprout.com/

      20 min