Couverture de Two Minds, Lower Trust

Two Minds, Lower Trust

Two Minds, Lower Trust

Écouter gratuitement

Voir les détails

À propos de ce contenu audio

Why orchestrate multiple AI agents when a single strong model is so capable? Jon walks through three distinct rationales — capability, parallel context, and trust — and uses Anthropic's Claude Mythos Preview and Project Glasswing as the live, industrial-scale case study.

Credits

Cover Art by Brianna Williams

TMOM Intro Music by Danny Meza

A special thank you to these talented artists for their contributions to the show.

Links and Reference

  • Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report

  • Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7

  • Project Glasswing announcement: https://www.anthropic.com/glasswing

  • Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/

  • Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report

  • Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench

  • Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692

  • Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)

  • AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942

  • Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system

  • MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620

  • MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)

  • Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)

  • Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/

Abandoned Episode Titles

How to Build God and Then Email Yourself About It from the Park

Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta

Mythos Cleaned Its Git History So You Wouldn't Have To

OpenBSD Spent 27 Years Hardening the Wrong Things


adbl_web_anon_alc_button_suppression_c
Aucun commentaire pour le moment