Two Minds, Lower Trust
Impossible d'ajouter des articles
Échec de l’élimination de la liste d'envies.
Impossible de suivre le podcast
Impossible de ne plus suivre le podcast
-
Lu par :
-
De :
À propos de ce contenu audio
Why orchestrate multiple AI agents when a single strong model is so capable? Jon walks through three distinct rationales — capability, parallel context, and trust — and uses Anthropic's Claude Mythos Preview and Project Glasswing as the live, industrial-scale case study.
Credits
Cover Art by Brianna Williams
TMOM Intro Music by Danny Meza
A special thank you to these talented artists for their contributions to the show.
Links and Reference
Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report
Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7
Project Glasswing announcement: https://www.anthropic.com/glasswing
Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/
Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report
Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench
Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692
Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)
AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942
Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system
MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620
MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)
Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)
Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/
Abandoned Episode Titles
How to Build God and Then Email Yourself About It from the Park
Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta
Mythos Cleaned Its Git History So You Wouldn't Have To
OpenBSD Spent 27 Years Hardening the Wrong Things