Two Minds, Lower Trust

Impossible d'ajouter des articles

Désolé, nous ne sommes pas en mesure d'ajouter l'article car votre panier est déjà plein.

Veuillez réessayer plus tard

Échec de l’élimination de la liste d'envies.

Veuillez réessayer plus tard

Impossible de suivre le podcast

Impossible de ne plus suivre le podcast

Two Minds, Lower Trust

Écouter gratuitement

Voir les détails

À propos de ce contenu audio

Why orchestrate multiple AI agents when a single strong model is so capable? Jon walks through three distinct rationales — capability, parallel context, and trust — and uses Anthropic's Claude Mythos Preview and Project Glasswing as the live, industrial-scale case study.

Credits

Cover Art by Brianna Williams

TMOM Intro Music by Danny Meza

A special thank you to these talented artists for their contributions to the show.

Links and Reference

Stanford 2026 AI Index Report: https://hai.stanford.edu/ai-index/2026-ai-index-report
Claude Opus 4.7 announcement: https://www.anthropic.com/news/claude-opus-4-7
Project Glasswing announcement: https://www.anthropic.com/glasswing
Claude Mythos Preview — Frontier Red Team write-up: https://red.anthropic.com/2026/mythos-preview/
Claude Mythos Preview — Alignment Risk Update: https://anthropic.com/claude-mythos-preview-risk-report
Andon Labs Vending-Bench (the eval Jon describes): https://andonlabs.com/evals/vending-bench
Mixture-of-Agents (Wang et al., June 2024): https://arxiv.org/abs/2406.04692
Self-MoA / "Rethinking Mixture-of-Agents" (Lee et al., Feb 2025): https://arxiv.org (search by title)
AI Control: Improving Safety Despite Intentional Subversion (Greenblatt et al., Dec 2023, Redwood Research): https://arxiv.org/abs/2312.06942
Anthropic multi-agent research system blog: https://www.anthropic.com/engineering/built-multi-agent-research-system
MAGDI — distilling multi-agent debate (Chen et al., early 2024): https://arxiv.org/abs/2402.01620
MACA — Multi-Agent Consensus Alignment (Sept 2025): https://arxiv.org (search by title)
Agent Arc — distilling multi-agent intelligence into a single LLM agent (Feb 2026): https://arxiv.org (search by title)
Condorcet Jury Theorem (1785): https://plato.stanford.edu/entries/jury-theorems/

Abandoned Episode Titles

How to Build God and Then Email Yourself About It from the Park

Four PhDs and a Guy Who Thinks the Colosseum Invented Pasta

Mythos Cleaned Its Git History So You Wouldn't Have To

OpenBSD Spent 27 Years Hardening the Wrong Things

Aucun commentaire pour le moment