Couverture de MLGym: A New Framework and Benchmark for Advancing AI Research Agents

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Écouter gratuitement

Voir les détails

À propos de ce contenu audio

AutoML is dead an LLMs have killed it? MLGym is a benchmark and framework testing this theory. Roberta Raileanu and Deepak Nathani discuss how well current LLMs are doing at solving ML tasks, what the biggest roadblocks are, and what that means for AutoML generally.


Check out the paper: https://arxiv.org/pdf/2502.14499

More on Roberta: https://rraileanu.github.io/

More on Deepak: https://dnathani.net/

Aucun commentaire pour le moment