EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

This episode of "You Are A Helpful (Research) Assistant" is an AI-generated, human-curated discussion of vulnerabilities in refusal training for language models. It focuses on the past tense attack and how it affects model behavior.
