Source |
Dark Reading |
Identifiant |
8646677 |
Date de publication |
2025-02-03 22:13:26 (vue: 2025-02-03 23:08:08) |
Titre |
\\'Constitutional Classifiers\\' Technique Mitigates GenAI Jailbreaks |
Texte |
Anthropic says its Constitutional Classifiers approach offers a practical way to make it harder for bad actors to try and coerce an AI model off its guardrails.
Anthropic says its Constitutional Classifiers approach offers a practical way to make it harder for bad actors to try and coerce an AI model off its guardrails. |
Notes |
★★★
|
Envoyé |
Oui |
Condensat |
actors anthropic approach bad classifiers coerce constitutional genai guardrails harder its jailbreaks make mitigates model off offers practical says technique try way |
Tags |
|
Stories |
|
Move |
|