News from Anthropic's official channels. usingclaude.com collects and shares this automatically; please refer to the original for the full text and exact context.
Next-generation Constitutional Classifiers: More efficient protection against universal jailbreaks
Last year, we described a new approach to defend against jailbreaks, which we called Constitutional Classifiers. We’ve now developed the next generation.
Published: 2026-06-05T15:19:41.000Z
Source: https://www.anthropic.com/research/next-generation-constitutional-classifiers