发现和使用优秀的技能扩展
检测提示注入、越狱、角色劫持和系统提取尝试。应用语义分析和 penalty 评分进行多层防御。
Detect prompt injection, jailbreak, role-hijack, and system extraction attempts. Applies multi-layer defense with semantic analysis and penalty scoring.