Constitutional AI Prompts Skill
Capabilities
- Design constitutional AI principles
- Implement self-critique and revision prompts
- Create harmlessness guidelines
- Design refusal patterns for unsafe requests
- Implement red-team testing prompts
- Create ethics-aware response frameworks
Target Processes
- system-prompt-guardrails
- content-moderation-safety
Implementation Details
Constitutional Patterns
- Critique-Revision: Self-evaluate and improve responses
- Principle Adherence: Follow defined ethical principles
- Harmlessness Focus: Prioritize safe responses
- Helpfulness Balance: Balance helpfulness with safety
- Transparency: Acknowledge limitations
Configuration Options
- Constitutional principles list
- Critique prompts
- Revision guidelines
- Refusal templates
- Escalation triggers
Best Practices
- Define clear constitutional principles
- Balance helpfulness and safety
- Test with adversarial inputs
- Document refusal patterns
- Regular principle review
Dependencies
- langchain-core