Agent Skills: Constitutional AI Prompts Skill

Constitutional AI and safety guardrail prompts for aligned LLM behavior

UncategorizedID: a5c-ai/babysitter/constitutional-ai-prompts

Install this agent skill to your local

pnpm dlx add-skill https://github.com/a5c-ai/babysitter/tree/HEAD/plugins/babysitter/skills/babysit/process/specializations/ai-agents-conversational/skills/constitutional-ai-prompts

Skill Files

Browse the full folder contents for constitutional-ai-prompts.

Download Skill

Loading file tree…

plugins/babysitter/skills/babysit/process/specializations/ai-agents-conversational/skills/constitutional-ai-prompts/SKILL.md

Skill Metadata

Name
constitutional-ai-prompts
Description
Constitutional AI and safety guardrail prompts for aligned LLM behavior

Constitutional AI Prompts Skill

Capabilities

  • Design constitutional AI principles
  • Implement self-critique and revision prompts
  • Create harmlessness guidelines
  • Design refusal patterns for unsafe requests
  • Implement red-team testing prompts
  • Create ethics-aware response frameworks

Target Processes

  • system-prompt-guardrails
  • content-moderation-safety

Implementation Details

Constitutional Patterns

  1. Critique-Revision: Self-evaluate and improve responses
  2. Principle Adherence: Follow defined ethical principles
  3. Harmlessness Focus: Prioritize safe responses
  4. Helpfulness Balance: Balance helpfulness with safety
  5. Transparency: Acknowledge limitations

Configuration Options

  • Constitutional principles list
  • Critique prompts
  • Revision guidelines
  • Refusal templates
  • Escalation triggers

Best Practices

  • Define clear constitutional principles
  • Balance helpfulness and safety
  • Test with adversarial inputs
  • Document refusal patterns
  • Regular principle review

Dependencies

  • langchain-core