Agent Skills: Content Moderation API Skill

Content moderation API integration using OpenAI Moderation, Perspective API, and others

Uncategorized
ID: a5c-ai/babysitter/content-moderation-api

Author

a5c-ai

https://github.com/a5c-ai

Repository

a5c-ai/babysitter

License: MIT

Install this agent skill locally:

pnpm dlx add-skill https://github.com/a5c-ai/babysitter/tree/HEAD/library/specializations/ai-agents-conversational/skills/content-moderation-api

Skill Files

library/specializations/ai-agents-conversational/skills/content-moderation-api/SKILL.md

Skill Metadata

Name: content-moderation-api
Description: Content moderation API integration using OpenAI Moderation, Perspective API, and others

Content Moderation API Skill

Capabilities

Integrate OpenAI Moderation API
Set up Perspective API for toxicity detection
Configure moderation thresholds
Implement content filtering pipelines
Design moderation response handling
Create moderation logging and reporting

Target Processes

content-moderation-safety
system-prompt-guardrails

Implementation Details

Moderation APIs

OpenAI Moderation: Hate, violence, self-harm, sexual content
Perspective API: Toxicity, insult, profanity, threat
Azure Content Safety: Text and image moderation
Llama Guard: Open-source safety classifier
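
As a rough illustration of the first entry above, the sketch below calls the OpenAI Moderation endpoint and flattens the first result into a plain dictionary of category scores. It assumes the `openai` Python SDK at version 1.0 or later and takes the client as an argument, so nothing here creates credentials. `ModerationResult` and `moderate_with_openai` are hypothetical names introduced for this example, and the default model name is an assumption that may need updating.

```python
from dataclasses import dataclass


@dataclass
class ModerationResult:
    """Provider-neutral result (hypothetical helper type, not part of any SDK)."""
    flagged: bool
    scores: dict  # category name -> score in [0, 1]


def moderate_with_openai(client, text, model="omni-moderation-latest"):
    """Call the OpenAI Moderation endpoint and normalize the first result.

    `client` is expected to be an `openai.OpenAI()` instance (openai>=1.0).
    """
    response = client.moderations.create(model=model, input=text)
    result = response.results[0]
    # In the SDK, category_scores is a pydantic model; model_dump() returns a dict.
    raw = result.category_scores.model_dump()
    # Drop categories the model did not score, so callers only see numbers.
    scores = {name: float(score) for name, score in raw.items() if score is not None}
    return ModerationResult(flagged=bool(result.flagged), scores=scores)
```

With a real client this would be used as `moderate_with_openai(OpenAI(), "some user message")`, where `OpenAI` is imported from `openai` and the API key comes from the environment. Returning a provider-neutral type makes it easier to add a second provider later without changing downstream code.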

Configuration Options

API credentials and endpoints
Category thresholds
Action policies (block, warn, flag)
Logging configuration
Fallback behavior
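
One possible way to represent the category thresholds, action policies, and fallback behavior listed above is sketched here. All names (`Action`, `ModerationConfig`, `decide`), the threshold values, and the severity ordering of flag, warn, and block are assumptions made for illustration; the skill itself does not prescribe them.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional


class Action(str, Enum):
    ALLOW = "allow"
    FLAG = "flag"
    WARN = "warn"
    BLOCK = "block"


# Assumed severity order, used to pick the strictest action across categories.
_SEVERITY = [Action.ALLOW, Action.FLAG, Action.WARN, Action.BLOCK]


@dataclass
class ModerationConfig:
    # category -> {action: minimum score that triggers it}; values are illustrative only.
    thresholds: dict = field(default_factory=lambda: {
        "hate": {Action.FLAG: 0.3, Action.WARN: 0.5, Action.BLOCK: 0.8},
        "violence": {Action.FLAG: 0.4, Action.BLOCK: 0.9},
    })
    # Action applied when no scores are available (API error or timeout).
    fallback_action: Action = Action.FLAG


def decide(scores: Optional[dict], config: ModerationConfig) -> Action:
    """Map category scores to the strictest configured action."""
    if scores is None:
        return config.fallback_action
    strictest = Action.ALLOW
    for category, score in scores.items():
        # Categories without configured thresholds never trigger an action.
        for action, minimum in config.thresholds.get(category, {}).items():
            if score >= minimum and _SEVERITY.index(action) > _SEVERITY.index(strictest):
                strictest = action
    return strictest
```

For example, `decide({"hate": 0.55}, ModerationConfig())` evaluates to `Action.WARN` under these illustrative thresholds, and `decide(None, ModerationConfig())` falls back to `Action.FLAG`. Whether the fallback should fail open or fail closed depends on the product, so it is kept as an explicit setting.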

Best Practices

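Set thresholds per category rather than one global cutoff
Handle edge cases gracefully (empty input, very long text, API errors and timeouts)
Log every moderation decision with its scores and the action taken
Review thresholds regularly
Use multi-layer moderation instead of relying on a single check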
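
The logging and multi-layer practices above could be combined roughly as follows: a cheap local blocklist runs first, a score-based layer (for example a wrapper around one of the moderation APIs) runs second, and every decision is written to a log. The helper names are hypothetical, and two design choices here are assumptions: a layer that raises is treated as an objection (fail closed), and the log stores a hash of the text instead of the text itself.

```python
import hashlib
import json
import logging

logger = logging.getLogger("moderation")


def blocklist_layer(terms):
    """Cheap first layer: case-insensitive whole-word match against a set of terms."""
    def check(text):
        hits = set(text.lower().split()) & set(terms)
        return f"blocklist:{sorted(hits)[0]}" if hits else None
    return check


def score_layer(name, scorer, threshold):
    """Wrap any callable that maps text to a score in [0, 1], such as an API client."""
    def check(text):
        score = scorer(text)
        return f"{name}:{score:.2f}" if score >= threshold else None
    return check


def run_layers(text, layers):
    """Run layers in order, stop at the first objection, and log the decision."""
    reason = None
    for layer in layers:
        try:
            reason = layer(text)
        except Exception:
            logger.exception("moderation layer failed")
            reason = "layer_error"  # assumption: fail closed when a layer errors
        if reason:
            break
    decision = {
        # Hash rather than raw text, so logs do not retain user content.
        "text_sha256": hashlib.sha256(text.encode("utf-8")).hexdigest(),
        "allowed": reason is None,
        "reason": reason,
    }
    logger.info(json.dumps(decision))
    return decision
```

A caller would build the list once, for example `[blocklist_layer({"badword"}), score_layer("toxicity", my_scorer, 0.8)]` with `my_scorer` standing in for a real classifier call, and pass each incoming message through `run_layers`. Ordering layers from cheapest to most expensive avoids remote calls for content the local check already rejects.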

Dependencies

openai
google-api-python-client (Perspective API)
azure-ai-contentsafety