Agent Skills: RAG Embedding Generation Skill

Batch embedding generation with caching, rate limiting, and multiple provider support

UncategorizedID: a5c-ai/babysitter/rag-embedding-generation

Install this agent skill to your local

pnpm dlx add-skill https://github.com/a5c-ai/babysitter/tree/HEAD/plugins/babysitter/skills/babysit/process/specializations/ai-agents-conversational/skills/rag-embedding-generation

Skill Files

Browse the full folder contents for rag-embedding-generation.

Download Skill

Loading file tree…

plugins/babysitter/skills/babysit/process/specializations/ai-agents-conversational/skills/rag-embedding-generation/SKILL.md

Skill Metadata

Name
rag-embedding-generation
Description
Batch embedding generation with caching, rate limiting, and multiple provider support

RAG Embedding Generation Skill

Capabilities

  • Generate embeddings with multiple providers
  • Implement batch processing for large datasets
  • Configure caching for embedding reuse
  • Handle rate limiting and retries
  • Support various embedding models
  • Implement embedding quality validation

Target Processes

  • rag-pipeline-implementation
  • vector-database-setup

Implementation Details

Embedding Providers

  1. OpenAI Embeddings: text-embedding-ada-002, text-embedding-3-*
  2. HuggingFace: sentence-transformers models
  3. Cohere: embed-v3 models
  4. Voyage AI: voyage-2 models
  5. Local Models: GGUF/ONNX embedding models

Configuration Options

  • Model selection and parameters
  • Batch size optimization
  • Cache backend configuration
  • Rate limit settings
  • Retry policies
  • Dimensionality settings

Best Practices

  • Use appropriate model for domain
  • Implement caching for cost reduction
  • Monitor embedding quality
  • Handle API errors gracefully

Dependencies

  • langchain-openai / langchain-huggingface
  • numpy
  • Caching backend (Redis, SQLite)