Agent Skills: Real-time Voice AI

Real-time voice AI knowledge including STT and TTS providers, LiveKit Agents plugins, and voice pipeline patterns. Use when working with speech-to-text, text-to-speech, voice agents, LiveKit, or any voice-related infrastructure.

UncategorizedID: keenranger/dotfiles/voice-realtime

Install this agent skill to your local

pnpm dlx add-skill https://github.com/keenranger/dotfiles/tree/HEAD/agent/skills/voice-realtime

Skill Files

Browse the full folder contents for voice-realtime.

Download Skill

Loading file tree…

agent/skills/voice-realtime/SKILL.md

Skill Metadata

Name
voice-realtime
Description
Real-time voice AI knowledge including STT and TTS providers, LiveKit Agents plugins, and voice pipeline patterns. Use when working with speech-to-text, text-to-speech, voice agents, LiveKit, or any voice-related infrastructure.

Real-time Voice AI

Knowledge for building real-time voice agents with STT/TTS providers and LiveKit Agents.

Provider Selection

See stt-providers.md for speech-to-text comparison.

See tts-providers.md for text-to-speech comparison.

LiveKit Integration

See livekit-plugins.md for plugin usage and code patterns.

Quick Decision Matrix

| Requirement | STT | TTS | |-------------|-----|-----| | Korean + Accuracy | OpenAI gpt-4o-transcribe | Google Cloud TTS | | Korean + Low latency | Deepgram Nova-3 | ElevenLabs | | Korean specialty | CLOVA | Google Cloud TTS | | Cost-sensitive | Deepgram Nova-3 | Google Cloud TTS | | Best quality | OpenAI gpt-4o-transcribe | ElevenLabs | | Lowest latency | Deepgram Nova-3 | Cartesia |