Agent Skills: multimodal-llm

Vision, audio, video generation, and multimodal LLM integration patterns. Use when processing images, transcribing audio, generating speech, generating AI video (Kling, Sora, Veo, Runway), or building multimodal AI pipelines.

mcp-enhancement

ID: yonatangross/orchestkit/multimodal-llm

Install this agent skill locally:

pnpm dlx add-skill https://github.com/yonatangross/orchestkit/multimodal-llm
