Agent Skills: multimodal-llm
Vision, audio, video generation, and multimodal LLM integration patterns. Use when processing images, transcribing audio, generating speech, generating AI video (Kling, Sora, Veo, Runway), or building multimodal AI pipelines.
mcp-enhancementID: yonatangross/orchestkit/multimodal-llm
13314
Install this agent skill to your local
Skill Files
Browse the full folder contents for multimodal-llm.
Loading file tree…
Select a file to preview its contents.