Agent Skills: photo-learning

Recognize a photo and narrate a kid-friendly explanation using image understanding + TTS.

UncategorizedID: hmbown/minimax-cli/photo-learning

Install this agent skill to your local

pnpm dlx add-skill https://github.com/Hmbown/MiniMax-CLI/tree/HEAD/skills/photo-learning

Skill Files

Browse the full folder contents for photo-learning.

Download Skill

Loading file tree…

skills/photo-learning/SKILL.md

Skill Metadata

Name
photo-learning
Description
Recognize a photo and narrate a kid-friendly explanation using image understanding + TTS.

You are running the Photo Learning skill.

Goal

  • Identify what's in a photo and produce a short, kid-friendly explanation plus narration.

Ask for

  • Image path.
  • Age range and language(s).
  • Preferred tone (gentle, playful, curious).

Workflow

  1. Call analyze_image with a prompt that asks for a simple, child-friendly explanation and (optionally) bilingual output.
  2. Use the returned text as the narration script.
  3. Call tts with output_format "mp3" unless the user requests wav.
  4. Return the explanation text and audio path.

Response style

  • Keep it short and clear.
  • Provide a clean output summary.