Agent Skills: gemini-image-generator

Generate images using Google Gemini with customizable options

Uncategorized
ID: mkdev-me/claude-skills/gemini-image-generator

Install this agent skill locally:

pnpm dlx add-skill https://github.com/mkdev-me/agent-skills/tree/HEAD/gemini-image-generator

gemini-image-generator/SKILL.md

Instructions

Use this skill to generate images using Google Gemini's image generation model. The skill supports:

  • Text-to-image generation from prompts
  • Image-to-image generation with a reference image
  • Multiple output sizes (1K, 2K, 4K)
  • Custom output paths

The API key must be set via the GEMINI_API_KEY environment variable.

Parameters

  • --prompt (required): The text prompt describing the image to generate
  • --output (required): Output file path for the generated image
  • --reference (optional): Path to a reference image for style/content guidance
  • --size (optional): Image size, one of "1K", "2K", or "4K" (default: 4K)
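
As a rough illustration of how these flags fit together, the following is a minimal argparse sketch that mirrors the documented parameters. It is an assumption made for illustration only: the function name build_parser is hypothetical, and the actual scripts/generate.py may parse its arguments differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser that mirrors the documented flags;
    # the real generate.py may be implemented differently.
    parser = argparse.ArgumentParser(
        description="Generate images using Google Gemini"
    )
    parser.add_argument("--prompt", required=True,
                        help="Text prompt describing the image to generate")
    parser.add_argument("--output", required=True,
                        help="Output file path for the generated image")
    parser.add_argument("--reference", default=None,
                        help="Optional reference image for style/content guidance")
    parser.add_argument("--size", choices=["1K", "2K", "4K"], default="4K",
                        help="Image size (default: 4K)")
    return parser


# Parse a sample command line; --size falls back to its documented default.
args = build_parser().parse_args(["--prompt", "Abstract art", "--output", "art.png"])
print(args.size)  # 4K
```

Restricting --size with choices means an unsupported value such as "8K" is rejected before any API call is made.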

Examples

Basic text-to-image generation

./scripts/generate.py --prompt "A serene mountain landscape at sunset" --output images/landscape.png

With reference image for style guidance

./scripts/generate.py --prompt "Same character but wearing a party hat" --reference images/character.png --output images/party.png

Different output size

./scripts/generate.py --prompt "Abstract art" --output art.png --size 2K
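
When the script is driven from other Python code, the command lines above can be assembled programmatically. The sketch below is one possible wrapper, not part of the skill: build_command and generate are hypothetical helper names, and the script path is taken from the examples.

```python
import subprocess
from typing import List, Optional

SCRIPT = "./scripts/generate.py"  # path as used in the examples


def build_command(prompt: str, output: str,
                  reference: Optional[str] = None,
                  size: Optional[str] = None) -> List[str]:
    # Hypothetical helper: builds the argument list for the documented flags.
    cmd = [SCRIPT, "--prompt", prompt, "--output", output]
    if reference is not None:
        cmd += ["--reference", reference]
    if size is not None:
        cmd += ["--size", size]
    return cmd


def generate(prompt: str, output: str,
             reference: Optional[str] = None,
             size: Optional[str] = None) -> None:
    # Runs the script and raises CalledProcessError on a non-zero exit code.
    subprocess.run(build_command(prompt, output, reference, size), check=True)


print(build_command("Abstract art", "art.png", size="2K"))
# ['./scripts/generate.py', '--prompt', 'Abstract art', '--output', 'art.png', '--size', '2K']
```

Passing the arguments as a list avoids shell quoting problems with prompts that contain spaces or quotes.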

Setup

Before first use, set up the virtual environment:

cd scripts && python3 -m venv venv && ./venv/bin/pip install -r requirements.txt

Set your API key:

export GEMINI_API_KEY="your-api-key-here"
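
A missing key is easier to diagnose if it is checked before any generation is attempted. The following is a minimal sketch of such a check; require_api_key is a hypothetical helper, and whether generate.py performs a similar check itself is not stated here.

```python
import os
from typing import Mapping


def require_api_key(env: Mapping[str, str] = os.environ) -> str:
    # Hypothetical helper: returns the key, or fails early with a clear message.
    key = env.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GEMINI_API_KEY is not set; export it before running generate.py"
        )
    return key
```

Calling require_api_key() at the start of a wrapper script surfaces the configuration problem immediately rather than as an API error later.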