AI Image Generation Skill Skill

AI Image Generation Skill

Generate high-quality AI images using OpenAI's gpt-image-1 model with customizable styles and themes.

When to Use This Skill

Use this skill when the user wants to:

Generate images from text descriptions
Create artwork with specific artistic styles
Generate images with particular aspect ratios (vertical, horizontal, square)
Apply themed visual styles (Studio Ghibli, futuristic, Pixar, oil painting, Chinese painting)

Instructions

Check for API Key: Verify that the OPENAI_API_KEY environment variable is set
Gather Requirements: Ask the user for:
- Image prompt (required)
- Style/aspect ratio: vertical (1024x1536), horizontal (1536x1024), or square (1024x1024)
- Theme: ghibli, futuristic, pixar, oil-paint, or chinese-paint (optional)
- Output location (optional, defaults to ./generated_image.png)
Run the CLI: Execute the main.py script with the appropriate parameters
Report Results: Show the user where the image was saved and any relevant details

Available Options

Aspect Ratios (--style)

vertical: 1024x1536 pixels (portrait orientation)
horizontal: 1536x1024 pixels (landscape orientation)
square: 1024x1024 pixels (default)

Artistic Themes (--theme)

ghibli: Studio Ghibli animation style with whimsical, dreamlike aesthetics
futuristic: Sci-fi style with sleek designs and neon lights
pixar: Vibrant 3D animation style with expressive characters
oil-paint: Classical oil painting with rich textures and brushstrokes
chinese-paint: Traditional Chinese ink painting with delicate brushwork

Usage Examples

Basic Usage

uv run main.py --prompt "a cat sitting on a tree"

With Style and Theme

uv run main.py --prompt "a sunset over mountains" --style horizontal --theme oil-paint --output ./sunset.png

Futuristic Portrait

uv run main.py --prompt "a robot in a city" --style vertical --theme futuristic --output ./robot.png

Studio Ghibli Landscape

uv run main.py --prompt "a magical forest with spirits" --style horizontal --theme ghibli --output ./forest.png

Setup Requirements

This skill requires an OpenAI API key with access to the gpt-image-1 model:

export OPENAI_API_KEY='your-api-key-here'

Note: Using gpt-image-1 requires organization verification on the OpenAI platform.

Technical Details

Model: OpenAI gpt-image-1 (released April 2025)
Response Format: Base64 encoded images (b64_json)
Supported Sizes: 1024x1024, 1024x1536, 1536x1024
Maximum Resolution: Up to 4096x4096 pixels
Dependencies: openai>=2.7.1

Pricing Information

Usage is priced per token:

Text tokens: $5 per million
Image input tokens: $10 per million
Image output tokens: $40 per million

Approximate costs per generated image:

Low quality square: ~$0.02
Medium quality square: ~$0.07
High quality square: ~$0.19

Troubleshooting

API Key Not Set

If you see "Error: OPENAI_API_KEY environment variable not set", ensure your API key is exported in your shell session.

Organization Not Verified

gpt-image-1 requires organization verification on platform.openai.com. Visit your OpenAI account settings to complete verification.

Invalid Size Error

Ensure you're using one of the supported sizes: 1024x1024, 1024x1536, or 1536x1024.

Agent Skills: AI Image Generation Skill

Install this agent skill to your local

Skill Files