AI Image Generation Skill
Generate high-quality AI images using OpenAI's gpt-image-1 model with customizable styles and themes.
When to Use This Skill
Use this skill when the user wants to:
- Generate images from text descriptions
- Create artwork with specific artistic styles
- Generate images with particular aspect ratios (vertical, horizontal, square)
- Apply themed visual styles (Studio Ghibli, futuristic, Pixar, oil painting, Chinese painting)
Instructions
- Check for API Key: Verify that the OPENAI_API_KEY environment variable is set
- Gather Requirements: Ask the user for:
- Image prompt (required)
- Style/aspect ratio: vertical (1024x1536), horizontal (1536x1024), or square (1024x1024)
- Theme: ghibli, futuristic, pixar, oil-paint, or chinese-paint (optional)
- Output location (optional, defaults to ./generated_image.png)
- Run the CLI: Execute the main.py script with the appropriate parameters
- Report Results: Show the user where the image was saved and any relevant details
Available Options
Aspect Ratios (--style)
vertical: 1024x1536 pixels (portrait orientation)horizontal: 1536x1024 pixels (landscape orientation)square: 1024x1024 pixels (default)
Artistic Themes (--theme)
ghibli: Studio Ghibli animation style with whimsical, dreamlike aestheticsfuturistic: Sci-fi style with sleek designs and neon lightspixar: Vibrant 3D animation style with expressive charactersoil-paint: Classical oil painting with rich textures and brushstrokeschinese-paint: Traditional Chinese ink painting with delicate brushwork
Usage Examples
Basic Usage
uv run main.py --prompt "a cat sitting on a tree"
With Style and Theme
uv run main.py --prompt "a sunset over mountains" --style horizontal --theme oil-paint --output ./sunset.png
Futuristic Portrait
uv run main.py --prompt "a robot in a city" --style vertical --theme futuristic --output ./robot.png
Studio Ghibli Landscape
uv run main.py --prompt "a magical forest with spirits" --style horizontal --theme ghibli --output ./forest.png
Setup Requirements
This skill requires an OpenAI API key with access to the gpt-image-1 model:
export OPENAI_API_KEY='your-api-key-here'
Note: Using gpt-image-1 requires organization verification on the OpenAI platform.
Technical Details
- Model: OpenAI gpt-image-1 (released April 2025)
- Response Format: Base64 encoded images (b64_json)
- Supported Sizes: 1024x1024, 1024x1536, 1536x1024
- Maximum Resolution: Up to 4096x4096 pixels
- Dependencies: openai>=2.7.1
Pricing Information
Usage is priced per token:
- Text tokens: $5 per million
- Image input tokens: $10 per million
- Image output tokens: $40 per million
Approximate costs per generated image:
- Low quality square: ~$0.02
- Medium quality square: ~$0.07
- High quality square: ~$0.19
Troubleshooting
API Key Not Set
If you see "Error: OPENAI_API_KEY environment variable not set", ensure your API key is exported in your shell session.
Organization Not Verified
gpt-image-1 requires organization verification on platform.openai.com. Visit your OpenAI account settings to complete verification.
Invalid Size Error
Ensure you're using one of the supported sizes: 1024x1024, 1024x1536, or 1536x1024.