Back to tags
Tag

Agent Skills with tag: image-processing

6 skills match this tag. Use tags to discover related Agent Skills and explore similar workflows.

image

Generate and edit images using AI models via OpenRouter. Supports Nano Banana Pro (Gemini 3 Pro Image), FLUX, and other image generation models.

text-to-imageimage-processinggenerative-artai-models
flight505
flight505
0

gemini-cli

"Use Gemini CLI when processing images, PDFs, large files, needing 1M+ token context, or requiring Gemini's strong reasoning and fine-grained domain knowledge.

geminiimage-processingpdflarge-context
metrovoc
metrovoc
1

ai-multimodal

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens. | Sử dụng khi: AI, LLM, vision, embedding, phân tích hình ảnh, Gemini API.

google-geminimultimodalimage-processingaudio-processing
wollfoo
wollfoo
0

gemini-imagegen

Generate and edit images using the Gemini API (Nano Banana). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.

text-to-imageimage-to-imageimage-processinggenerative-art
gupsammy
gupsammy
0

gifgrep

Search GIF providers with CLI/TUI, download results, and extract stills/sheets.

cliterminalgifimage-processing
steipete
steipete
0

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

generative-arttext-to-imageimage-processinggemini-3-pro
steipete
steipete
0