Back to tags
Tag

Agent Skills with tag: computer-vision

12 skills match this tag. Use tags to discover related Agent Skills and explore similar workflows.

transformers

This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification, question answering, translation, summarization, image classification, object detection, speech recognition, and fine-tuning models on custom datasets.

transformerspretrained-modelsfine-tuningnatural-language-processing
ovachiever
ovachiever
81

multimodal-looker

|

multimodalcomputer-visionimage-processingdeep-learning
bahayonghang
bahayonghang
0

image-processing

Process, transform, and analyze images using common operations

image-processingimage-analysiscomputer-vision
tatat
tatat
1

VLM

Implement vision-based AI chat capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to analyze images, describe visual content, or create applications that combine image understanding with conversational AI. Supports image URLs and base64 encoded images for multimodal interactions.

computer-visionmultimodalimage-analysisconversational-ai
UholySmokes
UholySmokes
1

computer-vision

Build computer vision solutions - image classification, object detection, and transfer learning

computer-visionimage-classificationobject-detectiontransfer-learning
pluginagentmarketplace
pluginagentmarketplace
11

computer-vision

Image processing, object detection, segmentation, and vision models. Use for image classification, object detection, or visual analysis tasks.

computer-visionimage-processingmachine-learningobject-detection
pluginagentmarketplace
pluginagentmarketplace
21

deep-learning

Neural networks, CNNs, RNNs, Transformers with TensorFlow and PyTorch. Use for image classification, NLP, sequence modeling, or complex pattern recognition.

pytorchtensorflowneural-network-architecturescomputer-vision
pluginagentmarketplace
pluginagentmarketplace
21

Fluxwing Screenshot Importer

Import UI screenshots and generate uxscii components automatically using vision analysis. Use when user wants to import, convert, or generate .uxm components from screenshots or images.

computer-visionimage-analysisui-componentsscreenshot-capture
trabian
trabian
101

Computer Vision

Implement computer vision tasks including image classification, object detection, segmentation, and pose estimation using PyTorch and TensorFlow

computer-visiondeep-learningpytorchtensorflow
aj-geddes
aj-geddes
301

ml-cv-specialist

Deep expertise in ML/CV model selection, training pipelines, and inference architecture. Use when designing machine learning systems, computer vision pipelines, or AI-powered features.

computer-visiondeep-learningml-pipelinesneural-network-architectures
alirezarezvani
alirezarezvani
4110

screenshot-feature-extractor

Analyze product screenshots to extract feature lists and generate development task checklists. Use when: (1) Analyzing competitor product screenshots for feature extraction, (2) Generating PRD/task lists from UI designs, (3) Batch analyzing multiple app screens, (4) Conducting competitive analysis from visual references.

computer-visionfeature-extractionui-analysistask-generation
notedit
notedit
9916

transformers

This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification, question answering, translation, summarization, image classification, object detection, speech recognition, and fine-tuning models on custom datasets.

transformersmachine-learningnatural-language-processingcomputer-vision
K-Dense-AI
K-Dense-AI
3,233360