Back to authors
benchflow-ai

benchflow-ai

440 Skills published on GitHub.

auto-skill-generator

>

UncategorizedView skill →

harbor

Harbor framework for agent evaluation. Use when: (1) Running harbor commands (harbor run, harbor tasks check), (2) Creating/validating SkillsBench tasks, (3) Understanding task format or debugging failures.

UncategorizedView skill →

skill-creator

Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.

UncategorizedView skill →

skillsbench

SkillsBench contribution workflow. Use when: (1) Creating benchmark tasks, (2) Understanding repo structure, (3) Preparing PRs for task submission.

UncategorizedView skill →

mesh-analysis

Analyzes 3D mesh files (STL) to calculate geometric properties (volume, components) and extract attribute data. Use this skill to process noisy 3D scan data and filter debris.

UncategorizedView skill →

civ6lib

Civilization 6 district mechanics library. Use when working with district placement validation, adjacency bonus calculations, or understanding Civ6 game rules.

UncategorizedView skill →

hex-grid-spatial

Hexagonal grid spatial operations for Civilization 6 map analysis.

UncategorizedView skill →

map-optimization-strategy

Strategy for solving constraint optimization problems on spatial maps. Use when you need to place items on a grid/map to maximize some objective while satisfying constraints.

UncategorizedView skill →

sqlite-map-parser

Parse SQLite databases into structured JSON data. Use when exploring unknown database schemas, understanding table relationships, and extracting map data as JSON.

UncategorizedView skill →

pcap-analysis

Network packet capture analysis for security investigations and threat detection.

UncategorizedView skill →

threat-detection

Exact detection thresholds for identifying malicious network patterns including port scans, DoS attacks, and beaconing behavior.

UncategorizedView skill →

gamma-phase-associator

GaMMA earthquake phase association algorithm for clustering seismic phase picks into earthquake events.

UncategorizedView skill →

obspy-data-api

An overview of the core data API of ObsPy, a Python framework for processing seismological data. It is useful for parsing common seismological file formats, or manipulating custom data into standard objects for downstream use cases such as ObsPy's signal processing routines or SeisBench's modeling API.

UncategorizedView skill →

seisbench-model-api

An overview of the core model API of SeisBench, a Python framework for training and applying machine learning algorithms to seismic data. It is useful for annotating waveforms using pretrained SOTA ML models, for tasks like phase picking, earthquake detection, waveform denoising and depth estimation. For any waveform, you can manipulate it into an obspy stream object and it will work seamlessly with seisbench models.

UncategorizedView skill →

seismic-picker-selection

This is a summary the advantages and disadvantages of earthquake event detection and phase picking methods, shared by leading seismology researchers at the 2025 Earthquake Catalog Workshop. Use it when you have a seismic phase picking task at hand.

UncategorizedView skill →

dc-power-flow

DC power flow analysis for power systems. Use when computing power flows using DC approximation, building susceptance matrices, calculating line flows and loading percentages, or performing sensitivity analysis on transmission networks.

UncategorizedView skill →

economic-dispatch

Generator economic dispatch and cost optimization for power systems. Use when minimizing generation costs, computing optimal generator setpoints, calculating operating margins, or working with generator cost functions.

UncategorizedView skill →

locational-marginal-prices

Extract LMPs from DC-OPF solutions using dual values for nodal electricity pricing.

UncategorizedView skill →

power-flow-data

Power system network data formats and topology. Use when parsing bus, generator, and branch data for power flow analysis.

UncategorizedView skill →

enterprise-artifact-search

Multi-hop evidence search + structured extraction over enterprise artifact datasets (docs/chats/meetings/PRs/URLs). Strong disambiguation to prevent cross-product leakage; returns JSON-ready entities plus evidence pointers.

UncategorizedView skill →

pdf

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

UncategorizedView skill →

xlsx

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas

UncategorizedView skill →

jackson-security

Security considerations for Jackson JSON deserialization in Java applications. Covers timing of validation, raw input interception, and common deserialization attack patterns.

UncategorizedView skill →

senior-java

World-class Java and Spring Boot development skill for enterprise applications, microservices, and cloud-native systems. Expertise in Spring Framework, Spring Boot 3.x, Spring Cloud, JPA/Hibernate, and reactive programming with WebFlux. Includes project scaffolding, dependency management, security implementation, and performance optimization.

UncategorizedView skill →

erlang-concurrency

Use when erlang's concurrency model including lightweight processes, message passing, process links and monitors, error handling patterns, selective receive, and building massively concurrent systems on the BEAM VM.

UncategorizedView skill →

erlang-distribution

Use when erlang distributed systems including node connectivity, distributed processes, global name registration, distributed supervision, network partitions, and building fault-tolerant multi-node applications on the BEAM VM.

UncategorizedView skill →

erlang-otp-behaviors

Use when oTP behaviors including gen_server for stateful processes, gen_statem for state machines, supervisors for fault tolerance, gen_event for event handling, and building robust, production-ready Erlang applications with proven patterns.

UncategorizedView skill →

find-bugs

Find bugs, security vulnerabilities, and code quality issues in local branch changes. Use when asked to review changes, find bugs, security review, or audit code on the current branch.

UncategorizedView skill →

senior-security

Comprehensive security engineering skill for application security, penetration testing, security architecture, and compliance auditing. Includes security assessment tools, threat modeling, crypto implementation, and security automation. Use when designing security architecture, conducting penetration tests, implementing cryptography, or performing security audits.

UncategorizedView skill →

ssh-penetration-testing

SSH security assessment methodology including enumeration, credential attacks, and vulnerability exploitation.

UncategorizedView skill →

browser-testing

VERIFY your changes work. Measure CLS, detect theme flicker, test visual stability, check performance. Use BEFORE and AFTER making changes to confirm fixes. Includes ready-to-run scripts: measure-cls.ts, detect-flicker.ts

UncategorizedView skill →

web-interface-guidelines

Vercel's comprehensive UI guidelines for building accessible, performant web interfaces. Use this skill when reviewing or building UI components for compliance with best practices around accessibility, performance, animations, and visual stability.

UncategorizedView skill →

senior-data-engineer

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, real-time streaming, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, Flink, Kinesis, and modern data stack. Includes data modeling, pipeline orchestration, data quality, streaming quality monitoring, and DataOps. Use when designing data architectures, building batch or streaming data pipelines, optimizing data workflows, or implementing data governance.

UncategorizedView skill →

glm-basics

General Lake Model (GLM) basics for lake thermal simulation.

UncategorizedView skill →

glm-calibration

Calibrate GLM parameters for water temperature simulation. Use when you need to adjust model parameters to minimize RMSE between simulated and observed temperatures.

UncategorizedView skill →

glm-output

Read and process GLM output files. Use when you need to extract temperature data from NetCDF output, convert depth coordinates, or calculate RMSE against observations.

UncategorizedView skill →

fuzzy-match

A toolkit for fuzzy string matching and data reconciliation. Useful for matching entity names (companies, people) across different datasets where spelling variations, typos, or formatting differences exist.

UncategorizedView skill →

image-ocr

Extract text content from images using Tesseract OCR via Python

UncategorizedView skill →

openai-vision

Analyze images and multi-frame sequences using OpenAI GPT vision models

UncategorizedView skill →

video-frame-extraction

Extract frames from video files and save them as images using OpenCV

UncategorizedView skill →

mhc-algorithm

mHC (Manifold-Constrained Hyper-Connections) for stabilizing deep network training with doubly stochastic matrices.

UncategorizedView skill →

modal-gpu

Run Python code on cloud GPUs using Modal serverless platform. Use when you need A100/T4/A10G GPU access for training ML models. Covers Modal app setup, GPU selection, data downloading inside functions, and result handling.

UncategorizedView skill →

nanogpt-training

Train GPT-2 scale models (~124M parameters) efficiently on a single GPU. Covers GPT-124M architecture, tokenized dataset loading (e.g., HuggingFace Hub shards), modern optimizers (Muon, AdamW), mixed precision training, and training loop implementation.

UncategorizedView skill →

gemini-count-in-video

Analyze and count objects in videos using Google Gemini API (object counting, pedestrian detection, vehicle tracking, and surveillance video analysis).

UncategorizedView skill →

gemini-video-understanding

Analyze videos with Google Gemini API (summaries, Q&A, transcription with timestamps + visual context, scene/timeline detection, video clipping, FPS control, multi-video comparison, and YouTube URL analysis).

UncategorizedView skill →

gpt-multimodal

Analyze images and multi-frame sequences using OpenAI GPT series

UncategorizedView skill →

13f-analyzer

Perform various data analysis on SEC 13-F and obtain some insights of fund activities such as number of holdings, AUM, and change of holdings between two quarters.

UncategorizedView skill →

fuzzy-name-search

This skill includes search capability in 13F, such as fuzzy search a fund information using possibly inaccurate name, or fuzzy search a stock cusip info using its name.

UncategorizedView skill →

obspy-datacenter-client

ObsPy allows you to easily download earthquake data from various data centers/networks through its client API. This is an overview of available services and a demo of the client API for International Federation of Digital Seismograph Networks (FDSN). Use this if you need to download earthquake research data.

UncategorizedView skill →

nlp-research-repo-package-installment

Align Python version and repo-declared dependencies (requirements.txt / environment.yml) before installing packages for NLP research code reproduction.

UncategorizedView skill →

Page 1 of 9 · 440 results