Agent Skills in category: verify

196 skills match this category. Browse curated collections and explore related Agent Skills.

testing

Use when running tests to validate implementations, collecting test evidence, or debugging failures. Load in TEST state. Covers unit tests (pytest/jest), API tests (curl), browser tests (Claude-in-Chrome), database verification. All results are code-verified, not LLM-judged.

unit-testing, integration-testing, end-to-end-testing, test-automation
ingpoc
5

validation

This skill should be used when validating whether a specific word or phrase is appropriate, commonly used, and correct in academic technical contexts. Use for checking terminology in research papers targeting top-tier computer science conferences.

naming-conventions, writing-feedback, review-checkpoints, publication-quality
minhuw
4

testing-patterns

Cross-language testing strategies and patterns. Triggers on: test pyramid, unit test, integration test, e2e test, TDD, BDD, test coverage, mocking strategy, test doubles, test isolation.

testing-patterns, multi-language, unit-testing, integration-testing
0xDarkMatter
3

python-pytest-patterns

pytest testing patterns for Python. Triggers on: pytest, fixture, mark, parametrize, mock, conftest, test coverage, unit test, integration test, pytest.raises.

software-testing, unit-testing, integration-testing, test-coverage
0xDarkMatter
3

propose-feature-test-plan

Create a test plan mapping EARS requirements and Critical Constraints to specific tests.

QA-planning, test-case-generation, test-coverage, requirements-traceability
cbgbt
34

fact-find

Quick lookup of specific facts about Bottlerocket, with citations.

web-search, citation-management, bottlerocket, research-assistant
cbgbt
34

testing

Write tests for JavaScript, React, PHP, and WordPress including unit tests, integration tests, and E2E tests. Use when writing tests, setting up testing frameworks, or debugging test failures.

unit-testing, integration-testing, e2e-testing, javascript
vapvarun
3

performance-testing

Performance testing guidance including load testing with k6, locust, and artillery, benchmarking strategies, profiling techniques, metrics analysis, performance budgets, and bottleneck identification. Use when setting up performance tests, analyzing system behavior under load, or optimizing application performance. Trigger keywords: performance testing, load testing, k6, locust, artillery, benchmarking, profiling, latency, throughput, performance budget, bottleneck, stress testing, scalability testing.

performance-testing, load-testing, benchmarking, profiling
cosmix
3

testing

Creates comprehensive test suites including unit tests, integration tests, and end-to-end tests. Trigger keywords: test, testing, unit test, integration test, e2e, coverage, TDD, mock, fixture.

unit-testing, integration-testing, end-to-end-testing, test-driven-development
cosmix
3

data-validation

Data validation patterns including schema validation, input sanitization, output encoding, and type coercion. Use when implementing form validation, API input validation, JSON Schema, Zod, Pydantic, sanitization, XSS prevention, or custom validators.

input-validation, schema-validation, json-schema, zod
cosmix
3

e2e-testing

End-to-end testing patterns and best practices for web applications using Playwright and Cypress. Covers Page Object Model, test fixtures, selector strategies, async handling, visual regression testing, and flaky test prevention. Use when setting up E2E tests, debugging test failures, or improving test reliability. Trigger keywords: e2e testing, end-to-end tests, Playwright, Cypress, Page Object Model, test fixtures, selectors, data-testid, async tests, visual regression, flaky tests, browser testing.

end-to-end-testing, playwright, cypress, page-object-model
cosmix
3

bloom_integrity_verification

large-language-models, checksum-validation, pgp-signature, bloom
GOATnote-Inc
31

websearch-standard

Standard multi-source verification search strategy for moderate complexity research. 2-iteration workflow with source ranking, consensus identification, and citation transparency. Use for feature comparisons, moderate complexity topics, fact-checking. Keywords: compare, differences, features, fact-check, verify, what are.

web-search, fact-checking, multi-source-verification
thomasholknielsen
41

test-generator

Generates Jest or Pytest tests following Ben's testing standards. Use when creating tests, adding test coverage, writing unit tests, mocking dependencies, or when user mentions testing, test cases, Jest, Pytest, fixtures, assertions, or coverage.

unit-testing, pytest, jest, test-case-generation
benshapyro
71

verification-gate

Enforce mandatory pre-action verification checkpoints to prevent pattern-matching from overriding explicit reasoning. Use this skill when about to execute implementation actions (Bash, Write, Edit, MultiEdit) to verify hypothesis-action alignment. Blocks execution when the hypothesis is unverified or the action targets a different system than the one the hypothesis identified. Critical for preventing cognitive dissonance where a correct diagnosis leads to the wrong implementation.

command-guard, chain-of-thought, pattern-matching, assumption-challenging
Jamie-BitFlight
111

holistic-linting

This skill should be used when the model needs to ensure code quality through comprehensive linting and formatting. It provides automatic linting workflows for orchestrators (format → lint → resolve via concurrent agents) and sub-agents (lint touched files before task completion). Prevents claiming "production ready" code without verification. Includes linting rules knowledge base for ruff, mypy, and bandit, plus the linting-root-cause-resolver agent for systematic issue resolution.

linting, static-analysis, formatting, code-quality
Jamie-BitFlight
111

funsloth-check

Validate datasets for Unsloth fine-tuning. Use when the user wants to check a dataset, analyze tokens, calculate Chinchilla optimality, or prepare data for training.

data-preprocessing, token-optimization, llm, dataset-validation
chrisvoncsefalvay
4

webapp-testing

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

playwright, browser-automation, e2e-testing, ui-testing
jwiegley
4

Page 3 of 11 · 196 results