Agent Skills with tag: ui-automation

18 skills match this tag. Use tags to discover related Agent Skills and explore similar workflows.

chrome-automation

Chrome browser automation. Use this skill when the user needs to automate browser operations, web testing, data scraping, or UI automation.

chrome-automation, browser-automation, web-testing, data-scraping
aAAaqwq
0

Playwright Browser Automation

Complete browser automation with Playwright. Auto-detects dev servers, writes clean test scripts to /tmp. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use when user wants to test websites, automate browser interactions, validate web functionality, or perform any browser-based testing.

playwright, browser-automation, web-testing, ui-automation
ovachiever
81

webapp-testing

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

playwright, web-testing, frontend-testing, ui-automation
ederheisler
0

axe

Control iOS Simulators via accessibility APIs. Use this skill when the user wants to automate iOS simulator interactions, tap buttons by accessibility label, type text, swipe, take screenshots, describe the UI accessibility tree, or test iOS apps programmatically.

ios, ui-automation, accessibility, simctl
aliceisjustplaying
11

chrome-automation

Launch and control Chrome with AT-SPI2 accessibility for browser automation. Use when asked to start chrome, open browser, launch chrome, begin browser automation, or control web pages.

chrome-automation, browser-automation, at-spi2, ui-automation
cevatkerim
2

browser-automation

Browser automation via Puppeteer MCP for JS-rendered content

browser-automation, puppeteer, javascript, nodejs
HTRamsey
3

playwright-automation

Execute complex browser automation using Playwright Python. Use for video recording, multi-page navigation, data extraction. Triggers on "browser script", "record video of website", "extract data from webpage".

playwright, python, browser-automation, ui-automation
brianclaridge
21

windows-ui-automation

Expert in Windows UI Automation (UIA) and Win32 APIs for desktop automation. Specializes in accessible, secure automation of Windows applications including element discovery, input simulation, and process interaction. HIGH-RISK skill requiring strict security controls for system access.

ui-automation, windows, win32-apis, access-control
martinholovsky
92

linux-at-spi2

Expert in AT-SPI2 (Assistive Technology Service Provider Interface) for Linux desktop automation. Specializes in accessible automation of GTK/Qt applications via D-Bus accessibility interface. HIGH-RISK skill requiring security controls for system-wide access.

linux, ui-automation, assistive-technology, dbus
martinholovsky
92

macos-accessibility

Expert in macOS Accessibility APIs (AXUIElement) for desktop automation. Specializes in secure automation of macOS applications with proper TCC permissions, element discovery, and system interaction. HIGH-RISK skill requiring strict security controls.

macos, ui-automation, access-control, accessibility-api
martinholovsky
92

web-browser

Allows interaction with web pages by performing actions such as clicking buttons, filling out forms, and navigating links. It works by remotely controlling Google Chrome or Chromium browsers using the Chrome DevTools Protocol (CDP). When Claude needs to browse the web, it can use this skill to do so.

browser-automation, ui-automation, chrome-devtools-protocol, claude-skills
MykalMachon
31

stealth-browser

Stealth browser automation with anti-bot bypass and persistent page state. Use when users need to navigate Cloudflare-protected sites, fill forms, take screenshots, extract web data, or automate browser workflows on sites that block regular automation. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "bypass cloudflare", or any browser interaction request on protected sites.

browser-automation, headless-browser, ui-automation, web-scraping
zippoxer
3

Suno Upload

Upload a Suno prompt.md file to suno.com using Chrome automation with NO human intervention required. Parses the prompt file, navigates to Suno's Create interface, fills all form fields including sliders (lyrics, style, title, weirdness, style influence, vocal gender, exclude styles), and submits for song generation. Uses proven coordinate-based slider manipulation for reliable automation. Use this skill when the user asks to "upload to Suno", "create on Suno", "generate with Suno", "submit to Suno", or wants to automatically upload a generated prompt to the Suno website.

browser-automation, chrome-automation, ui-automation, file-uploads
nwp
71

castella-mcp

Enable AI agents to introspect and control Castella UIs via MCP. Create MCP servers, expose UI resources, handle MCP tools, and use semantic IDs.

agent-tool-interface, ui-automation, network-protocols, semantic-layer
i2y
321

browser-tools

Interactive browser automation via Chrome DevTools Protocol. Use when you need to interact with web pages, test frontends, or when user interaction with a visible browser is required.

browser-automation, chrome-devtools-protocol, web-testing, ui-automation
badlogic
15611

peekaboo

Capture and automate macOS UI with the Peekaboo CLI.

cli, macos, ui-automation
steipete
2,731407

web-browser

Allows interaction with web pages by performing actions such as clicking buttons, filling out forms, and navigating links. It works by remotely controlling Google Chrome or Chromium browsers using the Chrome DevTools Protocol (CDP). When Claude needs to browse the web, it can use this skill to do so.

claude-skills, chrome-devtools-protocol, ui-automation, browser-automation
mitsuhiko
57234

web-browser

Allows interaction with web pages by performing actions such as clicking buttons, filling out forms, and navigating links. It works by remotely controlling Google Chrome or Chromium browsers using the Chrome DevTools Protocol (CDP). When Claude needs to browse the web, it can use this skill to do so.

claude-skills, chrome-devtools-protocol, ui-automation, browser-automation
mitsuhiko
57234