Performance Skill | Agent Skills

Performance

Comprehensive performance optimization patterns for frontend, backend, and LLM inference.

Quick Reference

| Category | Rules | Impact | When to Use | |----------|-------|--------|-------------| | Core Web Vitals | 4 | CRITICAL | LCP, INP, CLS optimization with 2026 thresholds | | Render Optimization | 3 | HIGH | React Compiler, memoization, virtualization | | Lazy Loading | 3 | HIGH | Code splitting, route splitting, preloading | | Image Optimization | 3 | HIGH | Next.js Image, AVIF/WebP, responsive images | | Profiling & Backend | 3 | MEDIUM | React DevTools, py-spy, bundle analysis | | LLM Inference | 3 | MEDIUM | vLLM, quantization, speculative decoding | | Caching | 2 | HIGH | Redis cache-aside, prompt caching, HTTP cache headers | | Query & Data Fetching | 2 | HIGH | TanStack Query prefetching, optimistic updates, rollback | | Sustainability | 1 | MEDIUM | Page weight budgets, lazy loading, optimized formats, dark mode |

Total: 24 rules across 9 categories

Core Web Vitals

Google's Core Web Vitals with 2026 stricter thresholds.

| Rule | File | Key Pattern | |------|------|-------------| | LCP Optimization | rules/cwv-lcp.md | Preload hero, SSR, fetchpriority="high" | | INP Optimization | rules/cwv-inp.md | scheduler.yield, useTransition, requestIdleCallback | | INP Advanced | rules/cwv-inp-advanced.md | Layout thrashing, third-party scripts, rAF patterns | | CLS Prevention | rules/cwv-cls.md | Explicit dimensions, aspect-ratio, font-display |

2026 Thresholds

| Metric | Current Good | 2026 Good | |--------|--------------|-----------| | LCP | <= 2.5s | <= 2.0s | | INP | <= 200ms | <= 150ms | | CLS | <= 0.1 | <= 0.08 |

Render Optimization

React render performance patterns for React 19+.

| Rule | File | Key Pattern | |------|------|-------------| | React Compiler | rules/render-compiler.md | Auto-memoization, "Memo" badge verification | | Manual Memoization | rules/render-memo.md | useMemo/useCallback escape hatches, state colocation | | Virtualization | rules/render-virtual.md | TanStack Virtual for 100+ item lists |

Lazy Loading

Code splitting and lazy loading with React.lazy and Suspense.

| Rule | File | Key Pattern | |------|------|-------------| | React.lazy + Suspense | rules/loading-lazy.md | Component lazy loading, error boundaries | | Route Splitting | rules/loading-splitting.md | React Router 7.x, Vite manual chunks | | Preloading | rules/loading-preload.md | Prefetch on hover, modulepreload hints |

Image Optimization

Production image optimization for modern web applications.

| Rule | File | Key Pattern | |------|------|-------------| | Next.js Image | rules/images-nextjs.md | Image component, priority, blur placeholder | | Format Selection | rules/images-formats.md | AVIF/WebP, quality 75-85, picture element | | Responsive Images | rules/images-responsive.md | sizes prop, art direction, CDN loaders |

Profiling & Backend

Profiling tools and backend optimization patterns.

| Rule | File | Key Pattern | |------|------|-------------| | React Profiling | rules/profiling-react.md | DevTools Profiler, flamegraph, render counts | | Backend Profiling | rules/profiling-backend.md | py-spy, cProfile, memory_profiler, flame graphs | | Bundle Analysis | rules/profiling-bundle.md | vite-bundle-visualizer, tree shaking, performance budgets |

LLM Inference

High-performance LLM inference with vLLM, quantization, and speculative decoding.

| Rule | File | Key Pattern | |------|------|-------------| | vLLM Deployment | rules/inference-vllm.md | PagedAttention, continuous batching, tensor parallelism | | Quantization | rules/inference-quantization.md | AWQ, GPTQ, FP8, INT8 method selection | | Speculative Decoding | rules/inference-speculative.md | N-gram, draft model, 1.5-2.5x throughput |

Caching

Backend Redis caching and LLM prompt caching for cost savings and performance.

| Rule | File | Key Pattern | |------|------|-------------| | Redis & Backend | rules/caching-redis.md | Cache-aside, write-through, invalidation, stampede prevention | | HTTP & Prompt | rules/caching-http.md | HTTP cache headers, LLM prompt caching, semantic caching |

Query & Data Fetching

TanStack Query v5 patterns for prefetching and optimistic updates.

| Rule | File | Key Pattern | |------|------|-------------| | Prefetching | rules/query-prefetching.md | Hover prefetch, route loaders, queryOptions, Suspense | | Optimistic Updates | rules/query-optimistic.md | Optimistic mutations, rollback, cache invalidation |

Sustainability

Digital sustainability patterns for reducing carbon footprint and energy usage.

| Rule | File | Key Pattern | |------|------|-------------| | Sustainability UX | rules/sustainability-ux.md | Page weight budgets, AVIF/WebP, lazy loading, dark mode |

Local Profiling Target

When profiling a local app (Lighthouse, Core Web Vitals, bundle analysis), use Portless named URLs for stable, self-documenting targets:

# Discover services
portless list
# app → app.localhost:1355 (port 3000)

# Profile with agent-browser (preferred for visual metrics)
agent-browser open "http://app.localhost:1355"
agent-browser profiler start
agent-browser wait --load networkidle
agent-browser profiler stop /tmp/profile.json

# Lighthouse via agent-browser
agent-browser open "http://app.localhost:1355"
agent-browser screenshot /tmp/perf-baseline.png

# Or direct Lighthouse CLI
npx lighthouse http://app.localhost:1355 --output=json --output-path=/tmp/lighthouse.json

Named URLs are stable across restarts and self-documenting in performance reports. Install Portless with npm i -g portless.

Quick Start Example

// LCP: Priority hero image with SSR
import Image from 'next/image';

export default async function Page() {
  const data = await fetchHeroData();
  return (
    <Image
      src={data.heroImage}
      alt="Hero"
      priority
      placeholder="blur"
      sizes="100vw"
      fill
    />
  );
}

Key Decisions

| Decision | Recommendation | |----------|----------------| | Memoization | Let React Compiler handle it (2026 default) | | Lists 100+ items | Use TanStack Virtual | | Image format | AVIF with WebP fallback (30-50% smaller) | | LCP content | SSR/SSG, never client-side fetch | | Code splitting | Per-route for most apps, per-component for heavy widgets | | Prefetch strategy | On hover for nav links, viewport for content | | Quantization | AWQ for 4-bit, FP8 for H100/H200 | | Bundle budget | Hard fail in CI to prevent regression |

Common Mistakes

Client-side fetching LCP content (delays render)
Images without explicit dimensions (causes CLS)
Lazy loading LCP images (delays largest paint)
Heavy computation in event handlers (blocks INP)
Layout-shifting animations (use transform instead)
Lazy loading tiny components < 5KB (overhead > savings)
Missing error boundaries on lazy components
Using GPTQ without calibration data
Not benchmarking actual workload patterns
Only measuring in lab environment (need RUM)

Related Skills

ork:react-server-components-framework - Server-first rendering
ork:vite-advanced - Build optimization
browser-tools - Visual profiling with agent-browser + Portless
caching - Cache strategies for responses
ork:monitoring-observability - Production monitoring and alerting
ork:database-patterns - Query and index optimization
ork:llm-integration - Local inference with Ollama

Capability Details

lcp-optimization

Keywords: LCP, largest-contentful-paint, hero, preload, priority, SSR Solves:

Optimize hero image loading
Server-render critical content
Preload and prioritize LCP resources

inp-optimization

Keywords: INP, interaction, responsiveness, long-task, transition, yield Solves:

Break up long tasks with scheduler.yield
Defer non-urgent updates with useTransition
Optimize event handler performance

cls-prevention

Keywords: CLS, layout-shift, dimensions, aspect-ratio, font-display Solves:

Reserve space for dynamic content
Prevent font flash and image pop-in
Use transform for animations

react-compiler

Keywords: react-compiler, auto-memo, memoization, React 19 Solves:

Enable automatic memoization
Identify when manual memoization needed
Verify compiler is working

virtualization

Keywords: virtual, TanStack, large-list, scroll, overscan Solves:

Render 100+ item lists efficiently
Dynamic height virtualization
Window scrolling patterns

lazy-loading

Keywords: React.lazy, Suspense, code-splitting, dynamic-import Solves:

Route-based code splitting
Component lazy loading with error boundaries
Prefetch on hover and viewport

image-optimization

Keywords: next/image, AVIF, WebP, responsive, blur-placeholder Solves:

Next.js Image component patterns
Format selection and quality settings
Responsive sizing and CDN configuration

profiling

Keywords: profiler, flame-graph, py-spy, DevTools, bundle-analyzer Solves:

Profile React renders and backend code
Generate and interpret flame graphs
Analyze and optimize bundle size

inp-advanced

Keywords: INP, scheduler-yield, layout-thrashing, third-party-scripts, requestAnimationFrame Solves:

Break long tasks with scheduler.yield()
Audit and defer blocking third-party scripts
Avoid synchronous layout thrashing in event handlers
Optimize form submissions, dropdowns, accordions, filters

sustainability

Keywords: sustainability, carbon-footprint, page-weight, green-ux, dark-mode, lazy-loading Solves:

Enforce page weight budgets (< 1MB)
Eliminate auto-playing videos and heavy decorative animations
Serve optimized image formats (AVIF/WebP)
Implement cursor-based pagination to prevent over-fetching

llm-inference

Keywords: vllm, quantization, speculative-decoding, inference, throughput Solves:

Deploy LLMs with vLLM for production
Choose quantization method for hardware
Accelerate generation with speculative decoding

References

Load on demand with Read("${CLAUDE_SKILL_DIR}/references/<file>"): | File | Content | |------|---------| | rum-setup.md | Real User Monitoring | | react-compiler-migration.md | Compiler adoption | | tanstack-virtual-patterns.md | Virtualization patterns | | vllm-deployment.md | Production vLLM config | | quantization-guide.md | Method comparison | | cdn-setup.md | Image CDN configuration | | cc-prompt-cache-guide.md | CC 2.1.72 prompt cache optimization, stable-first prompt structure |

Agent Skills: Performance

Install this agent skill to your local

Skill Files