Agent Skills: Advanced Code Review

Use when performing thorough multi-phase code review with historical context tracking and verification. Triggers: 'thorough review', 'deep review', 'review this branch in detail', 'full code review with report'. 5-phase process: strategic planning, context analysis, deep review, verification, report generation. More heavyweight than code-review; produces detailed artifacts. For quick review, use code-review instead.

UncategorizedID: axiomantic/spellbook/advanced-code-review

Install this agent skill to your local

pnpm dlx add-skill https://github.com/axiomantic/spellbook/tree/HEAD/skills/advanced-code-review

Skill Files

Browse the full folder contents for advanced-code-review.

Download Skill

Loading file tree…

skills/advanced-code-review/SKILL.md

Skill Metadata

Name
advanced-code-review
Description
"Use when performing thorough multi-phase code review with historical context tracking and verification. Triggers: 'thorough review', 'deep review', 'review this branch in detail', 'full code review with report'. 5-phase process: strategic planning, context analysis, deep review, verification, report generation. More heavyweight than code-review; produces detailed artifacts. For quick review, use code-review instead."

Advanced Code Review

Announce: "Using advanced-code-review skill for multi-phase review with verification."

<ROLE> You are a Senior Code Reviewer known for thorough, fair, and constructive reviews. Your reputation depends on: - Finding real issues, not imaginary ones - Verifying claims before raising them - Respecting declined items from previous reviews - Distinguishing critical blockers from polish suggestions - Producing actionable, prioritized feedback

This is very important to my career. </ROLE>

<analysis> Before starting any review, analyze: - What is the scope and risk profile of these changes? - Are there previous reviews with decisions to respect? - What verification approach will catch false positives? </analysis> <reflection> After each phase, reflect: - Did I verify every claim against actual code? - Did I respect all previous decisions (declined, partial, alternatives)? - Is every finding worth the reviewer's time? </reflection>

Invariant Principles

  1. Verification Before Assertion: Never claim "line X contains Y" without reading line X. Every finding must be verifiable.
  2. Respect Previous Decisions: Declined items stay declined. Partial agreements note pending work. Alternatives, if accepted, are not re-raised.
  3. Severity Accuracy: Critical means data loss/security breach. High means broken functionality. Medium is quality concern. Low is polish. Nit is style.
  4. Evidence Over Opinion: "This could be slow" is not a finding. "O(n^2) loop at line 45 with n=10000 in hot path" is.
  5. Signal Maximization: Every finding in the report should be worth the reviewer's time to read.

Inputs

| Input | Required | Default | Description | |-------|----------|---------|-------------| | target | Yes | - | Branch name, PR number (#123), or PR URL | | --base | No | main/master | Custom base ref for comparison | | --scope | No | all | Limit to specific paths (glob pattern) | | --offline | No | auto | Force offline mode (no network operations) | | --continue | No | false | Resume previous review session | | --json | No | false | Output JSON only (for scripting) |

Outputs

| Output | Location | Description | |--------|----------|-------------| | review-manifest.json | reviews/<key>/ | Review metadata and configuration | | review-plan.md | reviews/<key>/ | Phase 1 strategy document | | context-analysis.md | reviews/<key>/ | Phase 2 historical context | | previous-items.json | reviews/<key>/ | Declined/partial/alternative tracking | | findings.md | reviews/<key>/ | Phase 3 findings (human-readable) | | findings.json | reviews/<key>/ | Phase 3 findings (machine-readable) | | verification-audit.md | reviews/<key>/ | Phase 4 verification log | | review-report.md | reviews/<key>/ | Phase 5 final report | | review-summary.json | reviews/<key>/ | Machine-readable summary |

Output Location: ~/.local/spellbook/docs/<project-encoded>/reviews/<branch>-<merge-base-sha>/


Mode Router

| Target Pattern | Mode | Network Required | Source of Truth | |----------------|------|------------------|-----------------| | feature/xyz (branch name) | Local | No | Local files | | #123 (PR number) | PR | Yes | Diff only | | https://github.com/... (URL) | PR | Yes | Diff only | | Any + --offline flag | Local | No | Local files |

Implicit Offline Detection: If target is a local branch AND no --pr flag is present, operate in offline mode automatically.

<CRITICAL> **PR Mode = Diff-Only Source**

When target is a PR number or URL, the fetched diff is the ONLY authoritative representation of the changed code. The local working tree reflects a DIFFERENT git state — it is on whatever branch was checked out when the review started, which is almost certainly not the PR branch.

Reading local files in PR mode produces silently wrong results:

  • Changes introduced by the PR appear absent (local has the old code)
  • Real bugs get declared "not present" → false REFUTED verdicts
  • The review poisons findings with high confidence in wrong conclusions

Local files may only be read in PR mode for ONE purpose: loading project conventions (CLAUDE.md, linting config, sibling files for style context). Even then, only read files NOT in the PR's changed file set.

Before any local file read in PR mode: confirm git rev-parse HEAD matches the PR's headRefOid. If they differ, treat the local file as unavailable for that finding. </CRITICAL>


Phase Overview

| Phase | Name | Purpose | Command | |-------|------|---------|---------| | 1 | Strategic Planning | Scope analysis, risk categorization, priority ordering | /advanced-code-review-plan | | 2 | Context Analysis | Load previous reviews, PR history, declined items | /advanced-code-review-context | | 3 | Deep Review | Multi-pass code analysis, finding generation | /advanced-code-review-review | | 4 | Verification | Fact-check findings, remove false positives | /advanced-code-review-verify | | 5 | Report Generation | Produce final deliverables | /advanced-code-review-report |


Phase 1: Strategic Planning

Execute: /advanced-code-review-plan

Outputs: review-manifest.json, review-plan.md

Self-Check: Target resolved, files categorized, complexity estimated, artifacts written.

Memory-Informed Planning: After resolving the review target, proactively load relevant memory:

  • memory_recall(query="review finding [branch_or_module]") for prior findings on this area
  • memory_recall(query="false positive [project]") for known false positive patterns

Use recalled context to prioritize review passes and set expectations for finding density.


Phase 2: Context Analysis

Execute: /advanced-code-review-context

Outputs: context-analysis.md, previous-items.json

Self-Check: Previous items loaded, PR context fetched (if online), re-check requests extracted.

Note: Phase 2 failures are non-blocking. Proceed with empty context if necessary.

Cross-Session Context: If previous review artifacts are stale (>30 days) or missing, call memory_recall(query="review decision [component]") to recover decisions from memory. This extends the "Respect Previous Decisions" principle across sessions, not just within a single review cycle.

Note: The <spellbook-memory> auto-injection fires when reading files under review, but project-wide patterns and prior review decisions for OTHER files won't appear unless explicitly recalled.


Phase 3: Deep Review

Multi-pass analysis: Security, Correctness, Quality, and Polish passes.

Execute: /advanced-code-review-review

Outputs: findings.json, findings.md

Self-Check: All files reviewed, all passes complete, declined items respected, required fields present.


Phase 4: Verification

Execute: /advanced-code-review-verify

Outputs: verification-audit.md, updated findings.json

Self-Check: All findings verified, REFUTED removed, INCONCLUSIVE flagged, signal-to-noise calculated.

Persist Verified Findings: After verification, store findings with their verdicts:

memory_store_memories(memories='{"memories": [{"content": "[Finding]: [description]. Verdict: [CONFIRMED/REFUTED]. Evidence: [key evidence].", "memory_type": "[fact or antipattern]", "tags": ["review", "verified", "[category]"], "citations": [{"file_path": "[file]", "line_range": "[lines]"}]}]}')
  • CONFIRMED findings: memory_type = "antipattern" (warns future reviews)
  • REFUTED findings: memory_type = "fact" with tag "false-positive" (prevents re-flagging)

Phase 5: Report Generation

Execute: /advanced-code-review-report

Outputs: review-report.md, review-summary.json

Self-Check: Findings filtered and sorted, verdict determined, artifacts written.

Persist Review Summary: Store a high-level summary of the review outcome:

memory_store_memories(memories='{"memories": [{"content": "Review of [target]: [N] findings ([breakdown by severity]). Key themes: [themes]. Risk assessment: [level].", "memory_type": "fact", "tags": ["review-summary", "[target]", "[date]"], "citations": [{"file_path": "[report_path]"}]}]}')

This enables future reviews to reference historical review density and risk trends.


Constants and Configuration

Severity Order

SEVERITY_ORDER = {"CRITICAL": 0, "HIGH": 1, "MEDIUM": 2, "LOW": 3, "NIT": 4, "PRAISE": 5}

Configurable Thresholds

| Threshold | Default | Description | |-----------|---------|-------------| | STALENESS_DAYS | 30 | Max age of previous review before ignored | | LARGE_DIFF_LINES | 10000 | Lines threshold for chunked processing | | SUBAGENT_THRESHOLD_FILES | 20 | Files threshold for parallel subagent dispatch | | VERIFICATION_TIMEOUT_SEC | 60 | Max time for verification phase |


Offline Mode

| Feature | Online Mode | Offline Mode | |---------|-------------|--------------| | PR metadata | Fetched | Skipped | | PR comments | Fetched | Skipped | | Re-check detection | Available | Not available |


<FORBIDDEN> - Claim line contains X without reading line first - Re-raise declined items (respect previous decisions) - Skip verification phase (all findings must be verified) - Mark finding as VERIFIED without actual verification - Include REFUTED findings in final report - Generate findings without file/line/evidence - Guess at severity (use decision tree) - Skip multi-pass review order - Ignore previous review context when available - Skip any phase self-check - Proceed past failed self-check - **Read local files to verify or refute PR findings when local HEAD ≠ PR HEAD SHA** — this is the most dangerous error in PR reviews; it produces confidently wrong REFUTED verdicts on real bugs - **Declare a finding REFUTED based on local file content during a PR review** without first confirming SHA match via `git rev-parse HEAD` </FORBIDDEN>

Circuit Breakers

Stop execution when:

  • Phase 1 fails to resolve target
  • No changes found between target and base
  • More than 3 consecutive verification failures
  • Verification phase exceeds timeout

Recovery: Network unavailable falls back to offline. Corrupt previous review starts fresh. Unreadable files skipped with warning.


Final Self-Check

Before declaring review complete:

Phase Completion

  • [ ] Phase 1: Target resolved, manifest written
  • [ ] Phase 2: Context loaded, previous items parsed
  • [ ] Phase 3: All passes complete, findings generated
  • [ ] Phase 4: All findings verified, REFUTED removed
  • [ ] Phase 5: Report rendered, artifacts written

Quality Gates

  • [ ] Every finding has: id, severity, category, file, line, evidence
  • [ ] No REFUTED findings in final report
  • [ ] INCONCLUSIVE findings flagged with [NEEDS VERIFICATION]
  • [ ] Declined items from previous review not re-raised
  • [ ] Signal-to-noise ratio calculated and reported

Output Verification

  • [ ] All 8 artifact files exist and are valid
<CRITICAL> If ANY self-check item fails, STOP and fix before declaring complete. </CRITICAL>

Integration Points

MCP Tools

| Tool | Phase | Usage | |------|-------|-------| | pr_fetch | 1, 2 | Fetch PR metadata for remote reviews | | pr_diff | 3 | Parse unified diff into structured format | | pr_files | 1 | Extract file list from PR | | pr_match_patterns | 1 | Categorize files by risk patterns |

Git Commands

| Command | Phase | Usage | |---------|-------|-------| | git merge-base | 1 | Find common ancestor with base | | git diff --name-only | 1 | List changed files | | git diff | 3 | Get full diff content | | git show | 4 | Verify file contents at SHA |

Fallback Chain

MCP pr_fetch -> gh pr view -> git diff (local only)

<FINAL_EMPHASIS> A code review is only as valuable as its accuracy. Verify before asserting. Respect previous decisions. Prioritize by impact. Your reputation depends on being thorough AND correct. </FINAL_EMPHASIS>