mirror of https://github.com/KeygraphHQ/shannon.git synced 2026-05-16 14:29:08 +02:00

ezl-keygraph 46be49c175 chore: remove unused scan tools and dead error type (#327)

* chore: remove unused scan tools and dead error type

* chore(logs): redact base URL and target URL from preflight info logs

2026-05-04 21:51:45 +05:30

CLAUDE.md

AI-powered penetration testing agent for defensive security analysis. Automates vulnerability assessment by combining reconnaissance tools with AI-powered code analysis.

Commands

Prerequisites: Docker, AI provider credentials (.env for local, shn setup or env vars for npx)

Dual CLI

Shannon supports two CLI modes, auto-detected from how the CLI is launched:

| | npx (`npx @keygraph/shannon`) | Local (`./shannon`) |
|---|---|---|
| Install | Zero-install via npm | Clone the repo |
| Image | Pulled from Docker Hub (`keygraph/shannon:latest`) | Built locally (`shannon-worker`) |
| State | `~/.shannon/` | Project directory |
| Credentials | `~/.shannon/config.toml` (via `shn setup`) or env vars | `./.env` |
| Config | `~/.shannon/config.toml` (via `shn setup`) | N/A |
| Prompts | Bundled in Docker image | Mounted from `./apps/worker/prompts/` (live-editable) |

Mode auto-detection: local mode activates when the env var SHANNON_LOCAL=1 is set, which the ./shannon entry point does (see apps/cli/src/mode.ts). In every other case the CLI runs in npx mode.
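The detection rule above can be sketched as a small pure function. This is an illustration only: `detectMode` and `CliMode` are hypothetical names, and the real code in apps/cli/src/mode.ts may be structured differently.

```typescript
type CliMode = 'local' | 'npx';

// Hypothetical helper, not the actual export of apps/cli/src/mode.ts.
function detectMode(env: Readonly<Record<string, string | undefined>>): CliMode {
  // Local mode is opt-in: only the ./shannon entry point sets SHANNON_LOCAL=1.
  if (env['SHANNON_LOCAL'] === '1') {
    return 'local';
  }

  // Anything else (unset, empty, or another value) falls back to npx mode.
  return 'npx';
}
```

Passing the environment in as an argument, rather than reading `process.env` inside the function, keeps the rule easy to unit-test.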

npx Quick Start

# Configure credentials (interactive wizard)
npx @keygraph/shannon setup

# Or export env vars directly (non-interactive / CI)
export ANTHROPIC_API_KEY=your-key

# Run
npx @keygraph/shannon start -u <url> -r /path/to/repo

Local (Development) Quick Start

# Setup
echo "ANTHROPIC_API_KEY=your-key" > .env

# Build (auto-runs if image missing)
./shannon build

# Run
./shannon start -u <url> -r my-repo
./shannon start -u <url> -r my-repo -c ./apps/worker/configs/my-config.yaml
./shannon start -u <url> -r /any/path/to/repo

Common Commands

# Setup (npx mode only — one-time credential configuration)
npx @keygraph/shannon setup

# Workspaces & Resume
./shannon start -u <url> -r my-repo -w my-audit    # New named workspace
./shannon start -u <url> -r my-repo -w my-audit    # Resume (same command)
./shannon workspaces                                 # List all workspaces

# Monitor
./shannon logs <workspace>            # Tail workflow log
./shannon status                      # Show running workers
# Temporal Web UI: http://localhost:8233

# Stop
./shannon stop                        # Preserves workflow data
./shannon stop --clean                # Full cleanup including volumes (confirms first)

# Image management
./shannon build [--no-cache]          # Local mode: build worker image
npx @keygraph/shannon uninstall             # npx mode: remove ~/.shannon/ (confirms first)

# Build TypeScript (development)
pnpm run build                       # Build all packages via Turborepo
pnpm run check                       # Type-check all packages
pnpm biome                           # Biome lint + format + import sorting check
pnpm biome:fix                       # Auto-fix lint, format, and import sorting

Monorepo tooling: pnpm workspaces, Turborepo for task orchestration, Biome for linting/formatting. TypeScript compiler options shared via tsconfig.base.json at the root. All packages extend it, overriding only rootDir and outDir. Shared devDependencies (typescript, @types/node, turbo, @biomejs/biome) are hoisted to the root workspace.

Options: -c <file> (YAML config), -o <path> (output directory), -w <name> (named workspace; auto-resumes if exists), --pipeline-testing (minimal prompts, 10s retries), --debug (preserve worker container after exit for log inspection)

Architecture

Monorepo Layout

apps/cli/        — @keygraph/shannon (published to npm, bundled with tsdown)
apps/worker/     — @shannon/worker (private, Temporal worker + pipeline logic)

CLI Package (`apps/cli/`)

Published as @keygraph/shannon on npm. Contains only Docker orchestration logic — no Temporal SDK, business logic, or prompts. Bundled with tsdown for single-file ESM output.

apps/cli/src/index.ts — CLI dispatcher (setup, start, stop, logs, workspaces, status, build, uninstall, info)
apps/cli/src/mode.ts — Auto-detection: local mode if SHANNON_LOCAL=1 env var is set
apps/cli/src/docker.ts — Compose lifecycle, image pull/build, ephemeral docker run worker spawning
apps/cli/src/home.ts — State directory management (~/.shannon/ for npx, ./ for local)
apps/cli/src/env.ts — .env loading, TOML fallback (npx only) via apps/cli/src/config/resolver.ts, credential validation, env flag building
apps/cli/src/config/resolver.ts — Cascading config (npx only): env vars → ~/.shannon/config.toml (parsed with smol-toml)
apps/cli/src/config/writer.ts — TOML serialization and secure file persistence (0o600)
apps/cli/src/commands/setup.ts — Interactive TUI wizard (@clack/prompts) for provider credential setup (npx only)
apps/cli/src/paths.ts — Repo/config path resolution (bare name → ./repos/<name>, or any absolute/relative path)
apps/cli/src/commands/ — Command handlers
apps/cli/infra/compose.yml — Bundled Temporal compose file for npx mode
apps/cli/tsdown.config.ts — tsdown bundler config
shannon — Node.js entry point (#!/usr/bin/env node) that delegates to apps/cli/dist/index.mjs
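As an illustration of the repo path rule described for apps/cli/src/paths.ts (bare name maps to ./repos/<name>, anything else is treated as a path), here is a minimal sketch. The function name `resolveRepoArg` and the exact test for "looks like a path" are assumptions, not the project's implementation.

```typescript
// Hypothetical sketch; the actual rules live in apps/cli/src/paths.ts.
function resolveRepoArg(arg: string): string {
  // Assumption: anything with a separator or a leading dot is already a path.
  const looksLikePath = arg.includes('/') || arg.includes('\\') || arg.startsWith('.');
  if (looksLikePath) {
    return arg;
  }

  // A bare name such as "my-repo" refers to a checkout under ./repos/.
  return `./repos/${arg}`;
}
```

With this rule, `-r my-repo` and `-r ./repos/my-repo` point at the same directory.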

Docker Architecture

Infra (Temporal) runs via docker-compose.yml. Workers are ephemeral docker run --rm containers, one per scan, each with a unique task queue and isolated volume mounts.

docker-compose.yml — Infra only: shannon-temporal (port 7233/8233). Network: shannon-net
Dockerfile — 2-stage build (builder + Chainguard Wolfi runtime). Uses pnpm. Entrypoint: CMD ["node", "apps/worker/dist/temporal/worker.js"]
No docker-compose.docker.yml — host gateway handled via --add-host flag in CLI
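A rough sketch of how the argument list for one ephemeral worker might be assembled. The `--rm` flag, the `shannon-net` network, the `shannon-worker-` name prefix, and the `--add-host` flag come from this document; the `TASK_QUEUE` variable name, the `/repo` mount target, and the queue naming scheme are assumptions made for illustration.

```typescript
interface WorkerSpawnOptions {
  readonly scanId: string;
  readonly image: string;
  readonly repoPath: string;
}

// Hypothetical helper; apps/cli/src/docker.ts may build its arguments differently.
function buildWorkerRunArgs(options: WorkerSpawnOptions): string[] {
  // One task queue per scan, so activities only reach the worker with this repo mounted.
  const taskQueue = `shannon-${options.scanId}`; // assumed naming scheme

  return [
    'run',
    '--rm',
    '--name',
    `shannon-worker-${options.scanId}`,
    '--network',
    'shannon-net',
    '--add-host',
    'host.docker.internal:host-gateway',
    '-v',
    `${options.repoPath}:/repo`, // assumed mount target
    '-e',
    `TASK_QUEUE=${taskQueue}`, // assumed variable name
    options.image,
  ];
}
```

The resulting array is meant to be handed to a process spawner as the arguments of `docker`, which avoids shell quoting issues with paths.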

Worker Package (`apps/worker/`)

apps/worker/src/paths.ts — Centralized path constants (PROMPTS_DIR, CONFIGS_DIR, WORKSPACES_DIR)
apps/worker/src/session-manager.ts — Agent definitions (AGENTS record). Agent types in apps/worker/src/types/agents.ts
apps/worker/src/config-parser.ts — YAML config parsing with JSON Schema validation
apps/worker/src/ai/claude-executor.ts — Claude Agent SDK integration with retry logic
apps/worker/src/services/ — Business logic layer (Temporal-agnostic). Activities delegate here. Key: agent-execution.ts, error-handling.ts, container.ts
apps/worker/src/types/ — Consolidated types: Result<T,E>, ErrorCode, AgentName, ActivityLogger, etc.
apps/worker/src/utils/ — Shared utilities (file I/O, formatting, concurrency)

Temporal Orchestration

Durable workflow orchestration with crash recovery, queryable progress, automatic retries driven by error classification, and parallel execution (5 concurrent agents in the vulnerability analysis and exploitation phases).

apps/worker/src/temporal/workflows.ts — Main workflow (pentestPipelineWorkflow)
apps/worker/src/temporal/activities.ts — Thin wrappers — heartbeat loop, error classification, container lifecycle. Business logic delegated to apps/worker/src/services/
apps/worker/src/temporal/activity-logger.ts — TemporalActivityLogger implementation of ActivityLogger interface
apps/worker/src/temporal/summary-mapper.ts — Maps PipelineSummary to WorkflowSummary
apps/worker/src/temporal/worker.ts — Combined worker + client entry point (per-invocation task queue, submits workflow, waits for result)
apps/worker/src/temporal/shared.ts — Types, interfaces, query definitions

Five-Phase Pipeline

Pre-Recon (pre-recon) — Source code analysis to build the architectural baseline
Recon (recon) — Attack surface mapping from initial findings
Vulnerability Analysis (5 parallel agents) — injection, xss, auth, authz, ssrf
Exploitation (5 parallel agents, conditional) — Exploits confirmed vulnerabilities
Reporting (report) — Executive-level security report
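The phase ordering above can be pictured as a list of stages, where agents inside one stage run in parallel and the exploitation stage appears only for classes with confirmed vulnerabilities. This is a simplified sketch: `planPipeline`, `Stage`, and the `-vuln` / `-exploit` agent names are hypothetical, and the real workflow in apps/worker/src/temporal/workflows.ts decides this at run time.

```typescript
const VULN_CLASSES = ['injection', 'xss', 'auth', 'authz', 'ssrf'] as const;
type VulnClass = (typeof VULN_CLASSES)[number];

interface Stage {
  readonly phase: string;
  // Agents listed in the same stage are assumed to run in parallel.
  readonly agents: readonly string[];
}

// Hypothetical planner for illustration only.
function planPipeline(confirmed: ReadonlySet<VulnClass>): Stage[] {
  // 1. Fixed sequential prefix, then the parallel analysis stage
  const stages: Stage[] = [
    { phase: 'pre-recon', agents: ['pre-recon'] },
    { phase: 'recon', agents: ['recon'] },
    { phase: 'vulnerability-analysis', agents: VULN_CLASSES.map((name) => `${name}-vuln`) },
  ];

  // 2. Exploitation is conditional: only classes with confirmed findings get an agent
  const exploitAgents = VULN_CLASSES.filter((name) => confirmed.has(name)).map((name) => `${name}-exploit`);
  if (exploitAgents.length > 0) {
    stages.push({ phase: 'exploitation', agents: exploitAgents });
  }

  // 3. Reporting always runs last
  stages.push({ phase: 'reporting', agents: ['report'] });
  return stages;
}
```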

Supporting Systems

Configuration — YAML configs in apps/worker/configs/ with JSON Schema validation (config-schema.json). Supports auth settings (MFA/TOTP), URL/code rule scoping (rules.avoid/rules.focus), run-scope steering (vuln_classes, exploit), free-form rules_of_engagement, and post-hoc report filters (min_severity, min_confidence, guidance). code_path avoid rules are written into ~/.claude/settings.json permissions.deny (Read/Edit) once per workflow by apps/worker/src/temporal/activities.ts:syncCodePathDenyRules so the SDK enforces them at the tool layer even in bypassPermissions mode. vuln_classes/exploit scope is locked into session.json on first run; resumes with a different scope fail fast (persistOrValidateRunScope). Credential resolution — local mode: env vars → ./.env; npx mode: env vars → ~/.shannon/config.toml (via shn setup)
Prompts — Per-phase templates in apps/worker/prompts/ with variable substitution ({{TARGET_URL}}, {{CONFIG_CONTEXT}}). Shared partials in apps/worker/prompts/shared/ via apps/worker/src/services/prompt-manager.ts, including _code-path-rules.txt (focus/avoid [FILE]/[GLOB] routing) and _rules-of-engagement.txt (free-text engagement rules). When exploit: false, apps/worker/src/services/findings-renderer.ts deterministically converts each *_exploitation_queue.json into a *_findings.md for report assembly — no LLM in the loop
SDK Integration — Uses @anthropic-ai/claude-agent-sdk with maxTurns: 10_000 and bypassPermissions mode. Adaptive thinking is enabled by default on Opus 4.6/4.7 (supportsAdaptiveThinking in apps/worker/src/ai/models.ts); disable per-scan via CLAUDE_ADAPTIVE_THINKING=false (env) or core.adaptive_thinking = false (npx TOML). Browser automation via playwright-cli with session isolation (-s=<session>). TOTP generation via generate-totp CLI tool. Login flow template at apps/worker/prompts/shared/login-instructions.txt supports form, SSO, API, and basic auth
Audit System — Crash-safe append-only logging in workspaces/{hostname}_{sessionId}/. Tracks session metrics, per-agent logs, prompts, and deliverables. WorkflowLogger (apps/worker/src/audit/workflow-logger.ts) provides unified human-readable per-workflow logs, backed by LogStream (apps/worker/src/audit/log-stream.ts) shared stream primitive
Deliverables — Saved to deliverables/ in the target repo via the save-deliverable CLI script (apps/worker/src/scripts/save-deliverable.ts)
Workspaces & Resume — Named workspaces via -w <name> or auto-named from URL+timestamp. Resume detects completed agents via session.json. loadResumeState() in apps/worker/src/temporal/activities.ts validates deliverable existence, restores git checkpoints, and cleans up incomplete deliverables. Workspace listing via apps/worker/src/temporal/workspaces.ts
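The npx-mode credential cascade mentioned under Configuration (env vars first, then ~/.shannon/config.toml) amounts to a two-level lookup. A minimal sketch, assuming the TOML file has already been parsed and flattened into a key/value map; `resolveSetting` and that flattened shape are hypothetical and not the API of apps/cli/src/config/resolver.ts.

```typescript
type SettingsSource = Readonly<Record<string, string | undefined>>;

// Hypothetical resolver: environment variables win, parsed TOML values are the fallback.
function resolveSetting(key: string, env: SettingsSource, tomlValues: SettingsSource): string | undefined {
  const fromEnv = env[key];
  const hasEnvValue = fromEnv !== undefined && fromEnv !== '';
  if (hasEnvValue) {
    return fromEnv;
  }

  // Assumption: an empty env var is treated as unset, so the TOML value applies.
  return tomlValues[key];
}
```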

Development Notes

Adding a New Agent

Define agent in apps/worker/src/session-manager.ts (add to AGENTS record). ALL_AGENTS/AgentName types live in apps/worker/src/types/agents.ts
Create prompt template in apps/worker/prompts/ (e.g., vuln-newtype.txt)
Two-layer pattern: add a thin activity wrapper in apps/worker/src/temporal/activities.ts (heartbeat + error classification). AgentExecutionService in apps/worker/src/services/agent-execution.ts handles the agent lifecycle automatically via the AGENTS registry
Register activity in apps/worker/src/temporal/workflows.ts within the appropriate phase
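To make the first step concrete, here is one way a typed agent registry can be laid out so that the compiler flags an agent name without a definition. Every shape below (`AgentDefinition`, its fields, the agent names other than `recon`) is invented for illustration; check apps/worker/src/types/agents.ts and apps/worker/src/session-manager.ts for the real ones.

```typescript
// Hypothetical stand-ins for ALL_AGENTS / AgentName.
const ALL_AGENTS = ['recon', 'injection-vuln', 'newtype-vuln'] as const;
type AgentName = (typeof ALL_AGENTS)[number];

// Assumed shape; the real agent definition may carry different fields.
interface AgentDefinition {
  readonly promptTemplate: string;
  readonly prerequisites: readonly AgentName[];
}

// Record<AgentName, ...> makes the compiler reject a name that has no entry here.
const AGENTS: Readonly<Record<AgentName, AgentDefinition>> = {
  recon: { promptTemplate: 'recon', prerequisites: [] },
  'injection-vuln': { promptTemplate: 'vuln-injection', prerequisites: ['recon'] },
  'newtype-vuln': { promptTemplate: 'vuln-newtype', prerequisites: ['recon'] },
};

function getPromptTemplate(name: AgentName): string {
  return AGENTS[name].promptTemplate;
}
```

With this layout, adding a name to `ALL_AGENTS` without a matching `AGENTS` entry fails type-checking instead of failing at run time.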

Modifying Prompts

Variable substitution: {{TARGET_URL}}, {{CONFIG_CONTEXT}}, {{LOGIN_INSTRUCTIONS}}
Shared partials in apps/worker/prompts/shared/ included via apps/worker/src/services/prompt-manager.ts
Test with --pipeline-testing for fast iteration
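For orientation, `{{NAME}}` substitution of the kind listed above can be done with a single regular expression. This is a sketch, not the code in apps/worker/src/services/prompt-manager.ts: `renderPrompt` is a hypothetical name, and throwing on an unknown variable is an assumed policy.

```typescript
// Hypothetical renderer for {{UPPER_SNAKE_CASE}} placeholders.
function renderPrompt(template: string, variables: Readonly<Record<string, string>>): string {
  return template.replace(/\{\{([A-Z_]+)\}\}/g, (_placeholder: string, name: string): string => {
    const value = variables[name];
    if (value === undefined) {
      // Assumption: a missing variable is an error rather than an empty string.
      throw new Error(`Missing prompt variable: ${name}`);
    }
    return value;
  });
}
```

Failing loudly on a missing variable makes a typo in a template visible during a `--pipeline-testing` run instead of silently producing a prompt with a hole in it.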

Key Design Patterns

Configuration-Driven — YAML configs with JSON Schema validation
Progressive Analysis — Each phase builds on previous results
SDK-First — Claude Agent SDK handles autonomous analysis
Modular Error Handling — ErrorCode enum, Result<T,E> for explicit error propagation, automatic retry (3 attempts per agent)
Services Boundary — Activities are thin Temporal wrappers; apps/worker/src/services/ owns business logic, accepts ActivityLogger, returns Result<T,E>. No Temporal imports in services
DI Container — Per-workflow in apps/worker/src/services/container.ts. AuditSession excluded (parallel safety)
Ephemeral Workers — Each scan runs in its own docker run --rm container with a per-invocation task queue. Temporal routes activities by queue name, so per-scan queues ensure activities never land on a worker with the wrong repo mounted
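A compact sketch of the `Result<T,E>` idea combined with a bounded retry. The `Result` shape and the `retryUpTo` helper are assumptions for illustration; the project's own type lives in apps/worker/src/types/, and real retries are asynchronous and handled through Temporal rather than a synchronous loop.

```typescript
// Assumed shape of a Result type: a discriminated union on `ok`.
type Result<T, E> =
  | { readonly ok: true; readonly value: T }
  | { readonly ok: false; readonly error: E };

// Hypothetical helper: call `attempt` until it succeeds or maxAttempts is reached.
function retryUpTo<T, E>(maxAttempts: number, attempt: (attemptNumber: number) => Result<T, E>): Result<T, E> {
  let last = attempt(1);
  for (let attemptNumber = 2; attemptNumber <= maxAttempts && !last.ok; attemptNumber += 1) {
    last = attempt(attemptNumber);
  }

  // Either the first success or the error from the final attempt.
  return last;
}
```

Because failures are values rather than exceptions, the caller has to check `ok` before touching `value`, which is what makes the error path explicit.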

Security

Defensive security tool only. Use only on systems you own or have explicit permission to test.

Code Style Guidelines

Formatting

Biome handles formatting and linting. Run pnpm biome:fix to auto-fix. Config in biome.json: single quotes, semicolons, trailing commas, 2-space indent, 120 char line width.

Clarity Over Brevity

Optimize for readability, not line count — three clear lines beat one dense expression
Use descriptive names that convey intent
Prefer explicit logic over clever one-liners

Structure

Keep functions focused on a single responsibility
Use early returns and guard clauses instead of deep nesting
Never use nested ternary operators — use if/else or switch
Extract complex conditions into well-named boolean variables
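A small example of these structure rules applied together: guard clauses first, a named boolean for the one non-trivial condition, and no nested ternaries. The function and its domain are made up for illustration and do not exist in the codebase.

```typescript
type Severity = 'low' | 'medium' | 'high';

// Hypothetical example function.
function describeFinding(severity: Severity | undefined, confirmed: boolean): string {
  // Guard clauses handle the degenerate cases up front.
  if (severity === undefined) {
    return 'unrated';
  }
  if (!confirmed) {
    return `unconfirmed (${severity})`;
  }

  // A named boolean reads better than an inline comparison buried in a ternary.
  const needsImmediateAttention = severity === 'high';
  if (needsImmediateAttention) {
    return 'confirmed, fix immediately';
  }

  return `confirmed (${severity})`;
}
```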

TypeScript Conventions

Use function keyword for top-level functions (not arrow functions)
Explicit return type annotations on exported/top-level functions
Prefer readonly for data that shouldn't be mutated
exactOptionalPropertyTypes is enabled — use spread for optional props, not direct undefined assignment
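The last convention is the one that most often trips people up. With `exactOptionalPropertyTypes` on, an optional property may be absent but may not be explicitly `undefined`, so the usual fix is a conditional spread. `StartOptions` and `buildStartOptions` below are hypothetical names used only to show the pattern.

```typescript
interface StartOptions {
  readonly url: string;
  readonly workspace?: string;
}

// Hypothetical builder showing the conditional-spread pattern.
function buildStartOptions(url: string, workspace: string | undefined): StartOptions {
  // Writing `{ url, workspace }` would assign `undefined` to an optional property,
  // which exactOptionalPropertyTypes rejects. The spread omits the key instead.
  return {
    url,
    ...(workspace !== undefined ? { workspace } : {}),
  };
}
```

The result has no `workspace` key at all when none was given, so `'workspace' in options` and `options.workspace !== undefined` agree.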

Avoid

Combining multiple concerns into a single function to "save lines"
Dense callback chains when sequential logic is clearer
Sacrificing readability for DRY — some repetition is fine if clearer
Abstractions for one-time operations
Backwards-compatibility shims, deprecated wrappers, or re-exports for removed code — delete the old code, don't preserve it

Comments

Comments must be timeless — no references to this conversation, refactoring history, or the AI.

Patterns used in this codebase:

/** JSDoc */ — file headers (after license) and exported functions/interfaces
// N. Description — numbered sequential steps inside function bodies. Use when a function has 3+ distinct phases where at least one isn't immediately obvious from the code. Each step marks the start of a logical phase. Reference: AgentExecutionService.execute (steps 1-9) and injectModelIntoReport (steps 1-5)
// === Section === — high-level dividers between groups of functions in long files, or to label major branching/classification blocks (e.g., // === SPENDING CAP SAFEGUARD ===). Not for sequential steps inside function bodies — use numbered steps for that
// NOTE: / // WARNING: / // IMPORTANT: — gotchas and constraints

Never: obvious comments, conversation references ("as discussed"), history ("moved from X")
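The comment patterns above, shown together on a made-up function. `summarizeScan` is not part of the codebase; it exists only to demonstrate where each comment style goes.

```typescript
// === Summary helpers ===

/** Builds a one-line summary for a finished scan. */
function summarizeScan(agentNames: readonly string[], failedAgents: readonly string[]): string {
  // 1. Split the agents that ran into succeeded and failed groups
  const failed = new Set(failedAgents);
  const succeeded = agentNames.filter((name) => !failed.has(name));

  // 2. Compute totals
  // NOTE: failedAgents may name agents that never ran, so derive the count from agentNames
  const failedCount = agentNames.length - succeeded.length;

  // 3. Format the summary line
  return `${succeeded.length} succeeded, ${failedCount} failed`;
}
```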

Key Files

CLI: shannon (entry point), apps/cli/src/index.ts (dispatcher), apps/cli/src/docker.ts (orchestration), apps/cli/src/mode.ts (auto-detection)

Entry Points: apps/worker/src/temporal/workflows.ts, apps/worker/src/temporal/activities.ts, apps/worker/src/temporal/worker.ts

Core Logic: apps/worker/src/session-manager.ts, apps/worker/src/ai/claude-executor.ts, apps/worker/src/ai/settings-writer.ts (writes code_path deny rules to ~/.claude/settings.json), apps/worker/src/config-parser.ts, apps/worker/src/services/ (incl. preflight.ts, findings-renderer.ts, reporting.ts), apps/worker/src/audit/

Config: docker-compose.yml, apps/cli/infra/compose.yml, apps/worker/configs/, apps/worker/prompts/, tsconfig.base.json (shared compiler options), turbo.json, biome.json

CI/CD: .github/workflows/release.yml (Docker Hub push + npm publish + GitHub release, manual dispatch)

Package Installation

Package managers are configured with a minimum release age (7 days). Requires pnpm >= 10.16.0. If pnpm install fails due to a package being too new, do not attempt to bypass it — report the blocked package to the user and stop.

Troubleshooting

"Repository not found" — Pass a bare name (-r my-repo) for ./repos/my-repo, or a path (-r /path/to/repo) for any directory
"Temporal not ready" — Wait for the health check to pass, or inspect the output of docker compose logs temporal
Worker not processing — Check docker ps --filter "name=shannon-worker-"
Reset state — ./shannon stop --clean
Local apps unreachable — Use host.docker.internal instead of localhost
Container permissions — On Linux, docker commands may require sudo
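For the "Local apps unreachable" case, the rewrite a user needs to apply to a target URL looks like the sketch below. `toContainerReachableUrl` is a hypothetical helper written for this note; nothing here implies that Shannon performs this rewrite for you.

```typescript
// Hypothetical helper: swap loopback hosts for the Docker host alias.
function toContainerReachableUrl(targetUrl: string): string {
  const url = new URL(targetUrl);

  // Inside a container, localhost refers to the container itself, not the host machine.
  const isLoopback = url.hostname === 'localhost' || url.hostname === '127.0.0.1';
  if (isLoopback) {
    url.hostname = 'host.docker.internal';
  }

  return url.toString();
}
```

For example, a target of `http://localhost:3000/login` would be passed to `-u` as `http://host.docker.internal:3000/login`.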
