shannon

mirror of https://github.com/KeygraphHQ/shannon.git synced 2026-04-01 02:10:55 +02:00

Author	SHA1	Message	Date
george-keygraph	0b35e2bd1d	Update README.md	2026-03-06 16:34:45 -08:00
george-keygraph	4c6750541b	Update README.md	2026-03-06 11:38:53 -08:00
george-keygraph	595b2ada78	Update README.md	2026-03-06 11:36:43 -08:00
george-keygraph	03377de469	Update README.md	2026-03-05 16:47:03 -08:00
george-keygraph	477ccd71aa	Update README.md	2026-03-05 16:45:08 -08:00
keygraphVarun	53bb10c450	Update README.md	2026-03-04 18:39:05 -08:00
keygraphVarun	e69ce6f51e	Update README.md	2026-03-04 18:17:46 -08:00
keygraphVarun	9b0e64944b	Update README.md cleanup	2026-03-04 13:57:28 -08:00
keygraphVarun	57d1141f4a	Update README.md	2026-03-04 13:38:43 -08:00
keygraphVarun	1aafc0c3d0	Update README.md update readme	2026-03-04 13:08:18 -08:00
ezl-keygraph	6a76df2f4c	feat: add Google Vertex AI support with service account auth	2026-03-03 02:42:46 +05:30
ezl-keygraph	b62abfea4c	feat: add three-tier model system with Bedrock support Introduce small/medium/large model tiers so agents use the appropriate model for their task complexity. Pre-recon uses Opus (large) for deep source code analysis, most agents use Sonnet (medium), and report uses Haiku (small) for summarization. - Add src/ai/models.ts with ModelTier type and resolveModel() - Add modelTier field to AgentDefinition - Refactor claude-executor env var passthrough into loop - Add Bedrock credential validation in preflight and CLI - Pass through Bedrock and model env vars in docker-compose	2026-03-03 01:08:26 +05:30
ajmallesh	d67c07dc55	feat: add configurable pipeline retry and concurrency settings (#157) - Add `pipeline` config section with `retry_preset` and `max_concurrent_pipelines` options - Add `subscription` retry preset with extended 6h max interval for Anthropic rate limit windows - Replace Promise.allSettled with concurrency-limited runner for vuln/exploit pipelines - Wire pipeline config through client, shared types, and workflow activity proxy selection	2026-02-24 09:31:33 -08:00
ajmallesh	17d12be2ab	chore: update README banner image	2026-02-24 09:11:50 -08:00
ezl-keygraph	1e3f709423	docs: add WSL2 setup guide for Windows users	2026-02-17 18:03:45 +05:30
ezl-keygraph	3a07f8a81f	Merge pull request #140 from KeygraphHQ/feat/resume-workspace feat: add named workspaces with resume support	2026-02-17 00:23:23 +05:30
ezl-keygraph	e85f6e0c73	feat: add MSYS path fix, Claude Code CLI, and Windows instructions - Prevent MSYS from converting Unix container paths on Windows - Install @anthropic-ai/claude-code globally in the Docker image - Add Windows platform instructions to README	2026-02-16 20:11:08 +05:30
ezl-keygraph	1b696cac1b	fix: store checkpoint as success commit hash and show cumulative metrics - Swap commitGitSuccess/getGitCommitHash order so checkpoint in session.json points to the success commit (which contains deliverables) instead of the pre-agent marker commit - Simplify restoreGitCheckpoint: git reset --hard now naturally preserves completed agent deliverables, removing the in-memory backup/restore - Show cumulative cost/duration in workflow.log from session.json - Fill in per-agent metrics for skipped agents in workflow.log breakdown - Display cumulative cost in client output for resume runs	2026-02-14 02:52:11 +05:30
ezl-keygraph	c169b0d0a6	fix: restore CLAUDE_CODE_MAX_OUTPUT_TOKENS env var support Re-add the env var that was removed during SDK upgrade. Needed for controlling output token limits in SDK subprocesses.	2026-02-12 08:51:39 -08:00
ezl-keygraph	3c13a9a7e6	feat: upgrade claude-agent-sdk to 0.2.38 and adapt to new SDK types (#113) * feat: upgrade claude-agent-sdk to 0.2.38 and adapt to new SDK types - Bump @anthropic-ai/claude-agent-sdk from 0.1.x to 0.2.38 (both root and mcp-server) - Bump zod from 3.x to 4.x (SDK peer dependency) - Add allowDangerouslySkipPermissions to query options (required for bypassPermissions) - Suppress new SDK message types (tool_progress, tool_use_summary, auth_status) - Use structured error field on assistant messages instead of text-sniffing - Add stop_reason to result message handling for diagnostics - Add SDKAssistantMessageError type matching SDK's string literal union * chore: remove CLAUDE_CODE_MAX_OUTPUT_TOKENS from all config and docs	2026-02-11 00:19:59 +05:30
ezl-keygraph	2e9ee2a11e	fix: mount repos and configs directories into worker container (#107) * feat: use static repos/ folder mount instead of dynamic TARGET_REPO Replace dynamic per-run TARGET_REPO bind mount with a static ./repos:/repos mount. Users place target repositories under ./repos/ and reference them by folder name. This fixes stale mounts when switching targets and enables running multiple scans concurrently against different repos. * feat: mount configs directory into worker container * docs: add instructions for repos and configs directory setup	2026-02-10 00:05:41 +05:30
Arjun Malleswaran	4aee8db3d0	fix: add cache-busting param to screenshot URL (#82)	2026-02-07 10:08:25 -08:00
Arjun Malleswaran	9ed5327561	Feat/shannon by keygraph branding (#81) * feat: update splash screen screenshot with new branding * docs: add Trendshift badge to README	2026-02-07 10:02:48 -08:00
keygraphVarun	7cb0a0ae5e	Update README.md	2026-01-27 16:18:02 -08:00
keygraphVarun	8f42eb64fa	Update README.md	2026-01-22 15:26:16 -08:00
ajmallesh	a15408e23f	docs: remove Gemini 3 Pro from supported router models	2026-01-20 16:42:16 -08:00
ajmallesh	25fde5240a	docs: remove DeepSeek references from router mode documentation	2026-01-20 09:59:40 -08:00
ajmallesh	f85c1bd193	refactor: simplify router to OpenAI and OpenRouter providers only - Remove Gemini direct and DeepSeek provider configurations - Keep OpenAI (gpt-5.2, gpt-5-mini) and OpenRouter (Gemini 3 models) - Update documentation and environment examples - Remove cost column from README providers table	2026-01-20 09:49:16 -08:00
ajmallesh	cd04c7a6d2	feat: add model tracking and reporting across pipeline - Track actual model name from router through audit logs, session.json, and query output - Add router-utils.ts to resolve model names from ROUTER_DEFAULT env var - Inject model info into final report's Executive Summary section - Update documentation with supported providers, pricing, and config examples - Update router-config.json with latest model versions (GPT-5.2, Gemini 2.5, etc.)	2026-01-15 18:30:19 -08:00
Arjun Malleswaran	51e621d0d5	Feat/temporal (#46) * refactor: modularize claude-executor and extract shared utilities - Extract message handling into src/ai/message-handlers.ts with pure functions - Extract output formatting into src/ai/output-formatters.ts - Extract progress management into src/ai/progress-manager.ts - Add audit-logger.ts with Null Object pattern for optional logging - Add shared utilities: formatting.ts, file-io.ts, functional.ts - Consolidate getPromptNameForAgent into src/types/agents.ts * feat: add Claude Code custom commands for debug and review * feat: add Temporal integration foundation (phase 1-2) - Add Temporal SDK dependencies (@temporalio/client, worker, workflow, activity) - Add shared types for pipeline state, metrics, and progress queries - Add classifyErrorForTemporal() for retry behavior classification - Add docker-compose for Temporal server with SQLite persistence * feat: add Temporal activities for agent execution (phase 3) - Add activities.ts with heartbeat loop, git checkpoint/rollback, and error classification - Export runClaudePrompt, validateAgentOutput, ClaudePromptResult for Temporal use - Track attempt number via Temporal Context for accurate audit logging - Rollback git workspace before retry to ensure clean state * feat: add Temporal workflow for 5-phase pipeline orchestration (phase 4) * feat: add Temporal worker, client, and query tools (phase 5) - Add worker.ts with workflow bundling and graceful shutdown - Add client.ts CLI to start pipelines with progress polling - Add query.ts CLI to inspect running workflow state - Fix buffer overflow by truncating error messages and stack traces - Skip git operations gracefully on non-git repositories - Add kill.sh/start.sh dev scripts and Dockerfile.worker * feat: fix Docker worker container setup - Install uv instead of deprecated uvx package - Add mcp-server and configs directories to container - Mount target repo dynamically via TARGET_REPO env variable * fix: add report assembly step to Temporal workflow - Add assembleReportActivity to concatenate exploitation evidence files before report agent runs - Call assembleFinalReport in workflow Phase 5 before runReportAgent - Ensure deliverables directory exists before writing final report - Simplify pipeline-testing report prompt to just prepend header * refactor: consolidate Docker setup to root docker-compose.yml * feat: improve Temporal client UX and env handling - Change default to fire-and-forget (--wait flag to opt-in) - Add splash screen and improve console output formatting - Add .env to gitignore, remove from dockerignore for container access - Add Taskfile for common development commands * refactor: simplify session ID handling and improve Taskfile options - Include hostname in workflow ID for better audit log organization - Extract sanitizeHostname utility to audit/utils.ts for reuse - Remove unused generateSessionLogPath and buildLogFilePath functions - Simplify Taskfile with CONFIG/OUTPUT/CLEAN named parameters * chore: add .env.example and simplify .gitignore * docs: update README and CLAUDE.md for Temporal workflow usage - Replace Docker CLI instructions with Task-based commands - Add monitoring/stopping sections and workflow examples - Document Temporal orchestration layer and troubleshooting - Simplify file structure to key files overview * refactor: replace Taskfile with bash CLI script - Add shannon bash script with start/logs/query/stop/help commands - Remove Taskfile.yml dependency (no longer requires Task installation) - Update README.md and CLAUDE.md to use ./shannon commands - Update client.ts output to show ./shannon commands * docs: fix deliverable filename in README * refactor: remove direct CLI and .shannon-store.json in favor of Temporal - Delete src/shannon.ts direct CLI entry point (Temporal is now the only mode) - Remove .shannon-store.json session lock (Temporal handles workflow deduplication) - Remove broken scripts/export-metrics.js (imported non-existent function) - Update package.json to remove main, start script, and bin entry - Clean up CLAUDE.md and debug.md to remove obsolete references * chore: remove licensing comments from prompt files to prevent leaking into actual prompts * fix: resolve parallel workflow race conditions and retry logic bugs - Fix save_deliverable race condition using closure pattern instead of global variable - Fix error classification order so OutputValidationError matches before generic validation - Fix ApplicationFailure re-classification bug by checking instanceof before re-throwing - Add per-error-type retry limits (3 for output validation, 50 for billing) - Add fast retry intervals for pipeline testing mode (10s vs 5min) - Increase worker concurrent activities to 25 for parallel workflows * refactor: pipeline vuln→exploit workflow for parallel execution - Replace sync barrier between vuln/exploit phases with independent pipelines - Each vuln type runs: vuln agent → queue check → conditional exploit - Add checkExploitationQueue activity to skip exploits when no vulns found - Use Promise.allSettled for graceful failure handling across pipelines - Add PipelineSummary type for aggregated cost/duration/turns metrics * fix: re-throw retryable errors in checkExploitationQueue * fix: detect and retry on Claude Code spending cap errors - Add spending cap pattern detection in detectApiError() with retryable error - Add matching patterns to classifyErrorForTemporal() for proper Temporal retry - Add defense-in-depth safeguard in runClaudePrompt() for $0 cost / low turn detection - Add final sanity check in activities before declaring success * fix: increase heartbeat timeout to prevent false worker-dead detection Original 30s timeout was from POC spec assuming <5min activities. With hour-long activities and multiple concurrent workflows sharing one worker, resource contention causes event loop stalls exceeding 30s, triggering false heartbeat timeouts. Increased to 10min (prod) and 5min (testing). * fix: temporal db init * fix: persist home dir * feat: add per-workflow unified logging with ./shannon logs ID=<workflow-id> - Add WorkflowLogger class for human-readable, per-workflow log files - Create workflow.log in audit-logs/{workflowId}/ with phase, agent, tool, and LLM events - Update ./shannon logs to require ID param and tail specific workflow log - Add phase transition logging at workflow boundaries - Include workflow completion summary with agent breakdown (duration, cost) - Mount audit-logs volume in docker-compose for host access --------- Co-authored-by: ezl-keygraph <ezhil@keygraph.io>	2026-01-15 10:36:11 -08:00
ezl-keygraph	45acb16711	refactor: remove orchestration layer (#45) * refactor: remove orchestration layer and simplify CLI Remove the complex orchestration layer including checkpoint management, rollback/recovery commands, and session management commands. This consolidates the execution logic directly in shannon.ts for a simpler fire-and-forget execution model. Changes: - Remove checkpoint-manager.ts and rollback functionality - Remove command-handler.ts and cli/prompts.ts - Simplify session-manager.ts to just agent definitions - Consolidate orchestration logic in shannon.ts - Update CLAUDE.md documentation Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> * refactor: move session lock logic to shannon.ts, simplify session-manager - Reduce session-manager.ts to only AGENTS, AGENT_ORDER, getParallelGroups() - Move Session interface and lock file functions to shannon.ts - Simplify Session to only: id, webUrl, repoPath, status, startedAt - Remove unused types/session.ts Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> * refactor: use crypto.randomUUID() for session ID generation Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 22:58:17 +05:30
ezl-keygraph	8381198c41	feat: add configurable output directory with --output flag (#41) * feat: add configurable output directory with --output flag Add --output CLI flag to specify custom output directory for session folders containing audit logs, prompts, agent logs, and deliverables. Changes: - Add --output <path> CLI flag parsing - Update generateAuditPath() to use custom path when provided - Add consolidateOutputs() to copy deliverables to session folder - Update Docker examples with volume mounts for output directories - Default remains ./audit-logs/ when --output is not specified 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * feat: add configurable output directory with --output flag Add --output CLI flag to specify custom output directory for session folders containing audit logs, prompts, agent logs, and deliverables. Changes: - Add --output <path> CLI flag parsing - Store outputPath in Session interface for persistence - Update generateAuditPath() to use custom path when provided - Pass outputPath through pre-recon and checkpoint-manager - Add consolidateOutputs() to copy deliverables to session folder - Update Docker examples with volume mount instructions - Default remains ./audit-logs/ when --output is not specified 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: add gitkeep and fix formatting * fix: correct docker run command formatting in README Remove invalid inline comments after backslash continuations in docker run commands. Comments cannot appear after backslash line continuations in shell scripts, as the backslash escapes the newline character. Reorganized comments to appear on separate lines before or after the command block for better clarity and proper shell syntax. Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-01-08 23:50:42 +05:30
keygraphVarun	82fbf55843	Update README.md docs: rename Benchmark Results to Sample Reports, add link to XBOW benchmark	2026-01-05 13:04:33 -08:00
Khaushik-keygraph	11fdb69826	fix: Add Linux support for Docker volume permissions	2025-12-20 23:02:24 +05:30
ajmallesh	0068b34859	docs: fix GitHub links in Community & Support section Update GitHub Issues and Discussions links to use correct organization name (KeygraphHQ instead of keygraph). 🤖 Generated with [Claude Code](https://claude.com/claude-code)	2025-12-16 22:48:54 -08:00
ajmallesh	10e602ec87	docs: update Discord invite links	2025-12-16 13:33:02 -08:00
keygraphVarun	b0cd70b67c	clarify contributions	2025-12-16 13:14:29 -08:00
ajmallesh	26b42ecd67	docs: add Docker instructions for testing local applications Co-Authored-By: Khaushik-keygraph <khaushik.contractor@keygraph.io>	2025-12-15 10:34:24 -08:00
ajmallesh	cecb64729f	docs: add Windows Defender false positive guidance Closes #16	2025-12-02 19:07:37 -08:00
ajmallesh	c7de6636d9	docs: update Discord invite links	2025-12-01 09:24:19 -08:00
ajmallesh	7c2edeb4c0	chore: change license to AGPL-3.0	2025-11-26 18:45:36 -08:00
ajmallesh	9d20d94dda	docs: clarify Shannon is a white-box pentesting tool - Add prominent callout that Shannon Lite is designed for white-box (source-available) application security testing - Update XBOW benchmark description to "hint-free, source-aware" - Clarify benchmark comparison context (white-box vs black-box results) - Update benchmark performance comparison image 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 12:37:55 -08:00
keygraphVarun	7e0b2b28fe	cleanup	2025-11-22 20:43:09 +05:30
ajmallesh	719bf03293	fix: resolve Docker build failure and clarify env var configuration - Remove .env file with incorrect CLAUDE_CODE_MAX_TOKENS variable - Remove .env copy from Dockerfile that was causing build to fail - Update README to distinguish local (export) vs Docker (-e) env var usage - Add CLAUDE_CODE_MAX_OUTPUT_TOKENS to all Docker run examples The correct variable is CLAUDE_CODE_MAX_OUTPUT_TOKENS (not CLAUDE_CODE_MAX_TOKENS) and should be passed at runtime via -e flag for Docker or export for local runs. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-19 10:28:44 -08:00
keygraphVarun	68ec5ccc5a	style changes	2025-11-13 20:28:15 +05:30
keygraphVarun	f4f320dcb5	Link to benchmark	2025-11-13 20:27:26 +05:30
ajmallesh	acc4a1b032	Update license references from BSL to MPL in documentation 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-13 17:48:05 +05:30
ajmallesh	abfc4eba82	Rename SQLi/Command Injection to Injection throughout README Consolidates SQL Injection and Command Injection references to the unified "Injection" terminology for consistency with agent naming and OWASP categorization. Changes: - Updated feature descriptions and vulnerability lists - Modified architecture diagrams - Simplified targeted vulnerability scope 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-03 16:56:40 -08:00
ajmallesh	92db01bd2d	docs: add ctf-mode branch documentation to README Add a TIP callout in the Overview section documenting the ctf-mode branch for users who want to run Shannon against Capture-The-Flag challenges with optimized flag extraction prompts. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-03 10:35:45 -08:00
ajmallesh	34850477a2	refactor: update injection display name and add max tokens docs - Change agent prefix from [SQLi/Cmd] to [Injection] to reflect expanded scope - Add README documentation for CLAUDE_CODE_MAX_OUTPUT_TOKENS environment variable This update aligns the display naming with the expanded injection analysis scope that now covers SQLi, Command Injection, LFI/RFI, SSTI, Path Traversal, and Insecure Deserialization vulnerabilities. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-03 10:21:17 -08:00
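
Commit b62abfea4c in the log above describes a three-tier model system: a `ModelTier` type, a `resolveModel()` function in src/ai/models.ts, and a `modelTier` field on `AgentDefinition`, with pre-recon on the large tier, most agents on medium, and the report agent on small. The following is only a rough sketch of how such a lookup could work. The function signature, the `overrides` parameter, the placeholder model names, and the `injection-vuln` agent name are assumptions for illustration and are not taken from the repository.

```typescript
// Hypothetical sketch; the real src/ai/models.ts may look quite different.
type ModelTier = "small" | "medium" | "large";

// Placeholder model names (assumption). Real identifiers are provider-specific.
const DEFAULT_MODELS: Record<ModelTier, string> = {
  small: "haiku",
  medium: "sonnet",
  large: "opus",
};

// Only the fields needed for this sketch; the real type has more.
interface AgentDefinition {
  name: string;
  modelTier?: ModelTier; // omitted means "use the medium tier"
}

// Resolve a tier to a model name, letting caller-supplied overrides
// (for example, values read from environment variables) win.
function resolveModel(
  tier: ModelTier = "medium",
  overrides: Partial<Record<ModelTier, string>> = {},
): string {
  return overrides[tier] ?? DEFAULT_MODELS[tier];
}

// Illustrative agent list; "injection-vuln" is a made-up name.
const agents: AgentDefinition[] = [
  { name: "pre-recon", modelTier: "large" },
  { name: "injection-vuln" },
  { name: "report", modelTier: "small" },
];

const assignments = agents.map(
  (agent) => `${agent.name}: ${resolveModel(agent.modelTier)}`,
);
console.log(assignments.join("\n"));
```

Defaulting an unspecified tier to medium mirrors the commit's statement that most agents use the medium model, so only the exceptions need to declare a tier.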

59 Commits