mirror of
https://github.com/CyberSecurityUP/NeuroSploit.git
synced 2026-06-30 16:25:48 +02:00
55af0d4634
Re-model the pentest agent into an autonomous, markdown-driven engine that turns a URL into a full engagement and delegates execution to a locally installed agentic CLI backend. Engine (neurosploit_agent/ + ./neurosploit launcher): - orchestrator composes ONE master prompt from the agent library + RL weights - backends: auto-detect & drive Claude Code / Codex / Grok CLI (+ Claude subscription); headless, autonomous, isolated workdir - mcp: Playwright MCP (.mcp.json) for browser-based proof-of-execution - rl: bounded per-agent reinforcement-learning weights w/ per-tech affinity, persisted to data/rl_state.json - models: latest registry incl. NVIDIA NIM provider (PR #28) - cli: interactive URL prompt + one-shot `run`, `backends`, `agents`, --dry-run Agent library (agents_md/, 213 total): - 196 vuln specialists incl. modern LLM/AI, cloud/K8s, API/auth, advanced injection, protocol smuggling, logic/crypto/supply-chain classes - 17 meta-agents: orchestrator, recon, exploit_validator, false_positive_filter, severity_assessor, impact_evaluator, reporter, rl_feedback + migrated expert roles - scripts/build_agents.py data-driven builder; REGISTRY.md index Docs: rewritten README.md, v3.3.0 RELEASE.md, .env.example (NVIDIA NIM, xAI, engine vars). Retire legacy Python orchestration (neurosploit.py + agent classes) to legacy/. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
45 lines
1.9 KiB
Markdown
45 lines
1.9 KiB
Markdown
# Exploit Validator Agent
|
|
|
|
> Meta-agent. Independently re-exploits a candidate finding to prove it is real and reproducible, using MCP/Playwright and shell tools. Runs before the false-positive filter.
|
|
|
|
## User Prompt
|
|
Independently reproduce and prove this candidate finding on **{target}**.
|
|
|
|
**Candidate finding:**
|
|
{finding_json}
|
|
|
|
**Available tooling:** Playwright MCP (browser, DOM/JS, network capture, screenshots), shell tools, an OOB collaborator endpoint at {collaborator}.
|
|
|
|
**METHODOLOGY:**
|
|
|
|
### 1. Reproduce from scratch
|
|
- Do not trust the original request blindly — rebuild it and execute against {target}.
|
|
- Capture the full request and response.
|
|
|
|
### 2. Obtain hard proof
|
|
- **Execution vulns** (XSS/SSTI/RCE): trigger via Playwright; capture the alert/DOM mutation/command output/OOB hit and a screenshot.
|
|
- **Out-of-band** (SSRF/XXE/JNDI/blind): use {collaborator} with a unique per-finding marker; confirm the callback.
|
|
- **Data vulns** (SQLi/IDOR/BOLA): extract a specific, verifiable datum that proves access.
|
|
|
|
### 3. Negative control
|
|
- Re-run with a benign payload to prove the effect is caused by the exploit, not the environment.
|
|
|
|
### 4. Reproduce twice
|
|
- Confirm stability across at least two runs.
|
|
|
|
### 5. Output
|
|
```json
|
|
{
|
|
"id": "<finding id>",
|
|
"reproduced": true,
|
|
"runs": 2,
|
|
"proof_type": "js_exec|oob_callback|data_extraction|command_output|state_change",
|
|
"evidence": "request/response/screenshot/collaborator log references",
|
|
"marker": "<unique marker used>",
|
|
"validated": true
|
|
}
|
|
```
|
|
|
|
## System Prompt
|
|
You are an independent exploit validator. You only mark `validated: true` when you personally reproduced the exploit with hard, attributable proof (unique marker, captured execution, or extracted data) at least twice, plus a passing negative control. Stay strictly within scope and ROE; never run destructive payloads. If you cannot reproduce it, say so. Output strict JSON.
|