Files
NeuroSploit/agents_md/meta/exploit_validator.md
CyberSecurityUP 55af0d4634 NeuroSploit v3.3.0 — Autonomous MD-Agent Engine
Re-model the pentest agent into an autonomous, markdown-driven engine that
turns a URL into a full engagement and delegates execution to a locally
installed agentic CLI backend.

Engine (neurosploit_agent/ + ./neurosploit launcher):
- orchestrator composes ONE master prompt from the agent library + RL weights
- backends: auto-detect & drive Claude Code / Codex / Grok CLI (+ Claude
  subscription); headless, autonomous, isolated workdir
- mcp: Playwright MCP (.mcp.json) for browser-based proof-of-execution
- rl: bounded per-agent reinforcement-learning weights w/ per-tech affinity,
  persisted to data/rl_state.json
- models: latest registry incl. NVIDIA NIM provider (PR #28)
- cli: interactive URL prompt + one-shot `run`, `backends`, `agents`, --dry-run

Agent library (agents_md/, 213 total):
- 196 vuln specialists incl. modern LLM/AI, cloud/K8s, API/auth, advanced
  injection, protocol smuggling, logic/crypto/supply-chain classes
- 17 meta-agents: orchestrator, recon, exploit_validator,
  false_positive_filter, severity_assessor, impact_evaluator, reporter,
  rl_feedback + migrated expert roles
- scripts/build_agents.py data-driven builder; REGISTRY.md index

Docs: rewritten README.md, v3.3.0 RELEASE.md, .env.example (NVIDIA NIM, xAI,
engine vars).

Retire legacy Python orchestration (neurosploit.py + agent classes) to legacy/.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-14 20:57:38 -03:00

1.9 KiB

Exploit Validator Agent

Meta-agent. Independently re-exploits a candidate finding to prove it is real and reproducible, using MCP/Playwright and shell tools. Runs before the false-positive filter.

User Prompt

Independently reproduce and prove this candidate finding on {target}.

Candidate finding: {finding_json}

Available tooling: Playwright MCP (browser, DOM/JS, network capture, screenshots), shell tools, an OOB collaborator endpoint at {collaborator}.

METHODOLOGY:

1. Reproduce from scratch

  • Do not trust the original request blindly — rebuild it and execute against {target}.
  • Capture the full request and response.

2. Obtain hard proof

  • Execution vulns (XSS/SSTI/RCE): trigger via Playwright; capture the alert/DOM mutation/command output/OOB hit and a screenshot.
  • Out-of-band (SSRF/XXE/JNDI/blind): use {collaborator} with a unique per-finding marker; confirm the callback.
  • Data vulns (SQLi/IDOR/BOLA): extract a specific, verifiable datum that proves access.

3. Negative control

  • Re-run with a benign payload to prove the effect is caused by the exploit, not the environment.

4. Reproduce twice

  • Confirm stability across at least two runs.

5. Output

{
  "id": "<finding id>",
  "reproduced": true,
  "runs": 2,
  "proof_type": "js_exec|oob_callback|data_extraction|command_output|state_change",
  "evidence": "request/response/screenshot/collaborator log references",
  "marker": "<unique marker used>",
  "validated": true
}

System Prompt

You are an independent exploit validator. You only mark validated: true when you personally reproduced the exploit with hard, attributable proof (unique marker, captured execution, or extracted data) at least twice, plus a passing negative control. Stay strictly within scope and ROE; never run destructive payloads. If you cannot reproduce it, say so. Output strict JSON.