mirror of https://github.com/Shiva108/ai-llm-red-team-handbook.git synced 2026-07-16 17:17:33 +02:00

T

shiva108 f4ecc3794e feat: Add markdown linting script and apply initial formatting fixes to the handbook, including blank lines around

2025-12-04 10:12:20 +01:00

assets

feat: Revamp banner.svg with updated gradients, new geometric patterns, circuit overlays, and a redesigned text layout.

2025-11-30 13:02:26 +01:00

docs

feat: Add markdown linting script and apply initial formatting fixes to the handbook, including blank lines around

2025-12-04 10:12:20 +01:00

scripts

Added demo scripts and report example

2025-11-30 12:43:44 +01:00

LICENSE

Initial commit

2025-11-27 09:46:15 +01:00

README.md

docs: Significantly expand and restructure README with detailed project components, repository structure, use cases, and roadmap, plus new contributing and contact sections.

2025-12-03 22:34:36 +01:00

README.md

AI / LLM Red Team Field Manual & Consultant's Handbook

A comprehensive operational toolkit for conducting AI/LLM red team assessments on Large Language Models, AI agents, RAG pipelines, and AI-enabled applications. This repository provides both tactical field guidance and strategic consulting frameworks.

📚 What's Inside

This repository contains three core resources:

1. AI LLM Red Team Handbook (260KB, 8000+ lines)

A complete consultancy guide covering:

Part I: Foundations - Methodology, ethics, legal considerations, and mindset (Chapters 1-3)
Part II: Engagement Framework - SOW/RoE templates, threat modeling, scoping, lab setup (Chapters 4-7)
Part III: Operations - Evidence collection, reporting, presentations, lessons learned (Chapters 8-11)
Part IV: Technical Deep Dives - RAG pipelines, supply chain security, prompt injection, and more (Chapters 12-14+)
Appendices - Tools, resources, templates, and references

Current Coverage (14 Chapters):

Introduction to AI Red Teaming
Ethics, Legal, and Stakeholder Communication
The Red Teamer's Mindset
SOW, Rules of Engagement, and Client Onboarding
Threat Modeling and Risk Analysis
Scoping an Engagement
Lab Setup and Environmental Safety
Evidence, Documentation, and Chain of Custody
Writing Effective Reports and Deliverables
Presenting Results and Remediation Guidance
Lessons Learned and Building Future Readiness
Retrieval-Augmented Generation (RAG) Pipelines
Data Provenance and Supply Chain Security
Prompt Injection (Direct/Indirect, 1st/3rd Party)

2. AI LLM Red Team Field Manual (56KB)

Compact operational reference for field use:

Quick-reference attack prompts and payloads
Testing checklists and methodology
Tool commands and configurations
OWASP Top 10 for LLMs mapping
MITRE ATLAS framework alignment

3. Python Testing Framework (`scripts/`)

Automated testing suite including:

Prompt injection attacks
Safety bypass and jailbreak tests
Data leakage and PII extraction
Tool/plugin misuse testing
Adversarial fuzzing
Model integrity validation

🚀 Quick Start

# Clone the repository
git clone https://github.com/shiva108/ai-llm-red-team-handbook.git
cd ai-llm-red-team-handbook

# Manual testing: Start with the Field Manual
open docs/AI_LLM\ Red\ Team\ Field\ Manual.md

# Automated testing:
cd scripts
pip install -r requirements.txt
python runner.py --config config.py

📖 Detailed setup: See Configuration Guide

📁 Repository Structure

ai-llm-red-team-handbook/
├── docs/
│   ├── AI LLM Red Team Hand book.md         # Complete consultancy guide (260KB)
│   ├── AI_LLM Red Team Field Manual.md      # Operational field reference (56KB)
│   ├── Building a World-Class AI Red Team.md # Team-building strategy guide
│   ├── Configuration.md                      # Setup and configuration guide
│   ├── Full_LLM_RedTeam_Report_Template.docx # Client report template
│   └── archive/                              # Historical versions
├── scripts/
│   ├── runner.py                             # Test orchestration
│   ├── test_prompt_injection.py              # Prompt injection tests
│   ├── test_safety_bypass.py                 # Jailbreak tests
│   ├── test_data_exposure.py                 # Data leakage tests
│   ├── test_tool_misuse.py                   # Plugin/tool abuse tests
│   ├── test_fuzzing.py                       # Adversarial fuzzing
│   └── requirements.txt                      # Python dependencies
├── assets/                                   # Images and resources
└── README.md                                 # This file

🎯 Use Cases

Use Case	Resources	Description
Red Team Assessments	Field Manual + Python Framework	Conduct comprehensive LLM security assessments
Consultant Engagements	Handbook + Report Template	Full methodology for client projects
Team Training	Handbook Foundations (Ch 1-11)	Onboard and develop security teams
Research & Development	Technical Chapters (Ch 12+)	Deep dives into specific attack surfaces
Compliance & Audit	Threat Modeling (Ch 5) + Tools	Risk assessments and control validation

⚙️ Prerequisites

Manual Testing:

Any text editor + target LLM access

Automated Testing:

Python 3.8+
Dependencies: requests, pytest, pydantic, python-dotenv
API credentials for target LLM

🧪 Python Testing Framework

Test Suites

test_prompt_injection.py - Automated prompt injection attacks
test_safety_bypass.py - Jailbreak and guardrail bypass tests
test_data_exposure.py - Data leakage and PII extraction
test_tool_misuse.py - Function-calling and plugin abuse
test_fuzzing.py - Adversarial input fuzzing
test_integrity.py - Model integrity and consistency

Configuration

Create scripts/.env:

API_ENDPOINT=https://api.example.com/v1/chat/completions
API_KEY=your-secret-api-key
MODEL_NAME=gpt-4

Run tests:

python runner.py                           # All tests
python runner.py --test prompt_injection   # Specific test
python runner.py --verbose                 # Verbose output

📖 Full configuration options: Configuration Guide

🗺️ Roadmap

Current Status:

✅ Handbook: 14 chapters complete (Foundations through Prompt Injection)
✅ Field Manual: Operational reference complete
✅ Python Framework: Core test suites implemented

In Progress:

🔄 Additional technical chapters (15-46)
🔄 Multimodal AI attack techniques
🔄 Advanced RAG exploitation scenarios

Planned:

Sample RAG and LLM test environments
Interactive attack case studies
Extended plugin/tool abuse coverage
Video tutorials and walkthroughs

Contributions welcome via issues and PRs.

📄 License

Licensed under CC BY-SA 4.0 (Creative Commons Attribution-ShareAlike 4.0 International).

See LICENSE for details.

⚠️ Disclaimer

For authorized security testing only.

Ensure:

Written authorization (SOW/RoE) is in place
Compliance with applicable laws and regulations (CFAA, GDPR, etc.)
Testing conducted in isolated environments when appropriate
No unauthorized testing on production systems

The authors accept no liability for misuse or unauthorized use of this material.

🤝 Contributing

We welcome contributions! Please:

Review existing issues and PRs
Follow the established format and style
Test any code additions
Submit clear, well-documented PRs

For major changes, please open an issue first to discuss.

📬 Contact & Support

Issues: GitHub Issues
Discussions: GitHub Discussions

Last Updated: December 2024 | Handbook Chapters: 14/46 Complete

README.md

AI / LLM Red Team Field Manual & Consultant's Handbook

📚 What's Inside

1. AI LLM Red Team Handbook (260KB, 8000+ lines)

2. AI LLM Red Team Field Manual (56KB)

3. Python Testing Framework (scripts/)

🚀 Quick Start

📁 Repository Structure

🎯 Use Cases

⚙️ Prerequisites

🧪 Python Testing Framework

Test Suites

Configuration

🗺️ Roadmap

📄 License

⚠️ Disclaimer

🤝 Contributing

📬 Contact & Support

3. Python Testing Framework (`scripts/`)