AI / LLM Red Team Field Manual & Consultant's Handbook

Repository Banner

License Python Contributions Welcome Last Updated

A comprehensive operational toolkit for conducting AI/LLM red team assessments on Large Language Models, AI agents, RAG pipelines, and AI-enabled applications. This repository provides both tactical field guidance and strategic consulting frameworks.


📚 What's Inside

This repository contains three core resources:

1. AI LLM Red Team Handbook (260KB, 8000+ lines)

A complete consultancy guide covering:

  • Part I: Foundations - Methodology, ethics, legal considerations, and mindset (Chapters 1-3)
  • Part II: Engagement Framework - SOW/RoE templates, threat modeling, scoping, lab setup (Chapters 4-7)
  • Part III: Operations - Evidence collection, reporting, presentations, lessons learned (Chapters 8-11)
  • Part IV: Technical Deep Dives - RAG pipelines, supply chain security, prompt injection, and more (Chapters 12-14+)
  • Appendices - Tools, resources, templates, and references

Current Coverage (14 Chapters):

  1. Introduction to AI Red Teaming
  2. Ethics, Legal, and Stakeholder Communication
  3. The Red Teamer's Mindset
  4. SOW, Rules of Engagement, and Client Onboarding
  5. Threat Modeling and Risk Analysis
  6. Scoping an Engagement
  7. Lab Setup and Environmental Safety
  8. Evidence, Documentation, and Chain of Custody
  9. Writing Effective Reports and Deliverables
  10. Presenting Results and Remediation Guidance
  11. Lessons Learned and Building Future Readiness
  12. Retrieval-Augmented Generation (RAG) Pipelines
  13. Data Provenance and Supply Chain Security
  14. Prompt Injection (Direct/Indirect, 1st/3rd Party)

2. AI LLM Red Team Field Manual (56KB)

Compact operational reference for field use:

  • Quick-reference attack prompts and payloads
  • Testing checklists and methodology
  • Tool commands and configurations
  • OWASP Top 10 for LLMs mapping
  • MITRE ATLAS framework alignment

3. Python Testing Framework (scripts/)

Automated testing suite including:

  • Prompt injection attacks
  • Safety bypass and jailbreak tests
  • Data leakage and PII extraction
  • Tool/plugin misuse testing
  • Adversarial fuzzing
  • Model integrity validation

🚀 Quick Start

# Clone the repository
git clone https://github.com/shiva108/ai-llm-red-team-handbook.git
cd ai-llm-red-team-handbook

# Manual testing: Start with the Field Manual
open docs/AI_LLM\ Red\ Team\ Field\ Manual.md

# Automated testing:
cd scripts
pip install -r requirements.txt
python runner.py --config config.py

📖 Detailed setup: See Configuration Guide


📁 Repository Structure

ai-llm-red-team-handbook/
├── docs/
│   ├── AI LLM Red Team Hand book.md         # Complete consultancy guide (260KB)
│   ├── AI_LLM Red Team Field Manual.md      # Operational field reference (56KB)
│   ├── Building a World-Class AI Red Team.md # Team-building strategy guide
│   ├── Configuration.md                      # Setup and configuration guide
│   ├── Full_LLM_RedTeam_Report_Template.docx # Client report template
│   └── archive/                              # Historical versions
├── scripts/
│   ├── runner.py                             # Test orchestration
│   ├── test_prompt_injection.py              # Prompt injection tests
│   ├── test_safety_bypass.py                 # Jailbreak tests
│   ├── test_data_exposure.py                 # Data leakage tests
│   ├── test_tool_misuse.py                   # Plugin/tool abuse tests
│   ├── test_fuzzing.py                       # Adversarial fuzzing
│   └── requirements.txt                      # Python dependencies
├── assets/                                   # Images and resources
└── README.md                                 # This file

🎯 Use Cases

Use Case Resources Description
Red Team Assessments Field Manual + Python Framework Conduct comprehensive LLM security assessments
Consultant Engagements Handbook + Report Template Full methodology for client projects
Team Training Handbook Foundations (Ch 1-11) Onboard and develop security teams
Research & Development Technical Chapters (Ch 12+) Deep dives into specific attack surfaces
Compliance & Audit Threat Modeling (Ch 5) + Tools Risk assessments and control validation

⚙️ Prerequisites

Manual Testing:

  • Any text editor + target LLM access

Automated Testing:

  • Python 3.8+
  • Dependencies: requests, pytest, pydantic, python-dotenv
  • API credentials for target LLM

🧪 Python Testing Framework

Test Suites

  • test_prompt_injection.py - Automated prompt injection attacks
  • test_safety_bypass.py - Jailbreak and guardrail bypass tests
  • test_data_exposure.py - Data leakage and PII extraction
  • test_tool_misuse.py - Function-calling and plugin abuse
  • test_fuzzing.py - Adversarial input fuzzing
  • test_integrity.py - Model integrity and consistency

Configuration

Create scripts/.env:

API_ENDPOINT=https://api.example.com/v1/chat/completions
API_KEY=your-secret-api-key
MODEL_NAME=gpt-4

Run tests:

python runner.py                           # All tests
python runner.py --test prompt_injection   # Specific test
python runner.py --verbose                 # Verbose output

📖 Full configuration options: Configuration Guide


🗺️ Roadmap

Current Status:

  • Handbook: 14 chapters complete (Foundations through Prompt Injection)
  • Field Manual: Operational reference complete
  • Python Framework: Core test suites implemented

In Progress:

  • 🔄 Additional technical chapters (15-46)
  • 🔄 Multimodal AI attack techniques
  • 🔄 Advanced RAG exploitation scenarios

Planned:

  • Sample RAG and LLM test environments
  • Interactive attack case studies
  • Extended plugin/tool abuse coverage
  • Video tutorials and walkthroughs

Contributions welcome via issues and PRs.


📄 License

Licensed under CC BY-SA 4.0 (Creative Commons Attribution-ShareAlike 4.0 International).

See LICENSE for details.


⚠️ Disclaimer

For authorized security testing only.

Ensure:

  • Written authorization (SOW/RoE) is in place
  • Compliance with applicable laws and regulations (CFAA, GDPR, etc.)
  • Testing conducted in isolated environments when appropriate
  • No unauthorized testing on production systems

The authors accept no liability for misuse or unauthorized use of this material.


🤝 Contributing

We welcome contributions! Please:

  1. Review existing issues and PRs
  2. Follow the established format and style
  3. Test any code additions
  4. Submit clear, well-documented PRs

For major changes, please open an issue first to discuss.


📬 Contact & Support


Last Updated: December 2024 | Handbook Chapters: 14/46 Complete

S
Description
No description provided
Readme 335 MiB
Languages
Python 96%
Shell 3.8%
Dockerfile 0.2%