AI / LLM Red Team Field Manual & Consultant's Handbook
A comprehensive operational toolkit for conducting AI/LLM red team assessments on Large Language Models, AI agents, RAG pipelines, and AI-enabled applications. This repository provides both tactical field guidance and strategic consulting frameworks.
📚 What's Inside
This repository contains three core resources:
1. AI LLM Red Team Handbook (260KB, 8000+ lines)
A complete consultancy guide covering:
- Part I: Foundations - Methodology, ethics, legal considerations, and mindset (Chapters 1-3)
- Part II: Engagement Framework - SOW/RoE templates, threat modeling, scoping, lab setup (Chapters 4-7)
- Part III: Operations - Evidence collection, reporting, presentations, lessons learned (Chapters 8-11)
- Part IV: Technical Deep Dives - RAG pipelines, supply chain security, prompt injection, and more (Chapters 12-14+)
- Appendices - Tools, resources, templates, and references
Current Coverage (14 Chapters):
- Introduction to AI Red Teaming
- Ethics, Legal, and Stakeholder Communication
- The Red Teamer's Mindset
- SOW, Rules of Engagement, and Client Onboarding
- Threat Modeling and Risk Analysis
- Scoping an Engagement
- Lab Setup and Environmental Safety
- Evidence, Documentation, and Chain of Custody
- Writing Effective Reports and Deliverables
- Presenting Results and Remediation Guidance
- Lessons Learned and Building Future Readiness
- Retrieval-Augmented Generation (RAG) Pipelines
- Data Provenance and Supply Chain Security
- Prompt Injection (Direct/Indirect, 1st/3rd Party)
2. AI LLM Red Team Field Manual (56KB)
Compact operational reference for field use:
- Quick-reference attack prompts and payloads
- Testing checklists and methodology
- Tool commands and configurations
- OWASP Top 10 for LLMs mapping
- MITRE ATLAS framework alignment
3. Python Testing Framework (scripts/)
Automated testing suite including:
- Prompt injection attacks
- Safety bypass and jailbreak tests
- Data leakage and PII extraction
- Tool/plugin misuse testing
- Adversarial fuzzing
- Model integrity validation
🚀 Quick Start
# Clone the repository
git clone https://github.com/shiva108/ai-llm-red-team-handbook.git
cd ai-llm-red-team-handbook
# Manual testing: Start with the Field Manual
open docs/AI_LLM\ Red\ Team\ Field\ Manual.md
# Automated testing:
cd scripts
pip install -r requirements.txt
python runner.py --config config.py
📖 Detailed setup: See Configuration Guide
📁 Repository Structure
ai-llm-red-team-handbook/
├── docs/
│ ├── AI LLM Red Team Hand book.md # Complete consultancy guide (260KB)
│ ├── AI_LLM Red Team Field Manual.md # Operational field reference (56KB)
│ ├── Building a World-Class AI Red Team.md # Team-building strategy guide
│ ├── Configuration.md # Setup and configuration guide
│ ├── Full_LLM_RedTeam_Report_Template.docx # Client report template
│ └── archive/ # Historical versions
├── scripts/
│ ├── runner.py # Test orchestration
│ ├── test_prompt_injection.py # Prompt injection tests
│ ├── test_safety_bypass.py # Jailbreak tests
│ ├── test_data_exposure.py # Data leakage tests
│ ├── test_tool_misuse.py # Plugin/tool abuse tests
│ ├── test_fuzzing.py # Adversarial fuzzing
│ └── requirements.txt # Python dependencies
├── assets/ # Images and resources
└── README.md # This file
🎯 Use Cases
| Use Case | Resources | Description |
|---|---|---|
| Red Team Assessments | Field Manual + Python Framework | Conduct comprehensive LLM security assessments |
| Consultant Engagements | Handbook + Report Template | Full methodology for client projects |
| Team Training | Handbook Foundations (Ch 1-11) | Onboard and develop security teams |
| Research & Development | Technical Chapters (Ch 12+) | Deep dives into specific attack surfaces |
| Compliance & Audit | Threat Modeling (Ch 5) + Tools | Risk assessments and control validation |
⚙️ Prerequisites
Manual Testing:
- Any text editor + target LLM access
Automated Testing:
- Python 3.8+
- Dependencies:
requests,pytest,pydantic,python-dotenv - API credentials for target LLM
🧪 Python Testing Framework
Test Suites
test_prompt_injection.py- Automated prompt injection attackstest_safety_bypass.py- Jailbreak and guardrail bypass teststest_data_exposure.py- Data leakage and PII extractiontest_tool_misuse.py- Function-calling and plugin abusetest_fuzzing.py- Adversarial input fuzzingtest_integrity.py- Model integrity and consistency
Configuration
Create scripts/.env:
API_ENDPOINT=https://api.example.com/v1/chat/completions
API_KEY=your-secret-api-key
MODEL_NAME=gpt-4
Run tests:
python runner.py # All tests
python runner.py --test prompt_injection # Specific test
python runner.py --verbose # Verbose output
📖 Full configuration options: Configuration Guide
🗺️ Roadmap
Current Status:
- ✅ Handbook: 14 chapters complete (Foundations through Prompt Injection)
- ✅ Field Manual: Operational reference complete
- ✅ Python Framework: Core test suites implemented
In Progress:
- 🔄 Additional technical chapters (15-46)
- 🔄 Multimodal AI attack techniques
- 🔄 Advanced RAG exploitation scenarios
Planned:
- Sample RAG and LLM test environments
- Interactive attack case studies
- Extended plugin/tool abuse coverage
- Video tutorials and walkthroughs
Contributions welcome via issues and PRs.
📄 License
Licensed under CC BY-SA 4.0 (Creative Commons Attribution-ShareAlike 4.0 International).
See LICENSE for details.
⚠️ Disclaimer
For authorized security testing only.
Ensure:
- Written authorization (SOW/RoE) is in place
- Compliance with applicable laws and regulations (CFAA, GDPR, etc.)
- Testing conducted in isolated environments when appropriate
- No unauthorized testing on production systems
The authors accept no liability for misuse or unauthorized use of this material.
🤝 Contributing
We welcome contributions! Please:
- Review existing issues and PRs
- Follow the established format and style
- Test any code additions
- Submit clear, well-documented PRs
For major changes, please open an issue first to discuss.
📬 Contact & Support
- Issues: GitHub Issues
- Discussions: GitHub Discussions
Last Updated: December 2024 | Handbook Chapters: 14/46 Complete