Adam Wilson b42b684974 Revise README for clarity and updated references
Updated the README to reflect the author's name and provide clearer links to the research paper. Removed redundant sections and clarified repository contents and usage notes.
2025-11-12 15:07:15 -07:00
2025-09-02 15:13:51 -06:00

LLMSecOps Research

Overview

This repository contains code, workflows, and experiments that support graduate research conducted by Adam Wilson for the M.Sc., Information Security Engineering program at SANS Technology Institute. The full paper, "Automating Generative AI Guidelines: Reducing Prompt Injection Risk with 'Shift-Left' MITRE ATLAS Mitigation Testing," is available at sans.org and sans.edu.

Research Paper Abstract

Automated testing during the build stage of the AI engineering life cycle can evaluate the effectiveness of generative AI guidelines against prompt injection attacks. This technique provides early feedback for developers and defenders when assessing the mitigation performance of an LLM-integrated application. This research combines prompt engineering techniques and automated policy violation checks in the GitHub Actions cloud-native build system to demonstrate a practical “shift-left” approach to securing apps based on foundation models.

Repository Contents

  • Example prompt payloads and test harnesses used to evaluate prompt injection mitigations.
  • GitHub Actions workflows that run automated tests during the build stage to detect policy violations.
  • Scripts and tooling that demonstrate how to integrate automated checks into an AI engineering pipeline.

Usage Notes

Refer to individual directories and workflow files for details on running tests and customizing checks for your environment. The code is intended to reproduce and extend the experiments described in the paper.

License and Citation

If you use this repository for research or production, please cite the accompanying paper and follow any licensing terms included with the code.

Description
No description provided
Readme MIT 132 MiB
Languages
Python 98.1%
Shell 1.9%