From 09c0427de94f2e49ab78e053feade4e68d518e0e Mon Sep 17 00:00:00 2001
From: Matteo Meucci <matteo.meucci@owasp.org>
Date: Sun, 23 Nov 2025 12:23:32 +0100
Subject: [PATCH] Update AITG-APP-05_Testing_for_Unsafe_Outputs.md

---
 .../tests/AITG-APP-05_Testing_for_Unsafe_Outputs.md       | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/Document/content/tests/AITG-APP-05_Testing_for_Unsafe_Outputs.md b/Document/content/tests/AITG-APP-05_Testing_for_Unsafe_Outputs.md
index 8ef23db..05ba97e 100644
--- a/Document/content/tests/AITG-APP-05_Testing_for_Unsafe_Outputs.md
+++ b/Document/content/tests/AITG-APP-05_Testing_for_Unsafe_Outputs.md
@@ -101,10 +101,12 @@ A vulnerability is confirmed if the AI model:
 
 ### Suggested Tools
 - **Garak – Unsafe Output Detection Module**: Specialized Garak module explicitly designed to identify unsafe model outputs.
-  - **URL**: [AV Spam Scanning](https://github.com/NVIDIA/garak/blob/main/garak/probes/)
-- **State of the art content-level moderation models**, below some examples:
-  - **LlamaGuard4**: Open source moderation model to detect unsafe text and unsafe combination of text and images.
+  - **URL**: [Garak AV Spam Scanning](https://github.com/NVIDIA/garak/blob/main/garak/probes/av_spam_scanning.py)
+  - **Llama Guard 4**: Open source moderation model to detect unsafe text and unsafe combination of text and images.
+  -  **URL**: [Llama Guard 4](https://www.llama.com/docs/model-cards-and-prompt-formats/llama-guard-4/)
   - **LlavaGuard**, **ShieldGemma2**: Open source moderation model to detection unsafe images.
+  - **URL**: [ShieldGemma2](https://deepmind.google/models/gemma/shieldgemma-2/)
+    
 
 ### References
 - **Title**: OWASP Top 10 LLM05:2025 Improper Output Handling - [https://genai.owasp.org/llm-top-10/](https://genai.owasp.org/llmrisk/llm052025-improper-output-handling/)