From 9c5af76f6f91f403efa2f0e27ca2da7958c5341c Mon Sep 17 00:00:00 2001 From: Marco Morana Date: Mon, 17 Nov 2025 10:50:13 -0500 Subject: [PATCH] Update 2.1.2_Identify_RAI_threats.md Changed image with threat labels, neew image file is "TAI Threats SAIF mapping.png" e added Threats References labesl to the text descriptions --- .../content/2.1.2_Identify_RAI_threats.md | 26 +++++++++---------- 1 file changed, 13 insertions(+), 13 deletions(-) diff --git a/Document/content/2.1.2_Identify_RAI_threats.md b/Document/content/2.1.2_Identify_RAI_threats.md index 05bf359..19f08f7 100644 --- a/Document/content/2.1.2_Identify_RAI_threats.md +++ b/Document/content/2.1.2_Identify_RAI_threats.md @@ -16,7 +16,7 @@ Before we review it, let's briefly revisit key Trustworthy AI concepts defined b Based on these concepts, we carefully revise your previous identification of threats:

- RAI Threat Model + RAI Threat Model

*Fig. 3 Responsible AI Threat modeling* @@ -25,27 +25,27 @@ Based on these concepts, we carefully revise your previous identification of thr #### **🟦 AI Application (4 additions):** -1. Hallucinations -2. Toxic Content Generation -3. Automation Bias -4. Lack of Explainability -5. Lack of Accountability in Agent/Plugin Actions +1. Hallucinations (T01-HAL) +2. Toxic Content Generation (T01-TCG) +3. Automation Bias (T01-AB) +4. Lack of Explainability (T01-LoE) +5. Lack of Accountability in Agent/Plugin Actions (T01-LoA) #### **🟪 AI Model (3 additions):** -6. Bias and Discrimination -7. Overfitting and Generalization -8. Misalignment with Human Intent +6. Bias and Discrimination (T01-B&D) +7. Overfitting and Generalization (T01-O&B) +8. Misalignment with Human Intent (T01-MHI) #### **🟩 AI Infrastructure (2 additions):** -9. Lack of Traceability & Auditability of Infrastructure Processes +9. Lack of Traceability (T01-LoT) & Lack of Auditability (T01-LoA) of Infrastructure Processes #### **🟨 AI Data (3 additions):** -10. Non-representative Training Data -11. Toxic or Harmful Training Data -12. Data Privacy Violations +10. Non-representative Training Data (T01-NRTD) +11. Toxic or Harmful Training Data (T01-T&HD) +12. Data Privacy Violations (T01-DPV) The following tables provide a structured overview, facilitating clear visibility for identifying, managing, and mitigating critical Responsible AI and Trustworthy AI (RT) threats across your entire AI system architecture.