Commit Graph

191 Commits

Author SHA1 Message Date
Matteo Meucci c0c38b582e Merge pull request #32 from zangobot/main
Include more testing tools, by dividing them between general-purpouse or domain-specific
2025-09-09 16:37:06 +02:00
Luca Demetrio 0749eeda55 Update AITG-MOD-01_Testing_for_Evasion_Attacks.md
Removed typo
2025-09-02 11:21:23 +02:00
Matteo Meucci 5268eff3ae Merge pull request #31 from RoeiArpaly/main
Update AITG-APP-04_Testing_for_Input_Leakage.md
2025-09-01 09:43:06 +02:00
Roei Arpaly 4182d8f869 Update AITG-APP-04_Testing_for_Input_Leakage.md
Co-authored-by: Yoni Birman <birmanbirman@gmail.com>
2025-08-31 23:13:40 +03:00
Matteo Meucci ddd1d12544 Merge pull request #29 from RoeiArpaly/main
Update AITG-APP-04_Testing_for_Input_Leakage.md
2025-08-13 10:53:21 +02:00
Roei Arpaly 296224d780 Update AITG-APP-04_Testing_for_Input_Leakage.md
adding adversarial input test cases
2025-08-13 11:46:54 +03:00
maurapintor 0ed6bb99ad added secml-torch and adv-lib, updated description of deepsec 2025-08-08 10:16:15 +02:00
Luca Demetrio be0385d8cf Update AITG-MOD-01_Testing_for_Evasion_Attacks.md
Update AI security testing tools by adding difference between general-purpose and domain-specific libraries
2025-08-08 09:57:15 +02:00
Matteo Meucci 2399f8293b Merge pull request #26 from fedric95/main
Hallucination - "Debunking" vs "Factuality and Misinformation"
2025-08-04 10:18:41 +02:00
Federico Ricciuti befe2755c7 Introduced Debunking tests and a differentiation between "Factuality and Misinformation" and "Debunking" hallucinations. As described by Giskard in the Phrase benchmark. 2025-08-03 14:34:38 +02:00
Matteo Meucci 066bfaa2dd Merge pull request #25 from fedric95/main 2025-07-26 00:17:36 +04:00
fedric95 d27026fda7 Merge branch 'OWASP:main' into main 2025-07-25 20:30:56 +02:00
Federico Ricciuti 0dd87354da 1. Specified that temperature=0 does not imply reproducibility (https://arxiv.org/pdf/2506.09501)
2. Pointed out that LLMs are generally less secure in low-resource languages
3. Made some order on the payloads for the bias test, now it using always the same base example.
2025-07-25 20:26:32 +02:00
Matteo Meucci 124c92f538 Merge pull request #24 from federicodotta/main 2025-07-25 14:55:11 +04:00
federicodotta 897c532bba + Planning instructions to avoid issues with token consumption 2025-07-25 12:18:11 +02:00
Matteo Meucci dfee7656c2 Merge pull request #23 from fedric95/main 2025-07-17 18:26:58 +04:00
Federico Ricciuti 9da16a16c1 Correction of the readme to refer to the correct changed test 2025-07-17 15:22:07 +02:00
Federico Ricciuti 977235af4d Introduction of the AITG-APP-10_Testing_for_Content_Bias.md, with tests to detect biased decisions made by the AI System. 2025-07-17 15:16:22 +02:00
Federico Ricciuti 49ee4b9d6c The unsafe output test now includes hate releated unsafe content as part of the tests.
AITG-APP-10_Testing_for_Harmful_Content_Bias.md replaced with AITG-APP-10_Testing_for_Content_Bias.md, and now it focuses on the detection of biases contened in the generated outputs.
2025-07-17 15:14:33 +02:00
Matteo Meucci 11e22f40cd Merge pull request #22 from federicodotta/main 2025-07-14 11:09:32 +04:00
federicodotta 82b7a18ef4 README updated 2025-07-14 08:19:58 +02:00
Matteo Meucci db71d7c1a4 Merge pull request #21 from federicodotta/main 2025-07-13 13:52:59 +04:00
federicodotta 2b16a5c5f3 + Testing Limitations and Requirements 2025-07-13 11:21:09 +02:00
Matteo Meucci 71b4f26900 Merge pull request #20 from fedric95/main 2025-07-12 21:30:58 +04:00
Federico Ricciuti 198167aebe - Introduced the necessity of defining a safety taxonomy before conducting the tests: the definition of what is safe and what is unsafe depends on the application.
- Linked an existing safety taxonomy
- Added examples of moderation models
- Removed most of the references to the concept of bias. They should be addressed in another test.

TO-DO

- Include tests that consider the potential multimodal nature of the application (right now it is more text-only)
- Make a specific test to evaluate the biases of the AI application under test and remove all the references to biases in this test
2025-07-12 19:12:00 +02:00
Matteo Meucci f4a5804a70 Merge pull request #19 from federicodotta/main 2025-07-12 16:42:53 +04:00
federicodotta 5dbedf3dc3 Prompt Injection Techniques section addeded 2025-07-12 13:51:10 +02:00
federicodotta 5a434e776b Update in typo tricks 2025-07-12 12:35:05 +02:00
federicodotta a56ba3f4e6 + Echo Chamber Attack 2025-07-12 12:24:58 +02:00
federicodotta b483d240cf + AntiGPT reference 2025-07-12 11:53:03 +02:00
federicodotta abfcbde568 + AntiGPT Prompt Injection 2025-07-12 11:49:27 +02:00
Matteo Meucci a6b1ed20fe Merge pull request #18 from mmorana1/patch-8
Update 2.1_Identify_AI_Threats.md
2025-07-09 20:11:59 +04:00
Marco Morana 250ead1ffc Update 2.1_Identify_AI_Threats.md
Re-aligned all references and links
2025-07-09 11:38:48 -04:00
Matteo Meucci d452ac3a95 Merge pull request #17 from mmorana1/patch-7
Update 2.1_Identify_AI_Threats.md
2025-07-09 18:34:53 +04:00
Marco Morana f821459f13 Update 2.1_Identify_AI_Threats.md
Reference more specialized taxonomies like the one developed by Pangea
2025-07-09 10:18:43 -04:00
Matteo Meucci 13315f501a Merge pull request #16 from mmorana1/patch-6
Update References.md
2025-07-09 18:08:20 +04:00
Marco Morana 5fef43e31f Update References.md
Added ref [23] to PJI taxonomy
2025-07-09 09:55:52 -04:00
Matteo Meucci 9ceb54ed27 Merge pull request #15 from mmorana1/patch-5 2025-07-09 10:37:59 +04:00
Marco Morana 2c6a41ef75 Update 2.1_Identify_AI_Threats.md
Add note on risk
2025-07-08 18:17:12 -04:00
Matteo Meucci 8175757126 Merge pull request #13 from mmorana1/patch-2 2025-07-08 22:36:47 +04:00
Marco Morana c17d9cdf46 Update README.md
Cosmetic changes
2025-07-01 14:59:33 -04:00
Matteo Meucci aa34513214 Merge pull request #12 from mmorana1/patch-2
Update README.md
2025-07-01 20:26:40 +02:00
Marco Morana def23545ab Update README.md
Added references to CSA Red Teaming guide and OWASP AI VSS
2025-07-01 14:16:04 -04:00
Matteo Meucci 4e44d02705 Merge pull request #11 from mmorana1/patch-1
Testing small edits
2025-06-30 22:52:26 +02:00
Marco Morana 84c9c7c989 Testing small edits 2025-06-30 15:36:22 -04:00
Matteo Meucci d7acc33f62 Merge pull request #10 from didier-durand/fix-typos
fixing typos in multiple texts.
2025-06-29 15:32:17 +02:00
Didier Durand e754867dd5 fixing typos in multiple texts. 2025-06-29 13:48:42 +02:00
Matteo Meucci fd20d35e01 Merge pull request #9 from GraoMelo/patch-1
Update 2.2_Appendix_B.md
2025-06-26 20:16:11 +02:00
GraoMelo b03267133e Update 2.2_Appendix_B.md
fixed #8
2025-06-26 15:12:53 -03:00
Matteo Meucci 451a558764 Merge pull request #6 from federicodotta/main
Updates to AITG-APP-01, AITG-APP-03, AITG-APP-05, AITG-APP-06, AITG-APP-07 and AITG-INF-02
2025-06-26 19:27:44 +02:00