Matteo Meucci
c0c38b582e
Merge pull request #32 from zangobot/main
...
Include more testing tools, by dividing them between general-purpouse or domain-specific
2025-09-09 16:37:06 +02:00
Luca Demetrio
0749eeda55
Update AITG-MOD-01_Testing_for_Evasion_Attacks.md
...
Removed typo
2025-09-02 11:21:23 +02:00
Matteo Meucci
5268eff3ae
Merge pull request #31 from RoeiArpaly/main
...
Update AITG-APP-04_Testing_for_Input_Leakage.md
2025-09-01 09:43:06 +02:00
Roei Arpaly
4182d8f869
Update AITG-APP-04_Testing_for_Input_Leakage.md
...
Co-authored-by: Yoni Birman <birmanbirman@gmail.com >
2025-08-31 23:13:40 +03:00
Matteo Meucci
ddd1d12544
Merge pull request #29 from RoeiArpaly/main
...
Update AITG-APP-04_Testing_for_Input_Leakage.md
2025-08-13 10:53:21 +02:00
Roei Arpaly
296224d780
Update AITG-APP-04_Testing_for_Input_Leakage.md
...
adding adversarial input test cases
2025-08-13 11:46:54 +03:00
maurapintor
0ed6bb99ad
added secml-torch and adv-lib, updated description of deepsec
2025-08-08 10:16:15 +02:00
Luca Demetrio
be0385d8cf
Update AITG-MOD-01_Testing_for_Evasion_Attacks.md
...
Update AI security testing tools by adding difference between general-purpose and domain-specific libraries
2025-08-08 09:57:15 +02:00
Matteo Meucci
2399f8293b
Merge pull request #26 from fedric95/main
...
Hallucination - "Debunking" vs "Factuality and Misinformation"
2025-08-04 10:18:41 +02:00
Federico Ricciuti
befe2755c7
Introduced Debunking tests and a differentiation between "Factuality and Misinformation" and "Debunking" hallucinations. As described by Giskard in the Phrase benchmark.
2025-08-03 14:34:38 +02:00
Matteo Meucci
066bfaa2dd
Merge pull request #25 from fedric95/main
2025-07-26 00:17:36 +04:00
fedric95
d27026fda7
Merge branch 'OWASP:main' into main
2025-07-25 20:30:56 +02:00
Federico Ricciuti
0dd87354da
1. Specified that temperature=0 does not imply reproducibility ( https://arxiv.org/pdf/2506.09501 )
...
2. Pointed out that LLMs are generally less secure in low-resource languages
3. Made some order on the payloads for the bias test, now it using always the same base example.
2025-07-25 20:26:32 +02:00
Matteo Meucci
124c92f538
Merge pull request #24 from federicodotta/main
2025-07-25 14:55:11 +04:00
federicodotta
897c532bba
+ Planning instructions to avoid issues with token consumption
2025-07-25 12:18:11 +02:00
Matteo Meucci
dfee7656c2
Merge pull request #23 from fedric95/main
2025-07-17 18:26:58 +04:00
Federico Ricciuti
9da16a16c1
Correction of the readme to refer to the correct changed test
2025-07-17 15:22:07 +02:00
Federico Ricciuti
977235af4d
Introduction of the AITG-APP-10_Testing_for_Content_Bias.md, with tests to detect biased decisions made by the AI System.
2025-07-17 15:16:22 +02:00
Federico Ricciuti
49ee4b9d6c
The unsafe output test now includes hate releated unsafe content as part of the tests.
...
AITG-APP-10_Testing_for_Harmful_Content_Bias.md replaced with AITG-APP-10_Testing_for_Content_Bias.md, and now it focuses on the detection of biases contened in the generated outputs.
2025-07-17 15:14:33 +02:00
Matteo Meucci
11e22f40cd
Merge pull request #22 from federicodotta/main
2025-07-14 11:09:32 +04:00
federicodotta
82b7a18ef4
README updated
2025-07-14 08:19:58 +02:00
Matteo Meucci
db71d7c1a4
Merge pull request #21 from federicodotta/main
2025-07-13 13:52:59 +04:00
federicodotta
2b16a5c5f3
+ Testing Limitations and Requirements
2025-07-13 11:21:09 +02:00
Matteo Meucci
71b4f26900
Merge pull request #20 from fedric95/main
2025-07-12 21:30:58 +04:00
Federico Ricciuti
198167aebe
- Introduced the necessity of defining a safety taxonomy before conducting the tests: the definition of what is safe and what is unsafe depends on the application.
...
- Linked an existing safety taxonomy
- Added examples of moderation models
- Removed most of the references to the concept of bias. They should be addressed in another test.
TO-DO
- Include tests that consider the potential multimodal nature of the application (right now it is more text-only)
- Make a specific test to evaluate the biases of the AI application under test and remove all the references to biases in this test
2025-07-12 19:12:00 +02:00
Matteo Meucci
f4a5804a70
Merge pull request #19 from federicodotta/main
2025-07-12 16:42:53 +04:00
federicodotta
5dbedf3dc3
Prompt Injection Techniques section addeded
2025-07-12 13:51:10 +02:00
federicodotta
5a434e776b
Update in typo tricks
2025-07-12 12:35:05 +02:00
federicodotta
a56ba3f4e6
+ Echo Chamber Attack
2025-07-12 12:24:58 +02:00
federicodotta
b483d240cf
+ AntiGPT reference
2025-07-12 11:53:03 +02:00
federicodotta
abfcbde568
+ AntiGPT Prompt Injection
2025-07-12 11:49:27 +02:00
Matteo Meucci
a6b1ed20fe
Merge pull request #18 from mmorana1/patch-8
...
Update 2.1_Identify_AI_Threats.md
2025-07-09 20:11:59 +04:00
Marco Morana
250ead1ffc
Update 2.1_Identify_AI_Threats.md
...
Re-aligned all references and links
2025-07-09 11:38:48 -04:00
Matteo Meucci
d452ac3a95
Merge pull request #17 from mmorana1/patch-7
...
Update 2.1_Identify_AI_Threats.md
2025-07-09 18:34:53 +04:00
Marco Morana
f821459f13
Update 2.1_Identify_AI_Threats.md
...
Reference more specialized taxonomies like the one developed by Pangea
2025-07-09 10:18:43 -04:00
Matteo Meucci
13315f501a
Merge pull request #16 from mmorana1/patch-6
...
Update References.md
2025-07-09 18:08:20 +04:00
Marco Morana
5fef43e31f
Update References.md
...
Added ref [23] to PJI taxonomy
2025-07-09 09:55:52 -04:00
Matteo Meucci
9ceb54ed27
Merge pull request #15 from mmorana1/patch-5
2025-07-09 10:37:59 +04:00
Marco Morana
2c6a41ef75
Update 2.1_Identify_AI_Threats.md
...
Add note on risk
2025-07-08 18:17:12 -04:00
Matteo Meucci
8175757126
Merge pull request #13 from mmorana1/patch-2
2025-07-08 22:36:47 +04:00
Marco Morana
c17d9cdf46
Update README.md
...
Cosmetic changes
2025-07-01 14:59:33 -04:00
Matteo Meucci
aa34513214
Merge pull request #12 from mmorana1/patch-2
...
Update README.md
2025-07-01 20:26:40 +02:00
Marco Morana
def23545ab
Update README.md
...
Added references to CSA Red Teaming guide and OWASP AI VSS
2025-07-01 14:16:04 -04:00
Matteo Meucci
4e44d02705
Merge pull request #11 from mmorana1/patch-1
...
Testing small edits
2025-06-30 22:52:26 +02:00
Marco Morana
84c9c7c989
Testing small edits
2025-06-30 15:36:22 -04:00
Matteo Meucci
d7acc33f62
Merge pull request #10 from didier-durand/fix-typos
...
fixing typos in multiple texts.
2025-06-29 15:32:17 +02:00
Didier Durand
e754867dd5
fixing typos in multiple texts.
2025-06-29 13:48:42 +02:00
Matteo Meucci
fd20d35e01
Merge pull request #9 from GraoMelo/patch-1
...
Update 2.2_Appendix_B.md
2025-06-26 20:16:11 +02:00
GraoMelo
b03267133e
Update 2.2_Appendix_B.md
...
fixed #8
2025-06-26 15:12:53 -03:00
Matteo Meucci
451a558764
Merge pull request #6 from federicodotta/main
...
Updates to AITG-APP-01, AITG-APP-03, AITG-APP-05, AITG-APP-06, AITG-APP-07 and AITG-INF-02
2025-06-26 19:27:44 +02:00