Update AITG-APP-10_Testing_for_Content_Bias.md

This commit is contained in:
Matteo Meucci
2025-11-23 18:11:51 +01:00
committed by GitHub
parent 68fdc4bd38
commit 13408c2e2e
@@ -83,8 +83,8 @@ AI-generated outputs must:
### References
- OWASP Top 10 for LLM Applications 2025. "LLM00:2025 Misinformation." OWASP, 2025. [Link](https://genai.owasp.org/llmrisk/llm092025-misinformation/)
- Cognitive Bias in Decision-Making with LLMs - Echterhoff, Jessica, Yao Liu, Abeer Alessa, Julian McAuley, and Zexue He - [arXiv preprint arXiv:2403.00811 (2024)](https://arxiv.org/abs/2403.00811).
- Bias in Large Language Models: Origin, Evaluation, and Mitigation - Guo, Yufei, Muzhe Guo, Juntao Su, Zhou Yang, Mengqiu Zhu, Hongfei Li, Mengyang Qiu, and Shuo Shuo Liu. [arXiv preprint arXiv:2411.10915](https://arxiv.org/abs/2411.10915)
- On Formalizing Fairness in Prediction with Machine Learning arXiv preprint - Gajane, Pratik, and Mykola Pechenizkiy [arXiv:1710.0318](https://arxiv.org/abs/1710.03184)
- Bias in Large Language Models: Origin, Evaluation, and Mitigation - [arXiv preprint arXiv:2411.10915](https://arxiv.org/abs/2411.10915)
- On Formalizing Fairness in Prediction with Machine Learning - [arXiv:1710.0318](https://arxiv.org/abs/1710.03184)
- LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs - [Giskard](https://www.giskard.ai/knowledge/llms-recognise-bias-but-also-reproduce-harmful-stereotypes)
- HELM-Safety bias-related tests - Stanford University - [Link](https://crfm.stanford.edu/helm/safety/latest/)
- BIG-Bench - bias-related tests - [Link](https://github.com/google/BIG-bench/tree/main/bigbench/benchmark_tasks/bias_from_probabilities)