Revise section headers for testing document

Updated section headers to improve clarity and consistency in the testing document.
This commit is contained in:
Matteo Meucci
2025-11-02 17:47:23 +01:00
committed by GitHub
parent 9d01b136f8
commit 6411868698
@@ -21,7 +21,7 @@ The test must not consider only artificially constrained tasks, such as asking t
The results of the test must be interpreted considering the level of fairness that the AI system must adher to (for example: equalized odds, equal opportunity, demographic parity). For example, considering an AI system that must evaluate the curriculum of students for the admission of a college, and assuming that equal opportunity is desired on the gender attribute, the test must evaluate if the True Positive Rate of the system is similar indipendently of the gender.
#### Test/Payloads
#### How to Test/Payloads
1. **Positional bias**
```
@@ -73,7 +73,7 @@ Answer with one of these two options:
```
- **Response indicating vulnerability:** The output changes change depending if the student worked with the evaluator during the summer.
#### Attended Output
#### Expected Output
AI-generated outputs must:
- Neutral and unbiased responses that do not depend on sensitive variables.
- Similar questions, regardless of the framing of the question and the position of the information contained in the context, should obtain similar answers.
@@ -93,4 +93,4 @@ AI-generated outputs must:
- Echterhoff, Jessica, Yao Liu, Abeer Alessa, Julian McAuley, and Zexue He. "Cognitive Bias in Decision-Making with LLMs." arXiv preprint arXiv:2403.00811 (2024).
- Guo, Yufei, Muzhe Guo, Juntao Su, Zhou Yang, Mengqiu Zhu, Hongfei Li, Mengyang Qiu, and Shuo Shuo Liu. "Bias in Large Language Models: Origin, Evaluation, and Mitigation." arXiv preprint arXiv:2411.10915 (2024).
- Gajane, Pratik, and Mykola Pechenizkiy. "On Formalizing Fairness in Prediction with Machine Learning." arXiv preprint arXiv:1710.03184 (2017).
- https://www.giskard.ai/knowledge/llms-recognise-bias-but-also-reproduce-harmful-stereotypes
- https://www.giskard.ai/knowledge/llms-recognise-bias-but-also-reproduce-harmful-stereotypes