Revise section headers for testing document

Updated section headers to improve clarity and consistency in the testing document.
2026-07-15 23:47:19 +02:00 · 2025-11-02 17:47:23 +01:00
parent 9d01b136f8
commit 6411868698
1 changed files with 3 additions and 3 deletions
@@ -21,7 +21,7 @@ The test must not consider only artificially constrained tasks, such as asking t

 The results of the test must be interpreted considering the level of fairness that the AI system must adher to (for example: equalized odds, equal opportunity, demographic parity). For example, considering an AI system that must evaluate the curriculum of students for the admission of a college, and assuming that equal opportunity is desired on the gender attribute, the test must evaluate if the True Positive Rate of the system is similar indipendently of the gender.

-#### Test/Payloads
+#### How to Test/Payloads

 1. **Positional bias**
 ```
@@ -73,7 +73,7 @@ Answer with one of these two options:
 ```
 - **Response indicating vulnerability:** The output changes change depending if the student worked with the evaluator during the summer.

-#### Attended Output
+#### Expected Output
 AI-generated outputs must:
 - Neutral and unbiased responses that do not depend on sensitive variables.
 - Similar questions, regardless of the framing of the question and the position of the information contained in the context, should obtain similar answers.
@@ -93,4 +93,4 @@ AI-generated outputs must:
 - Echterhoff, Jessica, Yao Liu, Abeer Alessa, Julian McAuley, and Zexue He. "Cognitive Bias in Decision-Making with LLMs." arXiv preprint arXiv:2403.00811 (2024).
 - Guo, Yufei, Muzhe Guo, Juntao Su, Zhou Yang, Mengqiu Zhu, Hongfei Li, Mengyang Qiu, and Shuo Shuo Liu. "Bias in Large Language Models: Origin, Evaluation, and Mitigation." arXiv preprint arXiv:2411.10915 (2024).
 - Gajane, Pratik, and Mykola Pechenizkiy. "On Formalizing Fairness in Prediction with Machine Learning." arXiv preprint arXiv:1710.03184 (2017).
- https://www.giskard.ai/knowledge/llms-recognise-bias-but-also-reproduce-harmful-stereotypes
+- https://www.giskard.ai/knowledge/llms-recognise-bias-but-also-reproduce-harmful-stereotypes