120 Commits

Author SHA1 Message Date
lightbroker
fc80135823 Merge branch 'main' into scheduled-test-runs 2025-11-12 15:17:52 -07:00
Adam Wilson
0f1ca26821 Update guidelines.yml 2025-09-02 15:13:51 -06:00
Adam Wilson
377a38e786 Update guidelines.yml 2025-09-01 13:48:39 -06:00
Adam Wilson
a7d582fc4f Update guidelines.yml 2025-09-01 13:44:51 -06:00
Adam Wilson
75552bf87e Update guidelines.yml 2025-09-01 13:43:39 -06:00
Adam Wilson
abf8c43d46 end test workflow runs 2025-08-31 13:27:52 -06:00
Adam Wilson
aa779c343b increase test #0 (baseline) runs 2025-08-30 14:28:26 -06:00
Adam Wilson
1b22ad0670 redistribute cron schedules (heavy catch up for test #0 samples; move some from #4 to increase #2) 2025-08-30 08:53:37 -06:00
Adam Wilson
93a1f563a9 redistribute cron schedules (emphasize test #4) 2025-08-28 19:44:45 -06:00
Adam Wilson
b1a374b827 redistribute cron schedules (emphasize tests #2, #4) 2025-08-28 06:53:41 -06:00
Adam Wilson
12403d5e56 redistribute cron schedules 2025-08-26 19:00:50 -06:00
Adam Wilson
f971df2741 redistribute cron schedules 2025-08-26 15:34:46 -06:00
Adam Wilson
5293143eb5 redistribute cron schedules 2025-08-26 08:59:43 -06:00
Adam Wilson
450d28293f times 2025-08-21 18:57:19 -06:00
Adam Wilson
117fa4f257 update cron expressions; 3 workflows/hr; 16 minute timeout 2025-08-21 16:05:56 -06:00
Adam Wilson
c3b611fc7c workflow names 2025-08-21 13:26:05 -06:00
Adam Wilson
586c4d5115 update scheduled jobs 2025-08-21 12:50:09 -06:00
Adam Wilson
046e622aba test 3 base 2025-08-21 10:33:51 -06:00
Adam Wilson
62a87c6fd9 cleanup for test #4 scheduled runs 2025-08-21 06:36:42 -06:00
Adam Wilson
ab012ab21d Update test_04_malicious_prompts_rag_and_cot_microsoft_phi_3_mini4k_instruct.81-100.yml 2025-08-20 14:02:35 -06:00
Adam Wilson
1a55a020bb Update test_04_malicious_prompts_rag_and_cot_microsoft_phi_3_mini4k_instruct.61-80.yml 2025-08-20 14:01:32 -06:00
Adam Wilson
7f3f154f4c Merge pull request #57 from lightbroker/lightbroker-patch-2
Update test_04_malicious_prompts_rag_and_cot_microsoft_phi_3_mini4k_i…
2025-08-20 13:42:27 -06:00
Adam Wilson
cc90afb6b6 Update test_04_malicious_prompts_rag_and_cot_microsoft_phi_3_mini4k_instruct.41-60.yml 2025-08-20 13:41:39 -06:00
Adam Wilson
424a52df43 Update test_04_malicious_prompts_rag_and_cot_microsoft_phi_3_mini4k_instruct.21-40.yml 2025-08-20 13:41:17 -06:00
Adam Wilson
290bc573c2 Update test_04_malicious_prompts_rag_and_cot_microsoft_phi_3_mini4k_instruct.01-20.yml 2025-08-20 13:40:20 -06:00
Adam Wilson
3283e7e68c Update test_04.abstract_base.yml 2025-08-20 13:39:01 -06:00
Adam Wilson
6b682f84a8 try 2025-08-20 11:33:27 -06:00
Adam Wilson
22bcc8e598 model name 2025-08-20 06:25:26 -06:00
Adam Wilson
aca0128877 file path 2025-08-19 21:58:51 -06:00
Adam Wilson
cb9d230db3 req 2025-08-19 20:18:34 -06:00
Adam Wilson
f32c8b2410 version 2025-08-19 20:11:34 -06:00
Adam Wilson
cc124a91a3 support batch tests 2025-08-19 20:09:34 -06:00
Adam Wilson
74f99a36ec unneeded "build" phase 2025-08-04 21:37:35 -06:00
Adam Wilson
4c317634a3 fix order; add PR trigger 2025-08-04 21:23:56 -06:00
Adam Wilson
865a4b923a job dependencies 2025-08-04 08:02:59 -06:00
Adam Wilson
3b5c7b9f69 CI/CD test 2025-08-04 07:59:02 -06:00
Adam Wilson
cb1be6746f support testing malicious prompts with no guidelines 2025-06-28 12:18:35 -06:00
Adam Wilson
8b42671e4a mark workflows as deprecated 2025-05-28 16:52:32 -06:00
Adam Wilson
121b17633e duplicate working workflow; no RAG 2025-05-28 08:48:22 -06:00
Adam Wilson
2495d72c9a 200 max attempts 2025-05-28 06:09:59 -06:00
Adam Wilson
6d6082cf00 fix garak config paths 2025-05-28 05:57:23 -06:00
Adam Wilson
5bf67d7432 fix garak config paths 2025-05-28 05:25:16 -06:00
Adam Wilson
a4f834e033 comment out misconfigured HTTP test 2025-05-28 05:19:29 -06:00
Adam Wilson
bfa4c82f1e Use bash script to set up workflow 2025-05-27 07:48:29 -06:00
Adam Wilson
8bb4a473ca restructure as domain-driven design architecture 2025-05-23 13:48:29 -06:00
Adam Wilson
ff429365ac remove unused files, imports 2025-05-21 14:00:19 -06:00
Adam Wilson
d8a6609b8b increase parallel attempts 2025-05-20 17:04:30 -06:00
Adam Wilson
6b104bfcc3 debugging in workflow 2025-05-20 16:44:08 -06:00
Adam Wilson
8266342fda debugging in workflow 2025-05-20 16:40:08 -06:00
Adam Wilson
38a346d876 fix generator_option_file ref 2025-05-20 14:51:32 -06:00