Update README and requirements

2026-02-12 17:22:44 +00:00 · 2025-12-10 03:16:48 +08:00
parent f3af94df0b
commit 93f0702172
2 changed files with 28 additions and 21 deletions
--- a/README.md
+++ b/README.md
@@ -29,7 +29,20 @@ git clone https://github.com/dongdongunique/EvoSynth.git
 cd EvoSynth
 pip install -r requirements.txt
 ```
+## Create an .env file
+```bash
+OPENAI_KEY="YOUR-KEY"
+OPENAI_API_KEY="YOUR-KEY"
+OPENAI_BASE_URL="YOUR-URL"

+
+LANGFUSE_PUBLIC_KEY="YOUR-KEY"
+LANGFUSE_SECRET_KEY="YOUR-KEY"
+LANGFUSE_HOST="https://cloud.langfuse.com"
+
+DEFAULT_MODEL="deepseek-chat"
+OPENAI_MODEL="deepseek-chat"
+```
 ## Quick Start

 ### Environment Setup
@@ -152,6 +165,12 @@ Results are evaluated using:
 - **LLM Judge**: Scores responses on a 1-5 scale. Prompts are from ActorAttack.
 - **Success Threshold**: Score >= 5 indicates successful jailbreak

+## TODO
+
+- [x] **Phase 1: Framework Development** - Core architecture and multi-agent system implementation
+- [ ] **Phase 2: SOTA LLM Testing** - Evaluating framework against recent released state-of-the-art LLMs (GPT-5.1, GEMINI 3.0 Pro, Claude 4.5 Opus etc.)
+- [ ] **Phase 3: Dataset Curation** - Filtering and curating generated attack results to create a new research dataset
+
 ## Ethical Disclaimer

 **For Defensive Security Research Only.**
@@ -181,4 +200,4 @@ If you use EvoSynth in your research, please cite:

 ## Contributing

-Contributions are welcome! Please read our contributing guidelines before submitting pull requests.
+Contributions are welcome! 
--- a/requirements.txt
+++ b/requirements.txt
@@ -1,27 +1,15 @@
 # Core Dependencies
-openai>=1.0.0
-openai-agents>=0.1.0
-nest-asyncio>=1.5.0
-aiofiles>=23.0.0
-httpx>=0.24.0
-python-dotenv>=1.0.0
+openai>=2.8.1
+openai-agents>=0.6.1
+nest-asyncio>=1.6.0
+aiofiles>=25.1.0
+python-dotenv>=1.2.1

 # Data Processing
-pandas>=2.0.0
-numpy>=1.24.0
-
-# Machine Learning
-torch>=2.0.0
-transformers>=4.30.0
-scikit-learn>=1.3.0
+pandas>=2.2.3

 # Image Processing
-Pillow>=10.0.0
-diffusers>=0.25.0
+Pillow>=12.0.0

 # Progress & Utilities
-tqdm>=4.65.0
-
-# Optional: Observability
-langfuse>=2.0.0
-logfire>=0.30.0
+tqdm>=4.67.1