diff --git a/README.md b/README.md index d853354..cb34188 100644 --- a/README.md +++ b/README.md @@ -29,7 +29,20 @@ git clone https://github.com/dongdongunique/EvoSynth.git cd EvoSynth pip install -r requirements.txt ``` +## Create an .env file +```bash +OPENAI_KEY="YOUR-KEY" +OPENAI_API_KEY="YOUR-KEY" +OPENAI_BASE_URL="YOUR-URL" + +LANGFUSE_PUBLIC_KEY="YOUR-KEY" +LANGFUSE_SECRET_KEY="YOUR-KEY" +LANGFUSE_HOST="https://cloud.langfuse.com" + +DEFAULT_MODEL="deepseek-chat" +OPENAI_MODEL="deepseek-chat" +``` ## Quick Start ### Environment Setup @@ -152,6 +165,12 @@ Results are evaluated using: - **LLM Judge**: Scores responses on a 1-5 scale. Prompts are from ActorAttack. - **Success Threshold**: Score >= 5 indicates successful jailbreak +## TODO + +- [x] **Phase 1: Framework Development** - Core architecture and multi-agent system implementation +- [ ] **Phase 2: SOTA LLM Testing** - Evaluating framework against recent released state-of-the-art LLMs (GPT-5.1, GEMINI 3.0 Pro, Claude 4.5 Opus etc.) +- [ ] **Phase 3: Dataset Curation** - Filtering and curating generated attack results to create a new research dataset + ## Ethical Disclaimer **For Defensive Security Research Only.** @@ -181,4 +200,4 @@ If you use EvoSynth in your research, please cite: ## Contributing -Contributions are welcome! Please read our contributing guidelines before submitting pull requests. +Contributions are welcome! diff --git a/requirements.txt b/requirements.txt index 4a66163..5ed00a0 100644 --- a/requirements.txt +++ b/requirements.txt @@ -1,27 +1,15 @@ # Core Dependencies -openai>=1.0.0 -openai-agents>=0.1.0 -nest-asyncio>=1.5.0 -aiofiles>=23.0.0 -httpx>=0.24.0 -python-dotenv>=1.0.0 +openai>=2.8.1 +openai-agents>=0.6.1 +nest-asyncio>=1.6.0 +aiofiles>=25.1.0 +python-dotenv>=1.2.1 # Data Processing -pandas>=2.0.0 -numpy>=1.24.0 - -# Machine Learning -torch>=2.0.0 -transformers>=4.30.0 -scikit-learn>=1.3.0 +pandas>=2.2.3 # Image Processing -Pillow>=10.0.0 -diffusers>=0.25.0 +Pillow>=12.0.0 # Progress & Utilities -tqdm>=4.65.0 - -# Optional: Observability -langfuse>=2.0.0 -logfire>=0.30.0 +tqdm>=4.67.1