mirror of
https://github.com/dongdongunique/EvoSynth.git
synced 2026-02-12 17:22:44 +00:00
Update README and requirements
This commit is contained in:
21
README.md
21
README.md
@@ -29,7 +29,20 @@ git clone https://github.com/dongdongunique/EvoSynth.git
|
||||
cd EvoSynth
|
||||
pip install -r requirements.txt
|
||||
```
|
||||
## Create an .env file
|
||||
```bash
|
||||
OPENAI_KEY="YOUR-KEY"
|
||||
OPENAI_API_KEY="YOUR-KEY"
|
||||
OPENAI_BASE_URL="YOUR-URL"
|
||||
|
||||
|
||||
LANGFUSE_PUBLIC_KEY="YOUR-KEY"
|
||||
LANGFUSE_SECRET_KEY="YOUR-KEY"
|
||||
LANGFUSE_HOST="https://cloud.langfuse.com"
|
||||
|
||||
DEFAULT_MODEL="deepseek-chat"
|
||||
OPENAI_MODEL="deepseek-chat"
|
||||
```
|
||||
## Quick Start
|
||||
|
||||
### Environment Setup
|
||||
@@ -152,6 +165,12 @@ Results are evaluated using:
|
||||
- **LLM Judge**: Scores responses on a 1-5 scale. Prompts are from ActorAttack.
|
||||
- **Success Threshold**: Score >= 5 indicates successful jailbreak
|
||||
|
||||
## TODO
|
||||
|
||||
- [x] **Phase 1: Framework Development** - Core architecture and multi-agent system implementation
|
||||
- [ ] **Phase 2: SOTA LLM Testing** - Evaluating framework against recent released state-of-the-art LLMs (GPT-5.1, GEMINI 3.0 Pro, Claude 4.5 Opus etc.)
|
||||
- [ ] **Phase 3: Dataset Curation** - Filtering and curating generated attack results to create a new research dataset
|
||||
|
||||
## Ethical Disclaimer
|
||||
|
||||
**For Defensive Security Research Only.**
|
||||
@@ -181,4 +200,4 @@ If you use EvoSynth in your research, please cite:
|
||||
|
||||
## Contributing
|
||||
|
||||
Contributions are welcome! Please read our contributing guidelines before submitting pull requests.
|
||||
Contributions are welcome!
|
||||
|
||||
@@ -1,27 +1,15 @@
|
||||
# Core Dependencies
|
||||
openai>=1.0.0
|
||||
openai-agents>=0.1.0
|
||||
nest-asyncio>=1.5.0
|
||||
aiofiles>=23.0.0
|
||||
httpx>=0.24.0
|
||||
python-dotenv>=1.0.0
|
||||
openai>=2.8.1
|
||||
openai-agents>=0.6.1
|
||||
nest-asyncio>=1.6.0
|
||||
aiofiles>=25.1.0
|
||||
python-dotenv>=1.2.1
|
||||
|
||||
# Data Processing
|
||||
pandas>=2.0.0
|
||||
numpy>=1.24.0
|
||||
|
||||
# Machine Learning
|
||||
torch>=2.0.0
|
||||
transformers>=4.30.0
|
||||
scikit-learn>=1.3.0
|
||||
pandas>=2.2.3
|
||||
|
||||
# Image Processing
|
||||
Pillow>=10.0.0
|
||||
diffusers>=0.25.0
|
||||
Pillow>=12.0.0
|
||||
|
||||
# Progress & Utilities
|
||||
tqdm>=4.65.0
|
||||
|
||||
# Optional: Observability
|
||||
langfuse>=2.0.0
|
||||
logfire>=0.30.0
|
||||
tqdm>=4.67.1
|
||||
|
||||
Reference in New Issue
Block a user