10 Commits

Author SHA1 Message Date
Stella Biderman 501ff0c963 Add gpu-calc command and document precision/quantization options
New `obliteratus gpu-calc` subcommand estimates minimum GPU count from
model params, dtype, and GPU VRAM. Auto-detects param counts from HF
configs including MoE expert structure.

README now covers --dtype, --quantization flags, the gpu-calc command,
and references both in the "Choosing the right setup" table.
2026-03-17 14:01:18 -04:00
Stella Biderman 79b469d3dc Add DeepSeek-R1-Distill-Llama-70B pipeline parallel benchmarks
Benchmarked 70B dense model (149 GB bf16) on 2/3/4/8 A100-80GB GPUs.
3 GPUs was fastest (536s), confirming minimum-viable-GPU-count guidance.
Combined stage breakdown table for both models.
2026-03-16 15:29:57 -04:00
Stella Biderman c723da02c8 Document multi-GPU parallelism, benchmarks, and remote SSH execution
Add a comprehensive section covering:
- How model sharding (pipeline parallelism) works and its limitations
- GPU selection via --gpus flag
- Pipeline parallel benchmarks on GPT-OSS-120B across 3-8 A100-80GB GPUs
- Stage-by-stage timing breakdown
- When data parallelism helps (and when it doesn't)
- Remote SSH execution with CLI and YAML examples
- Decision table for choosing the right setup
2026-03-16 14:39:22 -04:00
Stella Biderman a2bb748f1b Revert "Add data parallel support for PROBE stage"
This reverts commit 1a6e2577bb.
2026-03-13 16:54:31 -04:00
Stella Biderman 1a6e2577bb Add data parallel support for PROBE stage
When --data-parallel is passed and the model fits on a single GPU,
wraps it with nn.DataParallel to split prompt batches across all
available GPUs during activation collection. Batch size scales by
GPU count. Hooks already move activations to CPU so they work
correctly across replicas.
2026-03-13 01:24:31 -04:00
pliny 984ce14059 Add files via upload 2026-03-05 10:03:46 -08:00
pliny 66ea4a6f86 Add files via upload 2026-03-05 00:50:44 -08:00
pliny f67f13ca57 Update README.md 2026-03-04 13:35:24 -08:00
pliny 904092fcdb Update README.md 2026-03-04 12:44:17 -08:00
pliny 0f6114fe87 Add files via upload 2026-03-04 12:38:18 -08:00