New `obliteratus gpu-calc` subcommand estimates the minimum GPU count from
model parameter count, dtype, and per-GPU VRAM. It auto-detects parameter
counts from HF configs, including MoE expert structure.
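The arithmetic behind an estimate like this is straightforward. A minimal sketch, with illustrative names (`min_gpu_count`, the `overhead` fudge factor) that are assumptions, not the actual `gpu-calc` implementation:

```python
# Hedged sketch of a gpu-calc style estimate: weights-only memory footprint
# plus a fudge factor, divided by per-GPU VRAM. Names are illustrative.
import math

BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1, "int4": 0.5}

def min_gpu_count(n_params: float, dtype: str, vram_gb: float,
                  overhead: float = 1.2) -> int:
    """Estimate the minimum number of GPUs needed to hold the model.

    n_params:  total parameter count (e.g. 120e9 for a 120B model)
    overhead:  assumed headroom for activations/KV cache and framework state
    """
    weight_bytes = n_params * BYTES_PER_PARAM[dtype]
    needed_gb = weight_bytes * overhead / 1024**3
    return max(1, math.ceil(needed_gb / vram_gb))

# e.g. a 120B-parameter model in bfloat16 on 80 GB GPUs
print(min_gpu_count(120e9, "bfloat16", 80))  # → 4
```

For quantized dtypes the same formula applies with fewer bytes per parameter, which is why `--dtype` and `--quantization` feed directly into the estimate.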
README now covers the --dtype and --quantization flags and the gpu-calc
command, and references all of them in the "Choosing the right setup" table.
Adds a comprehensive section covering:
- How model sharding (pipeline parallelism) works and its limitations
- GPU selection via --gpus flag
- Pipeline parallel benchmarks on GPT-OSS-120B across 3-8 A100-80GB GPUs
- Stage-by-stage timing breakdown
- When data parallelism helps (and when it doesn't)
- Remote SSH execution with CLI and YAML examples
- Decision table for choosing the right setup
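To make the sharding bullet concrete: a toy sketch of naive pipeline parallelism, assuming the common layers-split-into-contiguous-stages approach (helper names like `shard_layers` are hypothetical, not the project's API). It also shows the main limitation: for a single batch, only one stage's device is busy at a time.

```python
# Naive pipeline parallelism sketch: contiguous chunks of layers become
# stages, each stage lives on one device, and activations hop between
# devices. Stages run sequentially for a given batch.
import torch
import torch.nn as nn

def shard_layers(layers, devices):
    """Assign contiguous chunks of layers to devices, one chunk per device."""
    per = (len(layers) + len(devices) - 1) // len(devices)
    stages = [(nn.Sequential(*layers[i * per:(i + 1) * per]).to(dev), dev)
              for i, dev in enumerate(devices)]
    return [s for s in stages if len(s[0]) > 0]

def pipeline_forward(stages, x):
    for stage, dev in stages:
        x = stage(x.to(dev))  # activation transfer is the inter-stage cost
    return x

layers = [nn.Linear(8, 8) for _ in range(6)]
devices = ["cpu", "cpu"]  # stand-ins; real runs use "cuda:0", "cuda:1", ...
stages = shard_layers(layers, devices)
out = pipeline_forward(stages, torch.randn(4, 8))
print(out.shape)  # torch.Size([4, 8])
```

This sequential-stage behavior is why adding GPUs to a pipeline mainly buys capacity rather than throughput, which the benchmarks and decision table in the section are meant to illustrate.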
When --data-parallel is passed and the model fits on a single GPU, the
model is wrapped in nn.DataParallel, which splits prompt batches across
all available GPUs during activation collection; effective batch size
scales with GPU count. Hooks already move activations to CPU, so they
work correctly across replicas.
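A minimal sketch of this path, assuming the standard torch hook API; the function and variable names (`collect_activations`, `acts`) are illustrative, not the project's actual code:

```python
# Sketch of data-parallel activation collection: forward hooks detach and
# copy activations to CPU, so the collected tensors end up in host memory
# regardless of which GPU replica produced them. If --data-parallel is set
# and multiple GPUs exist, nn.DataParallel splits each batch across them.
import torch
import torch.nn as nn

def collect_activations(model, batches, use_data_parallel=False):
    acts = []

    def hook(_module, _inputs, output):
        acts.append(output.detach().cpu())  # off-device: safe across replicas

    handle = model.register_forward_hook(hook)
    if use_data_parallel and torch.cuda.device_count() > 1:
        model = nn.DataParallel(model)  # replicates the module per GPU

    with torch.no_grad():
        for x in batches:
            model(x)
    handle.remove()
    return acts

model = nn.Linear(16, 4)
batches = [torch.randn(8, 16) for _ in range(2)]
acts = collect_activations(model, batches)
print(len(acts), acts[0].shape)  # 2 torch.Size([8, 4])
```

Note that with nn.DataParallel the hook fires once per replica per forward pass, so the CPU move also avoids holding per-GPU tensors alive after each batch.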