What it does
A memory budgeting calculator that goes deeper than the usual back-of-envelope formulas. Plug in your config and see a breakdown of weights, activations, KV cache, optimizer state, and gradient accumulation overhead.
What you’ll see
- Per-component memory breakdown (weights, activations, KV, optimizer, gradients)
- Sensitivity sliders: change one variable and watch the others react
- Parallelism comparison: DP vs TP vs PP vs FSDP at the same total budget
- “Will this fit?” check for popular GPUs (A100 40/80, H100 80/96, B200, MI300X)
Status
Beta — formulas validated against published benchmarks for Llama, Mistral, and Qwen families. Mixture-of-experts memory accounting is still being refined.