
The NVIDIA DGX Spark is the productized version of Project DIGITS — a 150 mm cube housing the GB10 Grace Blackwell Superchip, 128 GB unified LPDDR5x, and the full CUDA AI stack out of the box. Tom's Hardware called it 'a well-rounded toolkit for local AI'; ServeTheHome called it 'must-have for AI developers'; LMSYS published the most thorough independent benchmarks. The 128 GB unified-memory ceiling is the headline feature: it loads models that would otherwise need a $30K+ multi-GPU rig. The catch is bandwidth-limited decode — LMSYS measured Llama-3.1 70B FP8 at 2.7 tokens/sec single-batch, while GPT-OSS 120B (MoE, ~17B active) hits ~14.5 tokens/sec per ServeTheHome. Best understood as a CUDA-native development box for buyers who need to iterate on big-model code without renting cloud GPUs.
- — 128 GB unified LPDDR5x memory — fits 70B FP8 / 120B Q4 / 405B with two clustered units
- — Full CUDA + NVIDIA AI stack preinstalled; the most polished local-AI dev box on the market
- — Compact 150 mm cube, 240 W max — fits any desk, runs cool and quiet
- — 273 GB/s LPDDR5x bandwidth caps decode tok/s on dense large models — 70B FP8 measures ~2.7 tok/s on a single unit
- — Linux-only, no Windows or gaming use; specialist hardware for AI developers
- — Price raised from $3,999 to $4,699 in February 2026 due to memory supply
