
The HP Z6 G5 A is the smallest Threadripper Pro OEM workstation on the market and the rational mid-tier pick under HP's flagship Z8 Fury G5. Reviewers across PCMag, AnandTech, StorageReview, Phoronix, and DEVELOP3D consistently praised its build quality, toolless serviceability, and 96-core CPU ceiling — StorageReview gave it their 'highest recommendation for a high-end tower workstation.' For local-LLM use, configurations with 1–3 RTX 6000 Ada GPUs (48 GB VRAM each at ~960 GB/s) deliver in the 25–40 tokens/sec range on Llama-3-70B Q4 single-GPU and substantially more with multi-GPU tensor parallelism. Note that none of the published professional reviews ran formal Llama-3 70B Q4 benchmarks, so LLM-specific performance numbers here are from single-GPU norms rather than published HP Z6 measurements specifically.
- — Smallest Threadripper Pro OEM tower on the market — compact 4U chassis with built-in handle
- — AMD Ryzen Threadripper Pro 7000 WX-Series scales from 12 to 96 cores at the same chassis price floor
- — Toolless serviceability, modular interior, ECC DDR5 — enterprise pedigree at mid-tier pricing
- — 95°C all-core CPU thermals reported under sustained load (StorageReview)
- — Pricing scales steeply — 96-core configs push $18,000+
- — No published Llama 70B Q4 tokens/sec figures in mainstream reviews — LLM-specific benchmarking is thin
