Verdict
Ranked #3 of 5Reviewed by Mike Hun·April 25, 2026

HP Z6 G5 A

Averaged from 5 derived from review text
The verdict

The HP Z6 G5 A is the smallest Threadripper Pro OEM workstation on the market and the rational mid-tier pick under HP's flagship Z8 Fury G5. Reviewers across PCMag, AnandTech, StorageReview, Phoronix, and DEVELOP3D consistently praised its build quality, toolless serviceability, and 96-core CPU ceiling — StorageReview gave it their 'highest recommendation for a high-end tower workstation.' For local-LLM use, configurations with 1–3 RTX 6000 Ada GPUs (48 GB VRAM each at ~960 GB/s) deliver in the 25–40 tokens/sec range on Llama-3-70B Q4 single-GPU and substantially more with multi-GPU tensor parallelism. Note that none of the published professional reviews ran formal Llama-3 70B Q4 benchmarks, so LLM-specific performance numbers here are from single-GPU norms rather than published HP Z6 measurements specifically.

HP Z6 G5 A

Full review

Design and Form Factor

The HP Z6 G5 A is built around AMD's Ryzen Threadripper Pro 7000 WX-Series, scaling from 12 cores up to the 96-core 7995WX in the same 4U chassis. The unit measures 169 x 465 x 445 mm and ships with a built-in carrying handle that DEVELOP3D's review specifically called out as a thoughtful detail — high-core-count workstations are heavy, and an integrated handle matters more than its triviality suggests. The interior is toolless, with modular drive cages and PCIe support trays designed for servicing without screws. StorageReview praised the Z6 G5 A as the 'smallest Threadripper Pro OEM workstation' on the market, positioning it as a credible desk-side alternative to a full tower for buyers who don't need the Z8 Fury G5's 4-GPU footprint.

AI and Multi-GPU Configuration

The Z6 G5 A supports up to three dual-height professional GPUs — typically an RTX A6000 or RTX 6000 Ada Generation for serious AI work, or smaller A4000/A5000 cards for compute-light visualization. With three RTX 6000 Ada cards, the system pools 144 GB of VRAM at roughly 960 GB/s per card, which is substantially more memory bandwidth per dollar than the unified-memory boxes in this guide and enough to hold Llama-3-70B Q4 (~40 GB) entirely in VRAM with headroom for KV cache and a second concurrent model. Phoronix's testing focused on traditional rendering and CFD workloads where the 7995WX outpaced everything in its class; StorageReview ran UL Procyon AI Inference benchmarks and showed Real-ESRGAN completing in 83.8 seconds on RTX 6000 Ada via TensorRT versus 2,891 seconds on CPU — the kind of two-orders-of-magnitude speedup that justifies a multi-GPU box for any team doing AI development at scale.

Where It Falls Short

The Z6 G5 A's biggest practical issue is thermals on the highest core counts. StorageReview measured 95°C all-core temperatures on the 7995WX under sustained load — within spec, but at the high end of comfort. Buyers picking the 96-core variant should plan for adequate room ventilation. The other constraint is that none of the published professional reviews ran formal large-language-model benchmarks; reviewers focused on rendering, simulation, and traditional workstation workloads. LLM-specific performance numbers in this guide are extrapolated from single-GPU RTX 6000 Ada norms rather than measured on the Z6 G5 A directly. Finally, configuration pricing scales steeply: a 12-core entry config starts near $3,100, a typical 1-GPU AI build lands around $5,500, and a 96-core 3-GPU monster pushes $18,000+.

Who It's Best For

The HP Z6 G5 A is the right pick for AI teams that need multi-GPU capability and traditional workstation workloads on the same machine — mixed AI/CFD/rendering environments, research labs, and product teams whose engineers compile, simulate, and train on the same desk-side hardware. It's the rational tier under the Z8 Fury G5 for buyers who don't need 4 GPUs or the Z8's larger chassis. Versus the Puget Genesis II it trades the bespoke configurator for HP's enterprise service network. Versus the DGX Spark it's three to four times the price but offers a fundamentally more flexible machine — one that can run Windows, drive multiple displays, and serve as a primary workstation rather than a dedicated AI dev box.

Strengths

  • +Smallest Threadripper Pro OEM tower on the market — compact 4U chassis with built-in handle
  • +AMD Ryzen Threadripper Pro 7000 WX-Series scales from 12 to 96 cores at the same chassis price floor
  • +Toolless serviceability, modular interior, ECC DDR5 — enterprise pedigree at mid-tier pricing
  • +Supports up to 3 dual-height pro GPUs (RTX A6000, RTX 6000 Ada) for serious multi-GPU AI work

Watch-outs

  • 95°C all-core CPU thermals reported under sustained load (StorageReview)
  • Pricing scales steeply — 96-core configs push $18,000+
  • No published Llama 70B Q4 tokens/sec figures in mainstream reviews — LLM-specific benchmarking is thin

How it compares

The HP Z6 G5 A is the mid-tier sweet spot in this lineup. Versus the HP Z8 Fury G5 (its flagship sibling), it's a smaller chassis with the same Threadripper Pro CPU family at a noticeably lower entry price — trading the Z8's 4-GPU ceiling for a 3-GPU ceiling and a more desk-friendly footprint. Versus the Puget Genesis II, it offers similar build pedigree without Puget's bespoke configurator and handpicked components, at a meaningfully lower starting price. Versus the DGX Spark, it's a different class of machine — the HP Z6 G5 A is a multi-GPU general workstation, the Spark is a single-purpose 128 GB unified-memory dev box. Pick the HP Z6 G5 A when you need both AI horsepower and traditional workstation workloads (rendering, simulation, multi-app productivity) on the same machine.

Who this is for

At a glance: Best for mid-tier — Threadripper Pro multi-GPU under Z8 pricing.

Why you’d buy the HP Z6 G5 A

  • Smallest Threadripper Pro OEM tower on the market — compact 4U chassis with built-in handle.
  • AMD Ryzen Threadripper Pro 7000 WX-Series scales from 12 to 96 cores at the same chassis price floor.
  • Toolless serviceability, modular interior, ECC DDR5 — enterprise pedigree at mid-tier pricing.

Why you’d skip it

  • 95°C all-core CPU thermals reported under sustained load (StorageReview).
  • Pricing scales steeply — 96-core configs push $18,000+.
  • No published Llama 70B Q4 tokens/sec figures in mainstream reviews — LLM-specific benchmarking is thin.

Rating sources

Our 4.5 score is the average of these published ratings. Ratings marked * were derived from the reviewer’s written analysis or video transcript — the publisher didn’t print an explicit numeric score, so we inferred one from their own words. Click through to verify. More about methodology.

Frequently asked questions

Is the HP Z6 G5 A worth buying?
The HP Z6 G5 A is the smallest Threadripper Pro OEM workstation on the market and the rational mid-tier pick under HP's flagship Z8 Fury G5. Reviewers across PCMag, AnandTech, StorageReview, Phoronix, and DEVELOP3D consistently praised its build quality, toolless serviceability, and 96-core CPU ceiling — StorageReview gave it their 'highest recommendation for a high-end tower workstation.' For local-LLM use, configurations with 1–3 RTX 6000 Ada GPUs (48 GB VRAM each at ~960 GB/s) deliver in the 25–40 tokens/sec range on Llama-3-70B Q4 single-GPU and substantially more with multi-GPU tensor parallelism. Note that none of the published professional reviews ran formal Llama-3 70B Q4 benchmarks, so LLM-specific performance numbers here are from single-GPU norms rather than published HP Z6 measurements specifically.
What is the HP Z6 G5 A's biggest strength?
Smallest Threadripper Pro OEM tower on the market — compact 4U chassis with built-in handle
What is the main drawback of the HP Z6 G5 A?
95°C all-core CPU thermals reported under sustained load (StorageReview)
What sources back the 4.5/5 rating?
Our 4.5/5 rating is the average of scores from 5 independent ai workstations reviews — pcmag, anandtech, storagereview, phoronix, and develop3d. Click any source on the product page to read the original review.

How it compares

See all 5
Puget Systems Genesis II
#1 · Top Score

Puget Systems Genesis II

The Puget Systems Genesis II is the enterprise pick. Versus the HP Z8 Fury G5, it offers comparable scale-up capability but in a quieter chassis with a more thoughtful configurator. Versus the HP Z6 G5 A, it's two tiers up in price and ceiling. Versus the NVIDIA DGX Spark, it's a different class of machine entirely — the DGX Spark is a 128 GB unified-memory dev box, the Genesis II is a multi-GPU training/inference workstation. For buyers whose only goal is running large local LLMs, the DGX Spark is the more cost-effective answer; the Genesis II earns its premium when training, fine-tuning, or multi-application workstation duty are part of the picture.

NVIDIA DGX Spark
#2

NVIDIA DGX Spark

The DGX Spark is the cheapest path to 128 GB of CUDA-addressable unified memory anywhere on the market. Versus the GMKtec EVO-X2 ($1,699) or Beelink GTR9 Pro ($2,000), it's roughly 2.5x the price but offers the full NVIDIA software stack the Strix Halo boxes can only approximate via ROCm or Vulkan. Versus the Puget Genesis II ($10K+), it's a single-purpose dev box — no multi-display creative workflow, no gaming, no general workstation duty. Pair two Sparks via the ConnectX-7 networking and you get 405B-class model coverage at roughly $9,400, the cheapest legal path to that ceiling.

HP Z8 Fury G5
#4

HP Z8 Fury G5

Similar to the Dell Precision 7960 Tower, the HP Z8 Fury G5 supports four-GPU configurations for extreme parallel processing, but it differentiates itself with a built-in handle and a design prioritizing easy serviceability. Versus its smaller sibling the HP Z6 G5 A, the Z8 Fury G5 is the right pick when you genuinely need 4 GPUs (versus 3) or the Xeon W9 platform's enterprise ECC and reliability features. Versus the Puget Genesis II, the Z8 Fury G5 brings HP's enterprise service network and parts availability, while Puget brings hand-tuned assembly and a more thoughtful configurator. Versus the Apple Mac Studio M3 Ultra, the Z8 Fury G5 is twice the size and triple the price for a 1-GPU build, but unlocks training-class workloads the Mac Studio cannot touch.

Apple Mac Studio M3 Ultra
#5

Apple Mac Studio M3 Ultra

The Apple Mac Studio M3 Ultra is the best Mac-ecosystem AI workstation and competitive on raw local-LLM throughput per dollar. Versus the DGX Spark ($4,699 / 128 GB), the base Mac Studio M3 Ultra ($3,999 / 96 GB) loses on memory ceiling but wins on memory bandwidth (819 vs 273 GB/s) — meaning faster decode tok/s on dense models that fit. Step up to a 256 GB or 512 GB Mac Studio config and you exceed the Spark's memory ceiling at higher bandwidth, at the cost of premium Apple memory pricing. Versus the multi-GPU PC workstations (Puget, HP Z6/Z8), the Mac Studio cannot match peak training throughput but is silent, half the size, and roughly half the price of an equivalent dual-GPU PC build.

HP Z6 G5 A
4.5/5· $5,499
Buy at hp.com