Verdict
Ranked #4 of 5Reviewed by Mike Hun·April 25, 2026

HP Z8 Fury G5

Averaged from 2 published ratings + 2 derived from video reviews
The verdict

The HP Z8 Fury G5 is HP's flagship workstation — a formidable, highly scalable tower designed specifically for demanding professional media, VFX, and AI creators who need a four-GPU ceiling. Built around Intel's Xeon W9-3495X (56 cores), 128 GB DDR5 ECC, and up to four NVIDIA RTX A6000 cards, it is a credible local-LLM training and inference rig at the upper end. The configured price varies enormously: a 1-GPU base build lands around $7,995, a 2-GPU build around $14,000, and a fully loaded 4x RTX A6000 configuration pushes well past $25,000. The price field below reflects a typical 1-GPU configured build; readers planning multi-GPU AI work should expect to roughly triple that figure.

HP Z8 Fury G5

Full review

Design and Build

The HP Z8 Fury G5 is HP's largest workstation chassis — a full tower built around Intel's Sapphire Rapids-WS Xeon W9 platform with the explicit design goal of supporting four dual-height professional GPUs. The chassis includes a built-in carrying handle, toolless drive bays, and modular PCIe support trays. PCMag's review noted the design language is 'serviceable rather than premium' — black plastic interior surfaces rather than the brushed aluminum found on Mac Pros — but the engineering is sound and the layout makes adding or removing GPUs a five-minute job rather than a half-day teardown. Reviewers across PCMag UK and IT Creations consistently called out serviceability as a primary virtue: this is a machine designed to spend years in a corporate IT fleet, not a desk ornament.

Multi-GPU AI Performance

The Z8 Fury G5's value proposition for AI workloads is the four-GPU ceiling. With four NVIDIA RTX A6000 cards — each contributing 48 GB of GDDR6 ECC at 768 GB/s — the system pools 192 GB of VRAM with NVLink-paired bandwidth. That is enough VRAM to hold Llama-3-405B at low-bit quantization (Q3-Q4 territory) entirely in GPU memory, or to run a 70B Q4 model with substantial KV cache and a second concurrent model loaded. Single-GPU 70B Q4 inference lands in the 25–40 tokens/sec range; four-GPU tensor-parallel inference scales to roughly 40–60 tokens/sec depending on batch size and the specific inference engine (vLLM, TensorRT-LLM). The published professional reviews focus on traditional rendering and simulation rather than LLM-specific benchmarks, so these numbers are extrapolated from RTX A6000 norms rather than measured on the Z8 specifically; the multi-GPU scaling assumes well-tuned tensor parallelism.

Configuration and Pricing Reality

Pricing on the Z8 Fury G5 varies by an order of magnitude depending on configuration. HP's bare-CPU starter SKU lists in the high-$2K to low-$3K range without GPUs and minimal RAM, which is what some prior shopping comparisons quoted — but that configuration is essentially useless for AI work. A practical entry build with 1x RTX A6000, 128 GB DDR5 ECC, and 2 TB NVMe lands around $7,995, which is the figure shown in the price field above and reflected in the spec table. A 2x A6000 build is closer to $14,000. A fully loaded 4x A6000 build with 1 TB RAM and redundant PSU pushes well past $25,000. Buyers planning multi-GPU AI work should think of this as a $15K–$30K class machine; the $7,995 number represents a 1-GPU starting point, not a finished AI workstation.

Where It Falls Short

The Z8 Fury G5's primary downsides are size, cost, and interior presentation. The full-tower chassis demands serious desk or floor space — buyers running it under a desk should measure first. Configuration cost scales steeply once GPUs are added, and the price gap between an entry 1-GPU build and a fully loaded 4-GPU build is more than $20,000. Some reviewers called the interior trim 'plain' compared to premium boutique workstations like the Mac Pro, though this matters less to operators than to buyers who plan to look at the open chassis frequently. For buyers who specifically need the 4-GPU ceiling, the Z8 Fury G5 has fewer compromises than its peers; for buyers whose AI workloads fit in 1–2 GPUs, the smaller HP Z6 G5 A delivers most of the same capability in a more desk-friendly package.

Strengths

  • +Supports up to a four-GPU configuration for extreme parallel AI inference and tensor-parallel training
  • +Features an easily accessible design with a built-in handle for serviceability
  • +Offers a massive range of customization options for specific workloads
  • +Includes an optional redundant power supply for critical uptime

Watch-outs

  • Scaling up configurations becomes prohibitively expensive — 4x A6000 builds push $25,000+
  • Enormous tower chassis requires significant floor or desk space
  • Interior uses plain black plastic rather than premium materials

How it compares

Similar to the Dell Precision 7960 Tower, the HP Z8 Fury G5 supports four-GPU configurations for extreme parallel processing, but it differentiates itself with a built-in handle and a design prioritizing easy serviceability. Versus its smaller sibling the HP Z6 G5 A, the Z8 Fury G5 is the right pick when you genuinely need 4 GPUs (versus 3) or the Xeon W9 platform's enterprise ECC and reliability features. Versus the Puget Genesis II, the Z8 Fury G5 brings HP's enterprise service network and parts availability, while Puget brings hand-tuned assembly and a more thoughtful configurator. Versus the Apple Mac Studio M3 Ultra, the Z8 Fury G5 is twice the size and triple the price for a 1-GPU build, but unlocks training-class workloads the Mac Studio cannot touch.

Who this is for

At a glance: Best for for 4-gpu training and inference — enterprise tier with HP support.

Why you’d buy the HP Z8 Fury G5

  • Supports up to a four-GPU configuration for extreme parallel AI inference and tensor-parallel training.
  • Features an easily accessible design with a built-in handle for serviceability.
  • Offers a massive range of customization options for specific workloads.

Why you’d skip it

  • Scaling up configurations becomes prohibitively expensive — 4x A6000 builds push $25,000+.
  • Enormous tower chassis requires significant floor or desk space.
  • Interior uses plain black plastic rather than premium materials.

Rating sources

Our 4.4 score is the average of these published ratings. Ratings marked * were derived from the reviewer’s written analysis or video transcript — the publisher didn’t print an explicit numeric score, so we inferred one from their own words. Click through to verify. More about methodology.

Frequently asked questions

Is the HP Z8 Fury G5 worth buying?
The HP Z8 Fury G5 is HP's flagship workstation — a formidable, highly scalable tower designed specifically for demanding professional media, VFX, and AI creators who need a four-GPU ceiling. Built around Intel's Xeon W9-3495X (56 cores), 128 GB DDR5 ECC, and up to four NVIDIA RTX A6000 cards, it is a credible local-LLM training and inference rig at the upper end. The configured price varies enormously: a 1-GPU base build lands around $7,995, a 2-GPU build around $14,000, and a fully loaded 4x RTX A6000 configuration pushes well past $25,000. The price field below reflects a typical 1-GPU configured build; readers planning multi-GPU AI work should expect to roughly triple that figure.
What is the HP Z8 Fury G5's biggest strength?
Supports up to a four-GPU configuration for extreme parallel AI inference and tensor-parallel training
What is the main drawback of the HP Z8 Fury G5?
Scaling up configurations becomes prohibitively expensive — 4x A6000 builds push $25,000+
What sources back the 4.4/5 rating?
Our 4.4/5 rating is the average of scores from 4 independent ai workstations reviews — pcmag, uk.pcmag, IT Creations, and HP Z8 Fury G5. Click any source on the product page to read the original review.

How it compares

See all 5
Puget Systems Genesis II
#1 · Top Score

Puget Systems Genesis II

The Puget Systems Genesis II is the enterprise pick. Versus the HP Z8 Fury G5, it offers comparable scale-up capability but in a quieter chassis with a more thoughtful configurator. Versus the HP Z6 G5 A, it's two tiers up in price and ceiling. Versus the NVIDIA DGX Spark, it's a different class of machine entirely — the DGX Spark is a 128 GB unified-memory dev box, the Genesis II is a multi-GPU training/inference workstation. For buyers whose only goal is running large local LLMs, the DGX Spark is the more cost-effective answer; the Genesis II earns its premium when training, fine-tuning, or multi-application workstation duty are part of the picture.

NVIDIA DGX Spark
#2

NVIDIA DGX Spark

The DGX Spark is the cheapest path to 128 GB of CUDA-addressable unified memory anywhere on the market. Versus the GMKtec EVO-X2 ($1,699) or Beelink GTR9 Pro ($2,000), it's roughly 2.5x the price but offers the full NVIDIA software stack the Strix Halo boxes can only approximate via ROCm or Vulkan. Versus the Puget Genesis II ($10K+), it's a single-purpose dev box — no multi-display creative workflow, no gaming, no general workstation duty. Pair two Sparks via the ConnectX-7 networking and you get 405B-class model coverage at roughly $9,400, the cheapest legal path to that ceiling.

HP Z6 G5 A
#3

HP Z6 G5 A

The HP Z6 G5 A is the mid-tier sweet spot in this lineup. Versus the HP Z8 Fury G5 (its flagship sibling), it's a smaller chassis with the same Threadripper Pro CPU family at a noticeably lower entry price — trading the Z8's 4-GPU ceiling for a 3-GPU ceiling and a more desk-friendly footprint. Versus the Puget Genesis II, it offers similar build pedigree without Puget's bespoke configurator and handpicked components, at a meaningfully lower starting price. Versus the DGX Spark, it's a different class of machine — the HP Z6 G5 A is a multi-GPU general workstation, the Spark is a single-purpose 128 GB unified-memory dev box. Pick the HP Z6 G5 A when you need both AI horsepower and traditional workstation workloads (rendering, simulation, multi-app productivity) on the same machine.

Apple Mac Studio M3 Ultra
#5

Apple Mac Studio M3 Ultra

The Apple Mac Studio M3 Ultra is the best Mac-ecosystem AI workstation and competitive on raw local-LLM throughput per dollar. Versus the DGX Spark ($4,699 / 128 GB), the base Mac Studio M3 Ultra ($3,999 / 96 GB) loses on memory ceiling but wins on memory bandwidth (819 vs 273 GB/s) — meaning faster decode tok/s on dense models that fit. Step up to a 256 GB or 512 GB Mac Studio config and you exceed the Spark's memory ceiling at higher bandwidth, at the cost of premium Apple memory pricing. Versus the multi-GPU PC workstations (Puget, HP Z6/Z8), the Mac Studio cannot match peak training throughput but is silent, half the size, and roughly half the price of an equivalent dual-GPU PC build.

HP Z8 Fury G5
4.4/5· $7,995
Check Price on Amazon