When does Venice ship in volume?

AMD says production is ramping now in Taiwan with commercial shipments later in 2026, and a later production stage planned at TSMC's Arizona fab.

How does Venice compare to Intel's server roadmap?

Intel's competing P-core Xeon 'Diamond Rapids' is reportedly slipping to mid-2027, leaving AMD with a near-term lead on the leading-edge node for high-core-count server silicon.

Why does a CPU launch matter for GPU-bound AI clusters?

The EPYC host orchestrates data movement, networking and storage feeding the accelerators; higher core counts and ~1.6 TB/s per-socket memory bandwidth ease head-node bottlenecks in agentic and inference workloads.

AMD Ramps EPYC 'Venice' on TSMC 2nm — First HPC Chip to Hit the Node

AMD has started the volume production ramp of its sixth-generation EPYC server processor, codenamed "Venice," on TSMC's 2nm (N2) process — making it the first high-performance computing product in the industry to reach production on the node. Venice scales to 256 Zen 6 cores and, per figures reported alongside the ramp, delivers roughly 70% more compute than the current "Turin" generation. AMD says commercial shipments arrive later in 2026.

The ramp is running in Taiwan first, with a later production stage planned at TSMC's Arizona fab.

The host side of the AI cluster

Venice is a general-purpose server CPU, not an accelerator — but in an AI data center the EPYC host is what feeds the GPUs. It schedules work, moves data between storage, network and accelerator memory, and runs the orchestration layer for agentic pipelines that fan requests across tool calls and retrieval. AMD frames Venice squarely at "increasingly complex agentic workloads" where the CPU coordinates data movement, networking and storage for the accelerators around it.

Reported platform specs back that positioning: a new SP7 socket, up to 16 DDR5 memory channels delivering around 1.6 TB/s of per-socket bandwidth, and doubled CPU-to-GPU bandwidth versus the prior generation. Memory bandwidth at the host is precisely where mixed inference and retrieval clusters tend to stall.

First to 2nm, and the Intel gap

Being first to volume on N2 is the strategic headline. "Ramping 'Venice' on TSMC 2nm process technology marks an important step forward in accelerating the next generation of AI infrastructure," said AMD chair and CEO Lisa Su. TSMC chairman and CEO C.C. Wei said the collaboration reflects "pairing leadership process technology with advanced design innovation to enable the next era of high-performance and AI computing."

The timing favors AMD. Intel's competing P-core Xeon, "Diamond Rapids," is reportedly delayed toward mid-2027 — meaning AMD could hold the leading-edge node largely uncontested in the high-core-count server tier through the next procurement cycle.

Verano and the agentic-memory bet

AMD also confirmed it will extend TSMC 2nm across the data center roadmap with "Verano," a follow-on sixth-gen EPYC part tuned for performance-per-dollar-per-watt. Notably, Verano integrates LPDDR — AMD's bet that memory capacity and efficiency, not just core count, become the constraint for agentic AI workloads running long, tool-heavy contexts.

What changes for builders

For infra teams sizing 2026–2027 fleets, the practical signal is twofold: host-side capacity and bandwidth are scaling to match GPU density, and the leading-edge node now carries an efficiency story (perf-per-watt, perf-per-dollar) that feeds directly into cluster TCO. With Intel's high-end server refresh slipping, buyers planning GPU build-outs have a clearer near-term default for the head nodes — and a reason to revisit power and rack budgets before the Venice and Verano parts land in volume.

AMD Ramps EPYC 'Venice' on TSMC 2nm — First HPC Chip to Hit the Node

The host side of the AI cluster

First to 2nm, and the Intel gap

Verano and the agentic-memory bet

What changes for builders

More in Industry

SpaceX's Public S-1 Exposes xAI's $6.4B Loss as It Lines Up an $80B IPO

WhiteFiber Signs $160M+ Five-Year AI Compute Deal in France, Funded by Customer Prepayments

Broadcom, Meta and Three Chip Giants Commit $125M to UCLA Semiconductor Hub