Back to stories
Industry

NVIDIA GTC 2026: Jensen Huang Projects $1 Trillion in AI Chip Orders, Previews Kyber Architecture

Michael Ouroumis2 min read
NVIDIA GTC 2026: Jensen Huang Projects $1 Trillion in AI Chip Orders, Previews Kyber Architecture

NVIDIA CEO Jensen Huang used his two-hour GTC 2026 keynote on March 16 in San Jose to lay out the company's most ambitious roadmap yet — projecting $1 trillion in combined purchase orders for its Blackwell and Vera Rubin chip architectures through 2027. That figure doubles a previous estimate of $500 billion and underscores how aggressively hyperscalers and enterprises are investing in AI infrastructure.

The Inference Inflection

A central theme of the keynote was what Huang called the shift from training to inference as the primary driver of AI compute demand. With frontier models now deployed at massive scale, the cost and speed of generating tokens — not training them — is becoming the bottleneck for the industry.

To address this, Huang formally introduced the Groq 3 Language Processing Unit (LPU), the first chip to emerge from NVIDIA's $20 billion acquisition of inference startup Groq. Each Groq 3 LPU contains approximately 500 MB of stacked SRAM, and a full Groq LPX rack holds 256 LPUs with roughly 128 GB of aggregate on-chip memory and 640 TB/s of scale-up bandwidth. According to NVIDIA, the Groq LPX rack can boost tokens-per-watt performance by 35 times when paired with Rubin GPUs.

Vera Rubin in Full

The Vera Rubin platform — named after the astronomer whose work revealed dark matter — now comprises seven chips across five rack-scale systems. The flagship Vera Rubin NVL72 integrates 72 Rubin GPUs and 36 Vera CPUs connected through a massive NVLink copper spine, effectively functioning as a single GPU. NVIDIA says the system delivers 10x more performance per watt than its predecessor, Grace Blackwell.

Kyber: The Next Leap

Perhaps the most forward-looking reveal was Kyber, NVIDIA's next-generation rack architecture. Kyber rotates compute trays 90 degrees to a vertical orientation, packing 144 GPUs into a single rack for significantly higher density and lower latency. At full scale, the NVL576 configuration — with 576 GPUs across 144 packages — is expected to deliver 14 times the performance of the current GB300 NVL72 for both training and inference workloads.

Kyber will first appear in Vera Rubin Ultra systems, which NVIDIA expects to ship in 2027.

What It Means

The $1 trillion projection signals that NVIDIA sees no slowdown in AI infrastructure spending. With hyperscalers like AWS and Microsoft already committed to deploying Vera Rubin systems at scale, the company is positioning itself at the center of an inference-driven economy where token generation — not model training — defines the next wave of AI value creation.

How AI Actually Works — Free Book on FreeLibrary

A free book that explains the AI concepts behind the headlines — no jargon, just clarity.

More in Industry

NVIDIA GTC 2026: Jensen Huang Projects $1 Trillion AI Chip Pipeline, Previews Kyber Architecture
Industry

NVIDIA GTC 2026: Jensen Huang Projects $1 Trillion AI Chip Pipeline, Previews Kyber Architecture

NVIDIA CEO Jensen Huang opened GTC 2026 by forecasting $1 trillion in combined Blackwell and Vera Rubin orders through 2027, while previewing the Kyber rack architecture that will power the next era of AI supercomputing.

9 hours ago2 min read
NVIDIA and Uber to Deploy Level 4 Robotaxis Across 28 Cities by 2028
Industry

NVIDIA and Uber to Deploy Level 4 Robotaxis Across 28 Cities by 2028

Announced at GTC 2026, NVIDIA and Uber will launch fully driverless robotaxis starting in Los Angeles and San Francisco in 2027, scaling to 28 global cities by 2028 using NVIDIA's DRIVE Hyperion platform.

9 hours ago2 min read
Amazon Brings Health AI to 200 Million Prime Members With Free Virtual Care
Industry

Amazon Brings Health AI to 200 Million Prime Members With Free Virtual Care

Amazon expanded its Health AI agent to its main website and app, offering 200 million Prime members free 24/7 virtual care consultations powered by agentic AI and One Medical providers.

1 day ago3 min read