Back to stories
Industry

NVIDIA Unveils Rubin: Six New Chips Powering the Next AI Supercomputer

Michael Ouroumis2 min read
NVIDIA Unveils Rubin: Six New Chips Powering the Next AI Supercomputer

NVIDIA has officially launched the Rubin platform — its most ambitious chip architecture to date and the successor to the record-breaking Blackwell generation. The platform comprises six co-designed chips built from the ground up to work as a single AI supercomputer.

Six Chips, One System

Unlike previous generations where the GPU did most of the heavy lifting, Rubin is a tightly integrated six-chip platform:

The key insight is extreme codesign: all six chips were developed together to eliminate bottlenecks between compute, memory, and networking. The result is 3.6 terabytes per second of bandwidth per GPU and 260 TB/s of total connectivity.

The Numbers That Matter

Compared to Blackwell, Rubin delivers:

For enterprises running large language models in production, that 10x inference cost reduction is the headline number. It means the same workload that costs $100,000/month on Blackwell could run for $10,000 on Rubin — or the same budget buys 10x more throughput.

Who Gets It First

Rubin is already in full production, with partner products shipping in the second half of 2026. AWS, Google Cloud, Microsoft Azure, and Oracle Cloud will be among the first to offer Rubin-based instances. Cloud partners CoreWeave, Lambda, Nebius, and Nscale are also in the first wave.

Built for Agentic AI

NVIDIA is positioning Rubin specifically for the agentic AI workloads that are defining 2026: autonomous reasoning systems that chain multiple model calls together, maintain long contexts, and interact with external tools. These workloads are inference-heavy and latency-sensitive — exactly where Rubin's architecture is optimized.

GTC Preview

With NVIDIA's GTC conference set for March 16, CEO Jensen Huang has hinted at "several new chips the world has never seen before." Whether that means Rubin Ultra variants or something entirely new, the AI compute race shows no signs of slowing down.

Rubin doesn't just raise the performance ceiling — it fundamentally changes the economics of running AI at scale.

Learn AI for Free — FreeAcademy.ai

Take "AI for Business: Practical Implementation" — a free course with certificate to master the skills behind this story.

More in Industry

Cadence and NVIDIA Expand Partnership to Close the Sim-to-Real Gap for Robotics and Chip Design
Industry

Cadence and NVIDIA Expand Partnership to Close the Sim-to-Real Gap for Robotics and Chip Design

At CadenceLIVE 2026, Cadence and NVIDIA announced an expanded partnership combining agentic AI, physics simulation, and digital twins — targeting robotics sim-to-real, AI factory efficiency, and 10x productivity in chip design.

1 hours ago2 min read
SoundHound AI to Acquire LivePerson in $43M All-Stock Deal, Forging Omnichannel Conversational AI Leader
Industry

SoundHound AI to Acquire LivePerson in $43M All-Stock Deal, Forging Omnichannel Conversational AI Leader

SoundHound AI will acquire LivePerson for $43 million in an all-stock deal valuing the combined business at a $250 million enterprise value, uniting voice agentic AI with digital messaging that powers one billion customer messages per month.

3 hours ago2 min read
Google Taps Marvell for Two Custom AI Inference Chips, Shaking Broadcom's TPU Grip
Industry

Google Taps Marvell for Two Custom AI Inference Chips, Shaking Broadcom's TPU Grip

Google is in talks with Marvell to co-design a memory processing unit and an inference-optimized TPU, adding a third design partner to its custom silicon supply chain and sending Marvell shares to a record high while Broadcom slid.

9 hours ago2 min read