Back to stories
Industry

Groq Raises Up to $650M for Inference Neocloud Pivot After Nvidia's $20B Licensing Deal

Michael Ouroumis2 min read
Groq Raises Up to $650M for Inference Neocloud Pivot After Nvidia's $20B Licensing Deal

Groq is raising up to $650 million from its existing investors, with backers Disruptive and Infinitum backstopping the round if it isn't fully subscribed, Axios reported on May 28. Current shareholders can participate pro rata. No new valuation was disclosed — a notable omission given the company's prior September 2025 round, which closed at a $6.9 billion post-money valuation on $750 million of fresh financing.

This is not a continuation of the old Groq. It is a structural reset, and the mechanics of how it's being financed are arguably the more interesting story for anyone watching AI private markets.

The Nvidia overhang

The raise follows the $20 billion licensing agreement Nvidia struck with Groq in late December 2025. Rather than an outright acquisition, Nvidia paid for a non-exclusive, perpetual license to Groq's Language Processing Unit (LPU) hardware and software stack, then hired roughly 80% of the workforce — including founder and CEO Jonathan Ross, the former Google engineer behind the original TPU, and president Sunny Madra. The reverse-acqui-hire structure left assets and entities behind, a pattern increasingly used to sidestep antitrust review.

The practical consequence: most of the people who designed the LPU now work at Nvidia, while the non-exclusive license means Groq retains the legal right to keep operating its own silicon. What remains is being rebuilt as a neocloud — selling inference capacity rather than chips — under CEO Adam Winter and CFO Matt Eng, both company veterans.

A second bite for the same investors

The financing sequence is the part builders and LPs should note. Final cash distributions from the Nvidia transaction are hitting shareholders soon. Those same investors are then being invited to recycle capital into the new neocloud entity — what Axios frames as a "second bite at the apple" and a potential template for AI private-market deals: monetize a licensing windfall, then re-fund the residual operating business with the proceeds.

That structure lets early backers realize returns from the Nvidia payout while keeping exposure to the inference cloud thesis, without writing net-new checks.

What changes for the inference market

Groq spent years positioning its LPU as a low-latency alternative to GPUs for token generation. With Nvidia now holding that IP and Groq reconstituting as a cloud operator, the competitive question shifts: a Groq neocloud competes on cost-per-token, utilization, and capacity availability against CoreWeave, Cerebras, and the hyperscalers — not on chip architecture differentiation.

For enterprise buyers evaluating dedicated inference providers, the signal is that the LPU bet now lives inside Nvidia's roadmap, while standalone Groq is a smaller, capital-light cloud play whose durability depends on how much compute $650 million actually buys in a market where power and chip supply remain the binding constraints. The absence of a headline valuation suggests the new entity is being priced cautiously — a sober marker in an otherwise frothy funding cycle.

Learn AI for Free — FreeAcademy.ai

Take "AI for Business: Practical Implementation" — a free course with certificate to master the skills behind this story.

More in Industry

JPMorgan's Best-Case US Debt Path Now Hinges on AI Productivity
Industry

JPMorgan's Best-Case US Debt Path Now Hinges on AI Productivity

JPMorgan strategist David Kelly mapped five US debt scenarios out to 2036. The only one that caps debt at 115% of GDP requires an AI-driven productivity surge that beats current expectations.

3 hours ago2 min read
Mistral Weighs Designing Its Own AI Chips to Cut Inference Costs
Industry

Mistral Weighs Designing Its Own AI Chips to Cut Inference Costs

Mistral CEO Arthur Mensch says the French lab is exploring designing its own AI chips to lower per-token costs, even as it builds a 13,800-GPU Nvidia data center near Paris.

4 hours ago2 min read
Tensormesh Lands $20M from Nvidia, AMD and CoreWeave to Turn KV Caching Into an Inference Layer
Industry

Tensormesh Lands $20M from Nvidia, AMD and CoreWeave to Turn KV Caching Into an Inference Layer

Tensormesh raised a $20M seed extension from Nvidia's NVentures, AMD Ventures and CoreWeave and shipped its KV-cache inference platform to general availability, claiming up to 10x cuts in latency and GPU spend.

6 hours ago2 min read