Groq is raising up to $650 million from its existing investors, with backers Disruptive and Infinitum backstopping the round if it isn't fully subscribed, Axios reported on May 28. Current shareholders can participate pro rata. No new valuation was disclosed — a notable omission given the company's prior September 2025 round, which closed at a $6.9 billion post-money valuation on $750 million of fresh financing.
This is not a continuation of the old Groq. It is a structural reset, and the mechanics of how it's being financed are arguably the more interesting story for anyone watching AI private markets.
The Nvidia overhang
The raise follows the $20 billion licensing agreement Nvidia struck with Groq in late December 2025. Rather than an outright acquisition, Nvidia paid for a non-exclusive, perpetual license to Groq's Language Processing Unit (LPU) hardware and software stack, then hired roughly 80% of the workforce — including founder and CEO Jonathan Ross, the former Google engineer behind the original TPU, and president Sunny Madra. The reverse-acqui-hire structure left assets and entities behind, a pattern increasingly used to sidestep antitrust review.
The practical consequence: most of the people who designed the LPU now work at Nvidia, while the non-exclusive license means Groq retains the legal right to keep operating its own silicon. What remains is being rebuilt as a neocloud — selling inference capacity rather than chips — under CEO Adam Winter and CFO Matt Eng, both company veterans.
A second bite for the same investors
The financing sequence is the part builders and LPs should note. Final cash distributions from the Nvidia transaction are hitting shareholders soon. Those same investors are then being invited to recycle capital into the new neocloud entity — what Axios frames as a "second bite at the apple" and a potential template for AI private-market deals: monetize a licensing windfall, then re-fund the residual operating business with the proceeds.
That structure lets early backers realize returns from the Nvidia payout while keeping exposure to the inference cloud thesis, without writing net-new checks.
What changes for the inference market
Groq spent years positioning its LPU as a low-latency alternative to GPUs for token generation. With Nvidia now holding that IP and Groq reconstituting as a cloud operator, the competitive question shifts: a Groq neocloud competes on cost-per-token, utilization, and capacity availability against CoreWeave, Cerebras, and the hyperscalers — not on chip architecture differentiation.
For enterprise buyers evaluating dedicated inference providers, the signal is that the LPU bet now lives inside Nvidia's roadmap, while standalone Groq is a smaller, capital-light cloud play whose durability depends on how much compute $650 million actually buys in a market where power and chip supply remain the binding constraints. The absence of a headline valuation suggests the new entity is being priced cautiously — a sober marker in an otherwise frothy funding cycle.



