Baseten's valuation has more than doubled in four months. The AI inference company closed a $300 million Series E in January 2026 at a $5 billion post-money valuation — up from the $2.15 billion it commanded in September 2025 — as demand for production AI inference accelerates. With total funding now near $585 million, Baseten has become one of the most richly valued companies in the layer that turns trained models into live products.
A valuation that keeps climbing
The trajectory is steep even by 2026 standards. Baseten raised a $150 million Series D at a $2.15 billion valuation in September 2025, led by BOND. Barely four months later, in January 2026, it closed a $300 million Series E at $5 billion, co-led by IVP and CapitalG — Alphabet's growth arm — with Nvidia joining the round alongside Altimeter, Battery Ventures, BOND, Conviction and Greylock. That more-than-doubling in a single quarter is what marks Baseten out, with cumulative funding already near $585 million.
Why inference is suddenly the prize
Baseten sells serverless inference: it converts trained models into production-grade APIs, handling autoscaling, GPU scheduling and latency optimization so application teams don't have to operate their own fleets. That pitch maps directly onto where 2026 spend is shifting — away from one-off training runs and toward the per-token cost of serving models at scale. The company has said inference volume on its platform grew roughly 100x during 2025, and its customer list — Cursor, Notion, Abridge and Clay among them — reads like a roster of the apps actually putting models in front of paying users.
That demand explains why Nvidia sits on the cap table and why Alphabet's growth fund keeps doubling down. Inference is the recurring, compounding workload; training is lumpy and concentrated among a handful of labs. Rivals are circling the same opening — Modal Labs and other serverless-GPU startups have raised at billion-dollar-plus valuations — but Baseten's enterprise logos and growth curve have made it the bid investors keep chasing.
What it changes for builders
For teams deciding where to serve models, a balance sheet this size signals Baseten can keep subsidizing GPU capacity and undercutting hyperscaler inference pricing while locking in multi-year supply. The flip side is concentration: as inference platforms consolidate funding and Nvidia allocation, the independent middle layer between model labs and application developers is narrowing to a few extremely well-capitalized players. Expect aggressive pricing in the near term — and stickier switching costs once those dependencies are wired into production.



