What is Baseten's valuation?

Baseten's January 2026 Series E raised $300 million at a $5 billion post-money valuation — more than double the $2.15 billion mark it secured in its September 2025 Series D. Total funding to date is near $585 million.

Who are Baseten's existing investors?

Its January Series E was co-led by IVP and CapitalG (Alphabet's growth fund), with Nvidia joining alongside Altimeter, Battery Ventures, BOND, Conviction and Greylock. The earlier Series D was led by BOND.

Baseten Hits $5B Valuation as Inference Becomes AI's Hottest Layer

Q: What does Baseten actually sell to builders?

Serverless inference: it turns trained models into production-grade APIs, handling autoscaling, GPU scheduling and latency optimization. Customers include Cursor, Notion, Abridge and Clay.

Baseten's valuation has more than doubled in four months. The AI inference company closed a $300 million Series E in January 2026 at a $5 billion post-money valuation — up from the $2.15 billion it commanded in September 2025 — as demand for production AI inference accelerates. With total funding now near $585 million, Baseten has become one of the most richly valued companies in the layer that turns trained models into live products.

A valuation that keeps climbing

The trajectory is steep even by 2026 standards. Baseten raised a $150 million Series D at a $2.15 billion valuation in September 2025, led by BOND. Barely four months later, in January 2026, it closed a $300 million Series E at $5 billion, co-led by IVP and CapitalG — Alphabet's growth arm — with Nvidia joining the round alongside Altimeter, Battery Ventures, BOND, Conviction and Greylock. That more-than-doubling in a single quarter is what marks Baseten out, with cumulative funding already near $585 million.

Why inference is suddenly the prize

Baseten sells serverless inference: it converts trained models into production-grade APIs, handling autoscaling, GPU scheduling and latency optimization so application teams don't have to operate their own fleets. That pitch maps directly onto where 2026 spend is shifting — away from one-off training runs and toward the per-token cost of serving models at scale. The company has said inference volume on its platform grew roughly 100x during 2025, and its customer list — Cursor, Notion, Abridge and Clay among them — reads like a roster of the apps actually putting models in front of paying users.

That demand explains why Nvidia sits on the cap table and why Alphabet's growth fund keeps doubling down. Inference is the recurring, compounding workload; training is lumpy and concentrated among a handful of labs. Rivals are circling the same opening — Modal Labs and other serverless-GPU startups have raised at billion-dollar-plus valuations — but Baseten's enterprise logos and growth curve have made it the bid investors keep chasing.

What it changes for builders

For teams deciding where to serve models, a balance sheet this size signals Baseten can keep subsidizing GPU capacity and undercutting hyperscaler inference pricing while locking in multi-year supply. The flip side is concentration: as inference platforms consolidate funding and Nvidia allocation, the independent middle layer between model labs and application developers is narrowing to a few extremely well-capitalized players. Expect aggressive pricing in the near term — and stickier switching costs once those dependencies are wired into production.

Baseten Hits $5B Valuation as Inference Becomes AI's Hottest Layer

A valuation that keeps climbing

Why inference is suddenly the prize

What it changes for builders

More in Industry

Suno Raises $250M+ at a $5B Valuation as Bond Capital Bets on Generative Audio

Micron Crosses $1 Trillion Market Cap as HBM Sells Out and UBS Triples Its Target to $1,625

Cornerstone Robotics Clears EU CE Mark for Sentire, Sharpening China's Challenge to Intuitive Surgical's da Vinci