Site icon Converge Digest

Baseten Raises $1.5 Billion to Expand AI Inference Infrastructure

Baseten raised $1.5 billion in Series F financing as demand accelerates for AI inference infrastructure that supports custom and post-trained models in production environments. The funding round was led by Altimeter Capital, Conviction, and Spark Capital, with participation from Sands Capital, Wellington Management, IVP, Greylock, Battery Ventures, D. E. Shaw Ventures, Blackbird, Durable Capital Partners, Verified Capital, 01A, and existing investors. The financing was completed in two tranches at valuations of $11 billion and $13 billion.

The company said enterprises increasingly deploy specialized models trained on proprietary data rather than relying exclusively on frontier foundation models. Baseten reported approximately 20x year-over-year revenue growth and said its platform now handles more than one billion inference calls per day. The company operates 87 clusters across 18 cloud environments, providing a multi-cloud architecture designed to support large-scale AI application deployments.

Founded in 2019 and headquartered in San Francisco, Baseten provides infrastructure software that manages AI workloads, including GPU orchestration, autoscaling, observability, billing, and developer tooling. The company said it will use the new capital to expand engineering, research, operations, and go-to-market teams while increasing investments in compute capacity. Baseten plans to triple headcount during 2026 as it scales to meet growing enterprise demand.

• Raised $1.5 billion in Series F financing

• Funding completed at valuations of $11 billion and $13 billion

• Reports approximately 20x year-over-year revenue growth

• Processes more than 1 billion inference calls daily

• Operates 87 AI clusters globally

• Supports deployments across 18 cloud environments

• Plans to triple employee headcount during 2026

• Has raised more than $2 billion since founding

• Customers include Abridge, Clay, Cursor, Lovable, Mercor, and OpenEvidence

“The future of AI will be built on millions of specialized models, and the companies building the best ones know that post-training has become existential. It’s how they build intelligence they own, on data that’s theirs, optimized for the customers they serve,” said Tuhin Srivastava, CEO and Co-Founder of Baseten.

🌐 Analysis

Baseten’s financing highlights the rapid emergence of inference as one of the fastest-growing segments of the AI infrastructure market. While much of the industry’s attention has focused on training ever-larger foundation models, enterprises are increasingly investing in the infrastructure required to serve AI applications in production. That shift has fueled growth for a new generation of inference-focused providers that optimize deployment, scaling, observability, and cost efficiency across distributed GPU environments.

The announcement also reflects the growing importance of post-training and model customization. Enterprises increasingly seek AI systems trained on proprietary datasets and optimized for specific workflows, creating demand for infrastructure platforms capable of managing large fleets of specialized models. As open-source models continue to improve, inference platforms may become a key layer in the AI stack alongside GPUs, networking, storage, and cloud infrastructure.

Exit mobile version