Converge Digest
Sunday, May 31, 2026

Baseten Expands AI Inference Ambitions with $150M Funding Round

September 5, 2025
in Start-ups

Baseten surged to a $2.15 billion valuation with a $150 million Series D, nearly tripling its worth just six months after raising $75 million in Series C funding. The round, led by BOND with participation from CapitalG, Premji, and existing investors such as Greylock, Spark, IVP, Conviction, and 01a, brings the company’s total funding to over $285 million. The raise underscores the critical role inference now plays in bringing AI models into production at scale, positioning Baseten as one of the leading infrastructure providers for generative AI applications.

Founded in San Francisco by Tuhin Srivastava, Amir Haghighat, and Philip Howes, Baseten has focused from the start on building infrastructure specifically optimized for inference. The company has reached key milestones quickly: powering billions of weekly model calls for healthcare, enterprise, and consumer AI products; introducing Baseten Model APIs and Baseten Training to accelerate deployment and customization of models; and securing major customers including Abridge, Clay, Captions, OpenEvidence, and Writer. Its technology is already embedded in mission-critical use cases such as generating millions of clinical notes per week for healthcare providers.

With the Series D funding, Baseten plans to expand its applied research, infrastructure, and developer tooling while strengthening its customer teams. CEO Tuhin Srivastava framed the company’s ambition: “If cloud was the foundation that enabled the latest generation of great technology companies, inference is the foundation for the next.”

• $150 million Series D funding round led by BOND; valuation rises to $2.15 billion

• Founded by Tuhin Srivastava, Amir Haghighat, and Philip Howes in San Francisco

• Powers billions of model inference calls weekly across healthcare and enterprise AI

• Customers include Abridge, Clay, Captions, OpenEvidence, and Writer

• Milestones include launch of Baseten Model APIs and Baseten Training

• Total capital raised now exceeds $285 million

🌐 Analysis: Baseten’s trajectory reflects how purpose-built inference platforms are moving from niche infrastructure to essential components of AI adoption. The founders’ backgrounds span applied machine learning, infrastructure engineering, and developer platform design, giving the company a multidisciplinary edge. Milestones such as supporting large healthcare deployments demonstrate enterprise-grade reliability, and the new developer APIs and training services broaden the stack. Baseten has not disclosed hardware partners, but its positioning suggests close alignment with the GPU-accelerated environments common in hyperscale AI deployments. Against competitors such as Together AI and Anyscale, Baseten’s focus on inference as the “app layer” of AI clearly differentiates it as it builds out its ecosystem.

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

© 2026 Converge Digest - A private dossier for networking and telecoms.