SambaNova Systems has launched SambaNova Cloud, an AI inference platform powered by its SN40L chip that the company bills as the world’s fastest. The platform supports Meta’s open-source Llama 3.1 models, including the 405B and 70B versions, so developers can build generative AI applications on them. There is no waiting list: developers get immediate access to Llama 3.1 405B at 132 tokens per second and Llama 3.1 70B at 461 tokens per second, both at full 16-bit precision.
SambaNova’s platform is positioned to meet growing demand from businesses that need high-speed, high-fidelity models, particularly for applications requiring real-time responses and for multi-agent systems. The company attributes the performance to the SN40L chip’s patented dataflow architecture and advanced memory system, which it says outperform traditional Nvidia GPUs. The launch of SambaNova Cloud is part of the company’s broader goal to democratize access to state-of-the-art AI models, with offerings across free, developer, and enterprise tiers.
• Platform: SambaNova Cloud, available in Free, Developer, and Enterprise tiers
• AI Models: Supports Llama 3.1 models, including 405B at 132 tokens per second and 70B at 461 tokens per second
• Technology: Powered by SambaNova’s SN40L AI chip, designed for fast, efficient model inference
• Target Audience: Developers and enterprises needing scalable, real-time AI capabilities
• Performance: Full 16-bit precision at what the company calls record inference speeds for generative AI applications
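To put the quoted throughput figures in practical terms, the short Python sketch below estimates how long a response would take to generate at each rate. It is only illustrative arithmetic: the 500-token response length and the helper name `generation_seconds` are assumptions made for this example, and the calculation ignores network latency and time to first token.

```python
# Throughput figures quoted in the announcement (tokens per second).
THROUGHPUT_TPS = {
    "Llama 3.1 405B": 132,
    "Llama 3.1 70B": 461,
}


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Seconds to emit num_tokens at a steady decode rate.

    Hypothetical helper for illustration; it ignores network latency
    and time to first token.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    response_tokens = 500  # assumed response length, not from the announcement
    for model, tps in THROUGHPUT_TPS.items():
        seconds = generation_seconds(response_tokens, tps)
        print(f"{model}: about {seconds:.2f} s for {response_tokens} tokens")
```

Under these assumptions, a 500-token response would take roughly 3.8 seconds on the 405B model and about 1.1 seconds on the 70B model.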
“SambaNova Cloud is the fastest API service for developers. We deliver world record speed and in full 16-bit precision – all enabled by the world’s fastest AI chip,” said Rodrigo Liang, CEO of SambaNova Systems.
SambaNova Systems, headquartered in Palo Alto, California, was founded in 2017 by industry veterans from Sun/Oracle and Stanford University. The company focuses on delivering enterprise-scale AI platforms for next-generation AI computing. SambaNova has attracted significant investment from leading firms, including SoftBank Vision Fund 2, BlackRock, Intel Capital, GV (formerly Google Ventures), Temasek, GIC, and others, positioning it as a key player in the AI infrastructure space.