Nebius Plans Early Deployment of NVIDIA Vera Rubin NVL72

Jim Carroll

5 months ago

Nebius plans to offer the NVIDIA Vera Rubin NVL72 accelerated computing platform in the United States and Europe beginning in the second half of 2026. The company says it will be among the first NVIDIA Cloud Partners to bring the platform to market, integrating Rubin NVL72 across its full-stack infrastructure in regional data centers to support advanced AI workloads with local availability and control.

Unveiled at CES 2026, the NVIDIA Vera Rubin NVL72 targets agentic AI, advanced reasoning, and large mixture-of-experts models that require long token sequences and multi-step problem solving. Nebius plans to make Rubin NVL72 available through Nebius AI Cloud for direct infrastructure access and through Nebius Token Factory, its enterprise inference and post-training platform. The addition expands Nebius’ portfolio alongside existing NVIDIA GB200 NVL72 and NVIDIA Grace Blackwell Ultra NVL72 capacity.

As an NVIDIA Exemplar Cloud Partner, Nebius says its infrastructure aligns with NVIDIA reference architectures and benchmarks. The company positions Rubin NVL72 as a complementary option for customers selecting platforms optimized for different AI workload profiles, while maintaining benchmark-validated bare-metal performance across its cloud and inference services.

Deployment planned for H2 2026 across US and European data centers
One of the first NVIDIA Cloud Partners to offer Vera Rubin NVL72
Targets agentic AI, advanced reasoning, and large MoE models
Available via Nebius AI Cloud and Nebius Token Factory
Complements existing NVIDIA GB200 NVL72 and Grace Blackwell Ultra NVL72 capacity

“We are proud to be one of the first on the market to offer Vera Rubin GPUs as we fuel the next wave of AI innovation,” said Arkady Volozh, founder and CEO of Nebius. “By integrating Vera Rubin into Nebius AI Cloud and our inference platform Nebius Token Factory, we’re giving AI innovators and enterprises the infrastructure they need to develop agentic and reasoning AI systems faster and more efficiently.”

🌐 Analysis

Nebius has been steadily building a differentiated AI infrastructure strategy that combines early access to NVIDIA platforms with geographically distributed, purpose-built capacity in the US and Europe. Today, Nebius operates large-scale AI clusters based on NVIDIA Grace Blackwell systems, including GB200 NVL72, delivered as bare-metal infrastructure through Nebius AI Cloud. These deployments focus on training and large-scale inference for foundation models, with an emphasis on predictable performance validated against NVIDIA reference architectures.