Converge Digest
Saturday, June 20, 2026

Astera Labs Ships 320-Lane Scorpio Fabric Switch for AI Scale-Up Clusters

May 5, 2026
in Data Centers, Semiconductors

Astera Labs introduced its Scorpio X-Series 320-lane Smart Fabric Switch, positioning it as a high-radix, memory-semantic interconnect designed to support large-scale AI clusters with reduced latency and improved efficiency. The device is now shipping to hyperscalers and targets production AI deployments where multi-trillion-parameter models and distributed agentic workloads are stressing traditional interconnect architectures.

The Scorpio X-Series integrates memory-semantic connectivity, allowing accelerators to access shared fabric resources using native load/store operations instead of software-managed messaging. This approach reduces protocol overhead and improves fabric efficiency at scale. The platform also incorporates hardware-accelerated Hypercast and in-network compute engines, which the company says can accelerate collective operations such as all-reduce and all-to-all by up to 2x, improving time-to-first-token and tokens-per-watt metrics.
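
To give a sense of where a roughly 2x gain on all-reduce can come from, here is a minimal sketch that compares the bytes each GPU must transmit in a textbook ring all-reduce with an idealized switch-side reduction. This is a generic traffic model, not a description of how Scorpio's in-network compute engines actually work; the function names and the 80-GPU, 1 MiB example are assumptions chosen for illustration.

```python
def ring_allreduce_bytes_sent(num_gpus: int, payload_bytes: int) -> float:
    """Bytes each GPU transmits in a textbook ring all-reduce.

    The reduce-scatter phase and the all-gather phase each move
    (N - 1) / N of the payload per GPU.
    """
    return 2 * (num_gpus - 1) / num_gpus * payload_bytes


def in_network_allreduce_bytes_sent(payload_bytes: int) -> float:
    """Bytes each GPU transmits when the switch performs the reduction.

    Idealized assumption: every GPU sends its payload to the switch once,
    and the switch returns the reduced result.
    """
    return float(payload_bytes)


def traffic_ratio(num_gpus: int, payload_bytes: int) -> float:
    """How many times more data a GPU sends in the ring scheme."""
    ring = ring_allreduce_bytes_sent(num_gpus, payload_bytes)
    in_network = in_network_allreduce_bytes_sent(payload_bytes)
    return ring / in_network


if __name__ == "__main__":
    # Hypothetical example: 80 GPUs reducing a 1 MiB buffer.
    print(f"ring / in-network send volume: {traffic_ratio(80, 1 << 20):.3f}")
```

Under these assumptions the ratio approaches 2 as the GPU count grows, which is in line with the "up to 2x" figure quoted for collectives, although real speedups also depend on latency, link speeds, and how the switch schedules the reduction.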

Astera Labs expanded its broader Scorpio portfolio with the PCIe 6-based P-Series, spanning 32 to 320 lanes to support diverse accelerator configurations and system topologies. The COSMOS software stack unifies management across the platform, offering telemetry, diagnostics, and non-disruptive updates to maintain uptime in large AI clusters. The company said the scale-up switching silicon market could reach $20 billion by 2030, with Scorpio production ramping in the second half of 2026.  

  • Scorpio X-Series delivers 320 lanes, enabling high-radix, single-hop scale-up topologies
  • Supports up to ~80 GPUs per switch with reduced hops versus legacy multi-switch designs (see page 9 diagram)  
  • Bandwidth per switch scales to ~20 Tbps vs ~9 Tbps for prior-generation designs  
  • Hypercast and in-network compute engines accelerate collective operations by up to 2x
  • Memory-semantic fabric enables native load/store access across accelerators
  • Scorpio P-Series expands PCIe fabric options from 32 to 320 lanes
  • COSMOS software provides unified management, telemetry, and resiliency features
  • Targets hyperscalers, AI labs, and neo-clouds building heterogeneous accelerator clusters
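
The headline figures in the list above can be cross-checked with some quick arithmetic. The sketch below uses only the numbers quoted in the article (320 lanes, ~80 GPUs, ~20 Tbps, ~9 Tbps). It assumes, for illustration only, that every lane faces a GPU and that lanes are split evenly; the article does not say whether the bandwidth figure is per direction or aggregate, so the per-lane result is a rough estimate.

```python
# Figures quoted in the article (all approximate).
LANES_PER_SWITCH = 320
GPUS_PER_SWITCH = 80
SWITCH_BANDWIDTH_TBPS = 20.0
PRIOR_GEN_BANDWIDTH_TBPS = 9.0


def lanes_per_gpu(lanes: int, gpus: int) -> float:
    """Lanes available to each GPU, assuming an even split (an assumption)."""
    return lanes / gpus


def gbps_per_lane(bandwidth_tbps: float, lanes: int) -> float:
    """Implied per-lane rate in Gbps if the bandwidth is spread over all lanes."""
    return bandwidth_tbps * 1000 / lanes


def generation_ratio(new_tbps: float, old_tbps: float) -> float:
    """Bandwidth gain over the prior-generation figure."""
    return new_tbps / old_tbps


if __name__ == "__main__":
    print("lanes per GPU:", lanes_per_gpu(LANES_PER_SWITCH, GPUS_PER_SWITCH))
    print("Gbps per lane:", gbps_per_lane(SWITCH_BANDWIDTH_TBPS, LANES_PER_SWITCH))
    ratio = generation_ratio(SWITCH_BANDWIDTH_TBPS, PRIOR_GEN_BANDWIDTH_TBPS)
    print(f"gain vs prior generation: {ratio:.2f}x")
```

With the quoted numbers this works out to 4 lanes per GPU, about 62.5 Gbps per lane, and roughly a 2.2x bandwidth gain over the prior-generation figure.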

“The frontier models driving today’s most demanding AI applications require connectivity infrastructure that keeps pace with the accelerators powering them,” said Jitendra Mohan, CEO of Astera Labs.

🌐 Analysis: Hypercast is Astera Labs’ purpose-built multicast mechanism designed specifically for AI workloads, addressing a fundamental bottleneck in modern GPU clusters: the explosive growth of communication overhead driven by mixture-of-experts (MoE) models and large-scale collective operations. In MoE architectures, each token is dynamically routed to a subset of experts distributed across multiple GPUs, turning every routing decision into a multicast event. Traditional switching architectures struggle here because they either lack sufficient multicast group capacity or require slow, unpredictable configuration times, often ranging from hundreds of microseconds to milliseconds, which introduce latency variability that directly impacts model performance and user experience.
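
The multicast-group pressure described above can be illustrated with a small simulation. The sketch below assumes a hypothetical MoE layer with 64 experts, one expert per GPU, and top-2 routing; the router here picks experts at random, which stands in for a learned gating function. None of these parameters come from Astera Labs.

```python
import math
import random


def route_token(num_experts: int, top_k: int, rng: random.Random) -> list[int]:
    """Pick top_k distinct experts for one token (random stand-in for a gate)."""
    return rng.sample(range(num_experts), top_k)


def destination_gpus(experts: list[int], experts_per_gpu: int) -> frozenset[int]:
    """Map the chosen experts to the set of GPUs that must receive the token."""
    return frozenset(expert // experts_per_gpu for expert in experts)


def distinct_destination_sets(num_gpus: int, fanout: int) -> int:
    """Number of different GPU sets of exactly `fanout` members."""
    return math.comb(num_gpus, fanout)


def observed_groups(num_tokens: int, num_experts: int, top_k: int,
                    experts_per_gpu: int, seed: int = 0) -> set[frozenset[int]]:
    """Collect the distinct multicast destination sets seen across tokens."""
    rng = random.Random(seed)
    return {
        destination_gpus(route_token(num_experts, top_k, rng), experts_per_gpu)
        for _ in range(num_tokens)
    }


if __name__ == "__main__":
    groups = observed_groups(num_tokens=10_000, num_experts=64,
                             top_k=2, experts_per_gpu=1)
    print("distinct groups observed:", len(groups))
    print("distinct groups possible:", distinct_destination_sets(64, 2))
```

Even this small configuration allows 2,016 different two-GPU destination sets, and the count grows combinatorially with larger fan-out, which is why group capacity and setup time matter for a fabric serving MoE traffic.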

🌐 Analysis: Hypercast addresses this by creating lightweight, pre-configurable multicast groups that can be instantiated quickly and at scale, enabling deterministic, low-latency data distribution across GPUs. This is critical not only for MoE routing but also for dense collective operations such as all-gather and all-to-all, which occur frequently during both training and inference. By accelerating these operations in hardware and eliminating redundant data replication or slow control-plane setup, Hypercast improves GPU utilization, reduces idle time, and increases tokens-per-watt efficiency. The net effect is a more predictable and efficient fabric, where communication no longer constrains model architecture or forces compromises in expert placement and routing decisions.
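
The point about redundant replication can be made concrete with a toy all-gather. In the sketch below, one version has each GPU send its shard separately to every peer, while the other has each GPU inject its shard once and relies on the fabric to replicate it. This is a generic model of unicast versus switch-side multicast, not Hypercast's actual mechanism, and all names are illustrative.

```python
def all_gather_unicast(shards: list[str]) -> tuple[list[list[str]], int]:
    """Each GPU sends its shard to every other GPU individually."""
    n = len(shards)
    inbox = [[""] * n for _ in range(n)]
    sends = 0
    for src, shard in enumerate(shards):
        inbox[src][src] = shard          # local copy, no network send
        for dst in range(n):
            if dst != src:
                inbox[dst][src] = shard  # one network send per peer
                sends += 1
    return inbox, sends


def all_gather_multicast(shards: list[str]) -> tuple[list[list[str]], int]:
    """Each GPU injects its shard once; the fabric is assumed to replicate it."""
    n = len(shards)
    inbox = [[""] * n for _ in range(n)]
    sends = 0
    for src, shard in enumerate(shards):
        sends += 1                       # single injection by the source GPU
        for dst in range(n):
            inbox[dst][src] = shard      # replication modeled inside the switch
    return inbox, sends


if __name__ == "__main__":
    shards = [f"shard-{i}" for i in range(8)]
    unicast_result, unicast_sends = all_gather_unicast(shards)
    multicast_result, multicast_sends = all_gather_multicast(shards)
    print("same result:", unicast_result == multicast_result)
    print("source sends:", unicast_sends, "vs", multicast_sends)
```

Both versions leave every GPU with the full set of shards, but the number of sends issued by the GPUs drops from N × (N − 1) to N, which is the replication work a multicast-capable switch takes off the accelerators.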

🌐 Analysis: Strategically, Hypercast is central to Astera Labs’ positioning in the AI interconnect market. It moves the company beyond traditional PCIe switching into the domain of intelligent, AI-aware fabrics with in-network compute capabilities. 

Tags: Astera Labs, PCIe
Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley


© 2026 Converge Digest - A private dossier for networking and telecoms.
