• Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io
No Result
View All Result
Converge Digest
Friday, June 5, 2026
  • Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io
No Result
View All Result
Converge Digest
No Result
View All Result

Home » AWS Launches Trainium3 UltraServers to Scale Frontier AI 

AWS Launches Trainium3 UltraServers to Scale Frontier AI 

December 2, 2025
in Semiconductors
A A

Amazon Web Services rolled out its new EC2 Trn3 UltraServers, a Trainium3-powered system built for large-scale AI training and high-throughput inference. The new 3nm Trainium3 chip delivers up to 4.4x more compute performance, 4x higher energy efficiency, and nearly 4x more memory bandwidth compared to Trainium2, allowing customers to train larger models, reduce inference latency, and cut operational costs. Each UltraServer integrates up to 144 Trainium3 chips—reaching 362 FP8 PFLOPs per system—and uses AWS-engineered networking to eliminate scale-out bottlenecks.

Trn3 UltraServers introduce enhanced NeuronSwitch-v1 bandwidth and sub-10-microsecond Neuron Fabric networking to support next-generation AI workloads including agentic systems, mixture-of-experts, and reinforcement learning. EC2 UltraClusters 3.0 can now interconnect thousands of UltraServers—scaling to deployments of one million Trainium chips—to support trillion-token training datasets and massive real-time inference fleets. AWS reports that customers such as Anthropic, Karakuri, Metagenomi, NetoAI, Ricoh, and Splash Music are reducing training and inference costs by up to 50% using Trainium, while Decart is achieving 4x faster real-time generative video at half the cost of GPUs.

Amazon Bedrock is already serving production workloads on Trainium3, and AWS confirmed that Trainium4 is in development. The next-generation chip will target at least 6x processing performance (FP4), 3x FP8 gains, 4x memory bandwidth, and support for NVIDIA NVLink Fusion—enabling Trainium4, Graviton, and EFA to share common MGX rack architectures for flexible, mixed GPU–ASIC clusters.

• Trainium3 delivers 4.4x more compute performance, 4x greater energy efficiency, and nearly 4x more memory bandwidth than Trainium2.

• Each Trn3 UltraServer integrates up to 144 Trainium3 chips for 362 FP8 PFLOPs.

• NeuronSwitch-v1 provides 2x the internal bandwidth, with sub-10-microsecond Neuron Fabric chip-to-chip latency.

• EC2 UltraClusters 3.0 scale to 1 million Trainium chips—10x the previous generation.

• Customers report up to 50% lower costs; Decart reports 4x faster real-time generative video at half the cost of GPUs.

• Trainium4 will add NVLink Fusion support for mixed Trainium/GPU workloads inside MGX racks.

🌐 Analysis: Hyperscalers are increasingly relying on custom silicon to control performance, cost, and power efficiency at AI-factory scale. AWS continues to invest in Trainium and Graviton as strategic alternatives to GPU-only architectures, following similar moves from Google (TPU v5p), Microsoft (Maia), and Meta (MTIA). Trainium3 and the upcoming Trainium4 show AWS tightening its integration with high-speed fabrics—now including NVLink Fusion—to support mixed GPU–ASIC clusters and reduce dependency on external supply chains for frontier-model infrastructure.

Tags: AWS
ShareTweetShareSummarizeSummarize
Previous Post

Marvell Posts Record Q3 Revenue as Data-Center Demand Surges 37%

Next Post

AMD and HPE Team on Open Rack-Scale “Helios” Architecture

Jim Carroll

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

Related Posts

Clouds and Carriers

Vodafone and AWS Expand Sovereign Cloud Services for Germany

May 11, 2026
Financials

Amazon Q1 2026: AWS Surges 28% as Custom AI Chips Top $20B Run Rate

April 29, 2026
Semiconductors

Meta Deploys Tens of Millions of AWS Graviton5 Cores

April 26, 2026
AI Infrastructure

Oracle and AWS Link Clouds with Private Interconnect for AI Workloads

April 16, 2026
AI Infrastructure

Amazon Ties $200 Billion 2026 Capex Plan to AI, AWS, and Custom Silicon

April 9, 2026
AI Infrastructure

Amazon Commits €33.7B to Expand Data Centers in Spain

March 3, 2026
Next Post

AMD and HPE Team on Open Rack-Scale “Helios” Architecture

Categories

  • 5G / 6G / Wi-Fi
  • AI Infrastructure
  • All
  • Automotive Networking
  • Blueprints
  • Clouds and Carriers
  • Data Centers
  • Enterprise
  • Explainer
  • Feature
  • Financials
  • Last Mile / Middle Mile
  • Legal / Regulatory
  • Optical
  • Quantum
  • Research
  • Security
  • Semiconductors
  • Space
  • Start-ups
  • Subsea
  • Sustainability
  • Video
  • Webinars

Archives

Tags

5G All AT&T Australia AWS Blueprint columns BroadbandWireless Broadcom China Ciena Cisco Data Centers Dell'Oro Ericsson FCC Financial Financials Huawei Infinera Intel Japan Juniper Last Mile Last Mille LTE Mergers and Acquisitions Mobile NFV Nokia Optical Packet Systems PacketVoice People Regulatory Satellite SDN Service Providers Silicon Silicon Valley StandardsWatch Storage TTP UK Verizon Wi-Fi
Converge Digest

A private dossier for networking and telecoms

Follow Us

  • Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io

© 2026 Converge Digest - A private dossier for networking and telecoms.

No Result
View All Result
  • Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io

© 2026 Converge Digest - A private dossier for networking and telecoms.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
Go to mobile version