• Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io
No Result
View All Result
Converge Digest
Wednesday, June 10, 2026
  • Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io
No Result
View All Result
Converge Digest
No Result
View All Result

Home » AWS Launches Graviton5 CPU with 192 Cores for Agentic AI

AWS Launches Graviton5 CPU with 192 Cores for Agentic AI

June 10, 2026
in All
A A

Amazon Web Services (AWS) has made its new Graviton5 processor generally available, positioning the Arm-based CPU as a key platform for the growing class of agentic AI applications that require real-time reasoning, code generation, and orchestration of complex tasks. Available through Amazon EC2 M9g and M9gd instances, Graviton5 delivers up to 25% higher compute performance than the previous generation while offering the highest CPU core density in Amazon EC2 with 192 cores per package.

AWS said Graviton5 incorporates a 5x larger L3 cache, DDR5-8800 memory, PCIe Gen 6 support, and up to 33% lower inter-core latency compared to Graviton4. The company also reports up to 35% faster web application performance, 35% faster machine learning inference, and 30% faster database performance. M9gd instances add up to 11.4 TB of local NVMe SSD storage and 30% higher IOPS for storage-intensive workloads. The processor is manufactured on a 3nm process node and is integrated with AWS’s sixth-generation Nitro System, including the new Nitro Isolation Engine, which AWS describes as a formally verified security component designed to provide mathematically proven isolation between virtual machines.

AWS executives highlighted growing adoption of Graviton as AI workloads drive increased demand for CPU infrastructure alongside accelerators. CEO Andy Jassy noted that approximately 98% of AWS’s top 1,000 EC2 customers use Graviton, with more than 120,000 customers now running workloads on the platform. He also disclosed that Meta plans to deploy tens of millions of Graviton cores to support agentic AI initiatives, while Uber and Snowflake are expanding their use of the processor family.

• Graviton5 delivers up to 25% higher compute performance than Graviton4

• 192 Arm-based CPU cores per package

• Up to 33% lower inter-core latency

• 5x larger L3 cache than the previous generation

• DDR5-8800 memory support

• PCIe Gen 6 connectivity

• Up to 15% higher network bandwidth and 20% higher EBS bandwidth

• Built on a 3nm manufacturing process

• Available in EC2 M9g and M9gd instance families

• Meta plans deployment of tens of millions of Graviton cores for agentic AI

“About 11 years ago, with our very talented Annapurna team and informed by the unusual scale and insight we had in operating the largest cloud infrastructure, we decided to design and build our own CPU chip,” said Andy Jassy, President and CEO of Amazon. “The reason customers are so excited about Graviton is that it offers about 30-40% better price-performance than comparable instances. When you layer on top of how much CPU customers normally use with the fact that AI’s growth is driving explosive CPU expansion given that post-training, reinforcement learning, and agentic actions use CPU, Graviton becomes even more compelling.”

🌐 Analysis: Graviton5 reflects AWS’s long-term strategy of vertically integrating its infrastructure stack, following a path similar to its custom Trainium AI accelerators, Nitro DPUs, and Annapurna networking silicon. While NVIDIA remains dominant for AI training and inference acceleration, AWS is emphasizing the growing importance of CPUs for orchestration, reinforcement learning environments, retrieval systems, agent execution, and AI infrastructure management. The company’s focus on agentic AI represents a notable shift from traditional cloud messaging centered on web applications and databases.

🌐 The announcement also underscores intensifying competition among hyperscalers developing proprietary silicon. AWS now fields Graviton CPUs, Trainium AI accelerators, Inferentia inference processors, Nitro infrastructure offload chips, and custom networking silicon. Meanwhile, competitors continue expanding their own silicon portfolios, including Google’s Tensor Processing Units (TPUs) and Microsoft’s Maia AI accelerator family. Meta’s commitment to deploy tens of millions of Graviton cores highlights the growing role of Arm-based server CPUs in large-scale AI infrastructure.

Profile: Amazon Web Services (AWS)
HeadquartersSeattle, Washington, USA
Parent CompanyAmazon
CEOMatt Garman
Parent CEOAndy Jassy
Founded2006
Core TechnologiesCloud infrastructure, AI platforms, custom silicon, networking, storage
Custom Silicon PortfolioGraviton, Trainium, Inferentia, Nitro
Latest CPUGraviton5 (192 cores, 3nm process)
AI StrategyFoundation models, agentic AI infrastructure, custom accelerators, Bedrock platform
Scale120,000+ Graviton customers; 98% of top 1,000 EC2 customers use Graviton
ShareTweetShareSummarizeSummarize
Previous Post

Dell’Oro: AI Infrastructure Spending Pushes 2026 Data Center Capex Above $1 Trillion

Jim Carroll

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

Related Posts

Research

Dell’Oro: AI Infrastructure Spending Pushes 2026 Data Center Capex Above $1 Trillion

June 10, 2026
Semiconductors

TDK Acquires Fabric8Labs to Scale Advanced Cooling for AI Data Centers

June 10, 2026
Quantum

Xanadu Sets New Benchmark for Ultra-Low-Loss Photonic Chip Packaging

June 10, 2026
Optical

Colt and Ciena Achieve 800GbE Quantum-Safe Across the Atlantic

June 10, 2026
Research

Dell’Oro: Campus Ethernet Switch Revenue Climbs in 1Q 2026

June 10, 2026
Semiconductors

Micron Selects Bechtel to Build First Phase of Massive New York Memory Manufacturing Campus

June 10, 2026

Categories

  • 5G / 6G / Wi-Fi
  • AI Infrastructure
  • All
  • Automotive Networking
  • Blueprints
  • Clouds and Carriers
  • Data Centers
  • Enterprise
  • Explainer
  • Feature
  • Financials
  • Last Mile / Middle Mile
  • Legal / Regulatory
  • Optical
  • Quantum
  • Research
  • Security
  • Semiconductors
  • Space
  • Start-ups
  • Subsea
  • Sustainability
  • Video
  • Webinars

Archives

Tags

5G All AT&T Australia AWS Blueprint columns BroadbandWireless Broadcom China Ciena Cisco Data Centers Dell'Oro Ericsson FCC Financial Financials Huawei Infinera Intel Japan Juniper Last Mile Last Mille LTE Mergers and Acquisitions Mobile NFV Nokia Optical Packet Systems PacketVoice People Regulatory Satellite SDN Service Providers Silicon Silicon Valley StandardsWatch Storage TTP UK Verizon Wi-Fi
Converge Digest

A private dossier for networking and telecoms

Follow Us

  • Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io

© 2026 Converge Digest - A private dossier for networking and telecoms.

No Result
View All Result
  • Home
  • About
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Manage Email Delivery
  • NextGenInfra.io

© 2026 Converge Digest - A private dossier for networking and telecoms.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
Go to mobile version