Converge Digest
Wednesday, June 24, 2026

OpenAI and Broadcom Unveil “Jalapeño” Inference Accelerator

June 24, 2026
in Semiconductors

OpenAI and Broadcom unveiled “Jalapeño,” a custom AI inference processor designed specifically for large language model (LLM) workloads. The chip marks OpenAI’s first internally architected accelerator and represents a significant step in the company’s strategy to control more of the AI infrastructure stack, spanning models, software, systems, and now silicon. The companies said engineering samples are already running production-class workloads, including GPT-5.3-Codex-Spark, at target frequency and power levels.

OpenAI designed Jalapeño around the inference characteristics of current and future LLMs, focusing on minimizing data movement and balancing compute, memory, and networking resources to improve hardware utilization. Broadcom contributed silicon implementation, manufacturing, and networking technologies, including its Tomahawk Ethernet switching portfolio, while Celestica provided board, rack, and system-level engineering. The companies said early testing indicates performance-per-watt improvements over current state-of-the-art AI accelerators, although detailed benchmark data has not yet been released.

The announcement signals OpenAI’s intention to become a full-stack infrastructure provider rather than relying exclusively on merchant GPUs. The companies said Jalapeño was developed from initial design through tape-out in approximately nine months and will serve as the first member of a multi-generation accelerator roadmap. Deployments are expected to begin by the end of 2026 in large-scale AI data centers, with OpenAI and Broadcom targeting gigawatt-scale infrastructure deployments alongside partners including Microsoft.

Profile: Jalapeño AI Accelerator
Updated: June 24, 2026
Developer: OpenAI with Broadcom and Celestica
Chip Type: Custom AI inference accelerator (ASIC)
Primary Workload: Large Language Model inference
Architecture Goal: Optimize compute, memory and networking utilization for frontier AI models
Current Status: Engineering samples operational at target frequency and power
Demonstrated Workload: GPT-5.3-Codex-Spark
Networking: Broadcom Tomahawk Ethernet switching technology
Development Cycle: Approximately 9 months from design to tape-out
Performance Claim: Higher performance-per-watt than current state-of-the-art accelerators (early testing)
Deployment Timeline: Initial deployments planned by end of 2026
Scale Target: Gigawatt-scale AI data centers over multiple generations

“The world is moving to a compute-powered economy,” said Greg Brockman, President and Co-Founder of OpenAI. “Jalapeño is part of our long-term full-stack infrastructure strategy to make compute more abundant, resulting in AI which is faster, more reliable, more affordable for people and businesses, and can be used to solve more important problems.”

🌐 Analysis

The Jalapeño announcement confirms industry speculation that leading AI model developers are moving beyond software and into custom silicon. OpenAI joins a growing list of hyperscalers and AI platform providers, including Google Cloud (TPU), Amazon Web Services (Trainium and Inferentia), Microsoft Azure (Maia), and Meta (MTIA), that are developing workload-specific AI processors to reduce dependence on merchant accelerators and optimize infrastructure economics.

The partnership also highlights Broadcom’s growing influence in AI infrastructure. Beyond networking silicon, the company has become a major supplier of custom AI ASICs for hyperscalers and large cloud providers. If Jalapeño achieves its stated performance-per-watt objectives, it could strengthen Broadcom’s position as a preferred partner for organizations pursuing vertically integrated AI infrastructure strategies. The emphasis on inference rather than training reflects an industry-wide shift as AI deployments increasingly focus on serving production workloads efficiently at scale.

OpenAI AI Infrastructure Stack & Ecosystem
Updated: June 24, 2026
Applications: ChatGPT, Codex, API Services, Enterprise AI, Agentic AI Platforms
Foundation Models: GPT family, reasoning models, multimodal systems, coding agents
Serving Software: OpenAI-developed kernels, orchestration software, schedulers, serving infrastructure and inference optimization
Custom AI Silicon: Jalapeño Intelligence Processor, a purpose-built ASIC optimized for large-scale LLM inference
ASIC Development Partner: Broadcom (silicon implementation, packaging, custom ASIC development and AI infrastructure roadmap)
Networking Fabric: Broadcom Tomahawk Ethernet switching architecture for AI cluster connectivity
System Integration: Celestica (board design, rack integration, manufacturing and deployment engineering)
Cloud Partner: Microsoft Azure (strategic cloud and data center deployment platform)
Current Training Infrastructure: NVIDIA GPUs remain the primary platform for frontier AI model training and much of current inference deployment
Alternative Accelerator Ecosystem: AMD accelerators deployed through selected hyperscale cloud environments
Potential Network OEM Layer: Arista, Cisco, Juniper and others may participate in broader AI infrastructure deployments but were not identified in this announcement
Potential Server OEM Layer: Dell, HPE, Supermicro and ODM partners may support deployment infrastructure but were not identified in this announcement
Development Speed: Approximately 9 months from initial design to tape-out
Current Validation Status: Engineering samples running GPT-5.3-Codex-Spark at production target frequency and power
Roadmap Scale: 10 GW deployment target through 2029
Infrastructure Strategy: Vertical integration across models, software, networking, systems and custom silicon
Industry Significance: OpenAI is evolving from an AI model developer into a full-stack AI infrastructure company with its own silicon roadmap and gigawatt-scale deployment ambitions

Tags: Broadcom, OpenAI

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley
