IBM Unveils z17 Mainframe with On-Chip AI and Spyre Accelerator

IBM unveiled the z17 mainframe, the latest evolution of its enterprise computing platform, designed to bring AI acceleration directly into core infrastructure. Central to the system is the new IBM Telum II processor, which features an on-chip AI accelerator capable of executing over 450 billion inference operations daily with sub-millisecond response times. The chip includes a 40% increase in cache size over its predecessor and enhanced compute capacity, expanding the z17’s performance envelope for transactional and real-time AI workloads.

Complementing Telum II, IBM introduced the Spyre Accelerator, a PCIe-based AI engine expected in Q4 2025. The Spyre Accelerator brings generative AI capabilities to the mainframe, optimized for multi-model inference and direct integration with enterprise data. Together, Telum II and Spyre support native execution of assistants and agents like watsonx Code Assistant and watsonx Assistant, providing enterprise users with AI-enhanced development tools and real-time incident management through integrations such as Z Operations Unite.

IBM Telum® II Chip & IBM Spyre Accelerator Chip for AI Acceleration

IBM z17 will launch with support for the upcoming z/OS 3.2 operating system, which adds hardware-accelerated AI support, NoSQL compatibility, and hybrid cloud data processing. Enhancements also include new OpenTelemetry-based observability tools and expanded AI-driven cybersecurity features. IBM is also deploying Vault (from HashiCorp) for integrated secrets management, and Threat Detection for z/OS, a Telum-powered AI anomaly detection system designed to harden mission-critical workloads.

• Processor architecture:

• IBM Telum II with integrated AI acceleration

• 40% larger cache, higher frequency, expanded compute

• Up to 450B AI inference ops/day; 1ms response times

• AI hardware acceleration:

• IBM Spyre Accelerator via PCIe (Q4 2025)

• Optimized for generative AI, assistants, and agents

• Supports Granite-based models, runs enterprise AI workloads natively

• AI and OS software integration:

• z/OS 3.2 (expected Q3 2025) adds AI ops insight, NoSQL, hybrid cloud support

• watsonx Code Assistant and watsonx Assistant for Z integrated with Z Operations Unite

• Unified monitoring and response via IBM Concert and OpenTelemetry

• Security and observability:

• IBM Vault (by HashiCorp) for hybrid secrets management

• AI-based sensitive data discovery and classification

• IBM Threat Detection for z/OS for anomaly identification

• Storage and availability:

• IBM Storage DS8000 Gen 10 optimized for z17

• z17 general availability: June 18, 2025

• Spyre Accelerator availability: Q4 2025

“With z17, we’re bringing AI to the core of the enterprise with the software, processing power, and storage to make AI operational quickly,” said Ross Mauri, General Manager of IBM Z and LinuxONE.

Jim Carroll

Editor & Publisher

Every article published by Converge Digest is researched, curated, fact-checked and editorially reviewed by Jim Carroll, Editor & Publisher. AI-assisted drafting may be used to accelerate production, but all content is reviewed, refined and approved prior to publication.