Converge Digest
Tuesday, June 9, 2026

Majestic Labs Targets AI Memory Bottleneck with 128TB Server

April 28, 2026
in Data Centers, Start-ups

Start-up Unveils “Prometheus” AI Server to Tackle Memory Bottlenecks in Large-Scale Models

Majestic Labs introduced Prometheus, an AI server engineered to address the “memory wall,” a growing constraint in scaling modern AI workloads. The system adopts a memory-first architecture that connects significantly larger pools of high-speed memory directly to processing elements, aiming to reduce latency and improve utilization. The company positions Prometheus as an alternative to conventional GPU-centric designs, which often leave processors underutilized while waiting on data movement across fragmented memory hierarchies.

The Prometheus platform integrates up to 128 TB of shared, contiguous memory within a single standard-sized server, enabling execution of large-scale AI models that typically require distributed infrastructure. Majestic Labs claims the system can deliver performance comparable to multiple racks of traditional servers while reducing power consumption and total cost of ownership. The design reflects broader industry pressures, as hyperscaler capital expenditures on AI infrastructure continue to rise sharply, with a growing share allocated to memory and data movement challenges rather than raw compute.
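To put the 128 TB figure in context, the sketch below counts how many conventional accelerators would be needed just to match that capacity. The per-device memory sizes (80, 141, and 192 GB) are illustrative assumptions and do not come from Majestic Labs, as is the use of decimal terabytes. The comparison covers capacity only and says nothing about bandwidth or latency.

```python
# Capacity-only comparison: how many accelerators with a given amount of
# on-package memory would be needed to match a 128 TB pool.
# Assumptions (not from the article): decimal units, illustrative device sizes.

TB = 10**12  # decimal terabyte (assumption)
GB = 10**9   # decimal gigabyte (assumption)


def devices_to_match(pool_bytes: int, per_device_bytes: int) -> int:
    """Smallest number of devices whose combined memory covers the pool."""
    return -(-pool_bytes // per_device_bytes)  # integer ceiling division


if __name__ == "__main__":
    pool = 128 * TB
    for device_gb in (80, 141, 192):  # hypothetical per-device capacities
        count = devices_to_match(pool, device_gb * GB)
        print(f"{device_gb} GB per device: {count} devices to reach 128 TB")
```

At 80 GB per device, matching the pool takes 1,600 devices, which gives a sense of why the company frames the comparison in terms of racks.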

At the silicon level, Prometheus incorporates proprietary AI Processing Units (AIUs) called Ignite, combining ARM-based CPU cores with RISC-V vector and tensor engines in a unified memory space. The system supports widely used frameworks such as PyTorch, vLLM, and OpenAI Triton, allowing developers to run existing workloads without code modification. Majestic Labs states that the architecture can support multi-trillion-parameter models, large context windows, and emerging AI workloads such as mixture-of-experts and agentic systems within a single node.
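A rough footprint estimate shows why a multi-trillion-parameter model with a long context could plausibly fit in a 128 TB node. The sketch below is back-of-the-envelope arithmetic, not a description of how Prometheus allocates memory. The model shape (layer count, KV heads, head dimension), the 2-byte precision, and the decimal terabyte are all hypothetical values chosen for illustration.

```python
# Back-of-the-envelope memory footprint for a large transformer.
# Every model dimension here is a hypothetical example, not a figure
# published by Majestic Labs.

TB = 10**12  # decimal terabyte (assumption)


def weight_bytes(num_params: int, bytes_per_param: int = 2) -> int:
    """Memory for the weights alone (2 bytes per parameter ~ FP16/BF16)."""
    return num_params * bytes_per_param


def kv_cache_bytes(tokens: int, layers: int, kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Key/value cache: one K and one V vector per layer, head, and token."""
    per_token = 2 * layers * kv_heads * head_dim * bytes_per_value
    return per_token * tokens


if __name__ == "__main__":
    pool = 128 * TB
    weights = weight_bytes(2 * 10**12)           # hypothetical 2T-parameter model
    cache = kv_cache_bytes(tokens=1_000_000,     # hypothetical 1M-token context
                           layers=128, kv_heads=16, head_dim=128)
    total = weights + cache
    print(f"weights:  {weights / TB:.2f} TB")
    print(f"KV cache: {cache / TB:.2f} TB")
    print(f"total:    {total / TB:.2f} TB of {pool / TB:.0f} TB")
    print("fits in one node:", total <= pool)
```

Under these assumptions the weights take 4 TB and a single million-token cache about 1 TB, leaving most of the pool for additional sequences, activations, or larger expert counts.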

  • Memory-first architecture designed to overcome AI “memory wall” constraints
  • Up to 128 TB of high-speed, shared memory per server
  • Claims of performance equivalent to multiple racks in a single system
  • Ignite AIUs combine ARM CPUs with RISC-V vector/tensor cores
  • Supports PyTorch, vLLM, and Triton without code changes
  • Targets multi-trillion-parameter models and large context windows
  • Early deployments underway; broader availability expected next year

“Prometheus represents the first ground-up reimagination of AI infrastructure with memory as a first-class citizen,” said Ofer Shacham, Co-Founder and CEO of Majestic Labs.

Profile: Majestic Labs
Company: Majestic Labs
Founded: 2023
Headquarters: San Francisco, California, USA and Tel Aviv, Israel
CEO / Co-Founder: Ofer Shacham
Other Founders: Sha Rabii (President), Masumi Reynders (COO)
Founder Backgrounds: Engineering and product leadership experience at Google and Meta, with focus on custom silicon, AI infrastructure, and large-scale systems
Recent Funding: $100 million Series A (September 2025)
Key Investors: Lux Capital, Bow Wave Capital Management, Upfront Ventures, SBI Investment
Core Product: Prometheus AI Server (memory-first architecture)
Custom Silicon: Ignite AI Processing Units (AIUs)
Architecture: Memory-first design with large, shared, contiguous memory pool directly attached to compute
Maximum Memory: Up to 128 TB per server
Software Compatibility: Supports PyTorch, vLLM, and Triton without code modification
Target Workloads: Large language models, multi-trillion-parameter models, mixture-of-experts, agentic AI systems, graph neural networks
Mission: Expand access to advanced AI while improving power efficiency and reducing data center footprint

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

© 2026 Converge Digest - A private dossier for networking and telecoms.