HappycapyGuide

This article contains affiliate links. We may earn a commission at no extra cost to you if you sign up through our links.

AI Hardware

Arm Just Made Its First Chip in 35 Years — and OpenAI, Meta, and Cloudflare Are First in Line

March 30, 2026  ·  Happycapy Guide

TL;DR
On March 24, 2026, Arm Holdings made history: after 35 years of only licensing chip designs, the company unveiled its first in-house silicon — the AGI CPU. The 136-core data center processor targets AI infrastructure directly, with 2x performance per rack versus x86 and 40% lower power draw. Meta co-designed it. OpenAI, Cloudflare, Cerebras, and SAP are already customers. Arm stock jumped 16% on the news. Here is what it means for the AI industry.
The numbers at a glance:
  • 136 Neoverse V3 cores (dual chiplet)
  • 2x performance per rack vs x86
  • 300W TDP vs 500W for AMD/Intel
  • $10B saved per GW of AI datacenter

The Historic Shift: From Licensor to Silicon Maker

Arm Holdings was founded in 1990 as a joint venture between Acorn Computers, Apple, and VLSI Technology. For the next 35 years, its business model was simple and extraordinarily lucrative: design chip architectures, license them to other manufacturers, collect royalties. That is how Arm became the foundation of virtually every smartphone processor on the planet — Apple A-series, Qualcomm Snapdragon, Samsung Exynos — without making a single chip itself.

On March 24, 2026, Arm changed that model forever. The AGI CPU is Arm's first piece of physical silicon — designed, tested, and sold by Arm directly. It is not a general-purpose processor but a chip built specifically for agentic AI workloads in data centers: the exact compute category that is now consuming billions of dollars in infrastructure investment every month.

Arm's stock surged approximately 16% on the announcement. The company projects the AGI CPU will generate $15 billion in annual revenue by 2031 — a number that, if realized, would nearly double Arm's current revenue base.

AGI CPU Specifications

  • Core count: Up to 136 cores (dual-chiplet design, Neoverse V3)
  • Process node: TSMC 3nm (same node as the Apple M4 family)
  • Clock speed: 3.7 GHz boost, 3.2 GHz all-core sustained
  • Thermal design power: 300W TDP (vs 500W for AMD EPYC / Intel Xeon)
  • Memory bandwidth: 800+ GB/s (12-channel DDR5 configuration)
  • PCIe / CXL: 96 PCIe Gen6 lanes, native CXL 3.0 support
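As a rough plausibility check on the 800+ GB/s figure: a 12-channel DDR5 configuration only reaches that range at transfer rates around 8800 MT/s. That speed grade is our assumption for the sketch below, not a published Arm spec.

```python
# Back-of-envelope check on the "800+ GB/s" claim for 12-channel DDR5.
# The 8800 MT/s transfer rate is an assumption, not an Arm-confirmed figure.
CHANNELS = 12
BYTES_PER_TRANSFER = 8           # 64-bit-wide DDR5 channel
TRANSFER_RATE_MT_S = 8800        # assumed speed grade (hypothetical)

bandwidth_gb_s = CHANNELS * BYTES_PER_TRANSFER * TRANSFER_RATE_MT_S * 1e6 / 1e9
print(f"Peak theoretical bandwidth: {bandwidth_gb_s:.1f} GB/s")
```

At that assumed speed the theoretical peak works out to roughly 845 GB/s, consistent with the quoted "800+ GB/s".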

Launch Customers

Meta served as the co-development partner, helping Arm optimize the chip for Meta's AI infrastructure, where it will run alongside Meta's custom MTIA accelerators. The confirmed customer roster at launch:

Meta (lead partner)
OpenAI
Cloudflare
Cerebras
SAP
SK Telecom
Rebellions
F5
Positron

Server OEM partners building systems around the AGI CPU include Lenovo, Supermicro, ASRock Rack, and Quanta Computer. Systems are available to order now; volume shipments begin H2 2026.

Why This Changes AI Infrastructure Economics

The headline number: Arm estimates the AGI CPU saves up to $10 billion in capital expenditure per gigawatt of AI data center capacity versus x86 alternatives — driven by 2x performance per rack and 40% lower power consumption.

Power is the binding constraint in AI infrastructure. Every major tech company — Microsoft, Google, Meta, Amazon — is racing to build data centers faster than the grid can supply electricity. A chip that delivers the same AI compute at 40% lower power does not just reduce the electricity bill. It means more inference capacity in the same physical space, from the same power allocation, sooner.

For cloud AI providers, the downstream effect is lower per-query inference costs over time. A data center running 2x more efficiently is a data center that can price AI services more competitively — or generate more margin at the same price.
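The capacity math above can be sketched by naively compounding the article's two headline claims (2x performance per rack, 40% lower power). The absolute power budget and per-rack figures below are illustrative placeholders, not vendor numbers.

```python
# Illustrative fixed-power-budget math using the article's claims:
# 2x performance per rack and 40% lower power vs x86.
# Absolute numbers (facility budget, rack draw) are made up for the sketch.
POWER_BUDGET_KW = 1000           # hypothetical facility power allocation
X86_RACK_KW = 20                 # hypothetical x86 rack draw
X86_RACK_PERF = 1.0              # baseline performance units per rack

arm_rack_kw = X86_RACK_KW * (1 - 0.40)   # the 40% lower power claim
arm_rack_perf = X86_RACK_PERF * 2.0      # the 2x per-rack performance claim

x86_racks = POWER_BUDGET_KW // X86_RACK_KW
arm_racks = POWER_BUDGET_KW // arm_rack_kw

x86_total = x86_racks * X86_RACK_PERF
arm_total = arm_racks * arm_rack_perf
print(f"x86: {x86_racks:.0f} racks, {x86_total:.0f} perf units")
print(f"Arm: {arm_racks:.0f} racks, {arm_total:.0f} perf units")
print(f"Capacity gain at fixed power: {arm_total / x86_total:.2f}x")
```

Under these toy numbers the same power allocation hosts more racks, each doing more work, which is the "more inference capacity from the same power allocation" argument in quantitative form.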

Try Happycapy Pro — AI agents powered by next-gen infrastructure from $17/mo

What the AGI CPU Is Not

Despite the name, the AGI CPU does not run AI models by itself. It is a central processing unit optimized for orchestration, inference serving, and memory bandwidth — the compute tasks around AI workloads. It is designed to pair with GPU accelerators (like Nvidia H100s or Cerebras chips), not replace them. The "AGI" in the name is marketing shorthand for "AI-native data center compute," not a claim that the chip has anything to do with artificial general intelligence.

Arm also continues its traditional licensing business. The AGI CPU is an addition to the business model, not a replacement. Apple, Qualcomm, Samsung, and others still license Arm architecture for their own custom chips.

Arm AGI CPU vs Competing Data Center CPUs

Processor                       Cores               TDP       Memory BW        Process   AI Focus
Arm AGI CPU                     136 (Neoverse V3)   300W      800+ GB/s DDR5   TSMC 3nm  Yes — built for AI
AMD EPYC Genoa (9004)           96 (Zen 4)          360–400W  460 GB/s DDR5    TSMC 5nm  Partial
Intel Xeon SP (Granite Rapids)  128                 500W      307 GB/s DDR5    Intel 3   Partial
AWS Graviton 4                  96 (Neoverse V2)    330W      537 GB/s DDR5    TSMC 4nm  Partial
Ampere Altra Max                128 (Neoverse N1)   250W      400 GB/s DDR4    TSMC 5nm  General
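One way to read the table is to normalize memory bandwidth by TDP. The sketch below uses the table's own figures, taking the low end wherever a range is quoted.

```python
# Memory bandwidth per watt of TDP, computed from the comparison table above.
# Where the table quotes a range (e.g. 360-400W), the low end is used.
chips = {
    "Arm AGI CPU":            (800, 300),  # (GB/s, TDP in W)
    "AMD EPYC Genoa":         (460, 360),
    "Intel Granite Rapids":   (307, 500),
    "AWS Graviton 4":         (537, 330),
    "Ampere Altra Max":       (400, 250),
}
# Rank chips by bandwidth per watt, highest first.
for name, (bw, tdp) in sorted(chips.items(), key=lambda kv: -kv[1][0] / kv[1][1]):
    print(f"{name:22s} {bw / tdp:.2f} GB/s per watt")
```

On that metric the AGI CPU leads at about 2.67 GB/s per watt, with Graviton 4 and Altra Max next at roughly 1.6, which is consistent with the article's framing of memory bandwidth per watt as the chip's differentiator.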

What This Means for People Who Use AI Tools

Infrastructure improvements do not show up in AI tool pricing immediately. But the direction is clear: more efficient data center compute means lower cost per query, which means AI tools can do more per dollar over time.

Timeline to user impact:
  • H2 2026: AGI CPU volume shipments begin; first deployments at early customers (Meta, OpenAI, Cloudflare)
  • 2027: Broad datacenter rollout; providers begin migrating workloads to Arm-based racks
  • 2028+: Infrastructure efficiency gains start flowing through to pricing; AI inference costs continue falling

For users of AI tools like Happycapy today, the practical implication is that the direction of travel on price-performance is strongly positive. The same $17/month Pro subscription will have access to more compute capacity as the infrastructure underlying it becomes more efficient.

Frequently Asked Questions

What is the Arm AGI CPU?

The Arm AGI CPU is Arm Holdings' first-ever in-house silicon chip, unveiled on March 24, 2026. It features 136 Neoverse V3 cores on TSMC 3nm, delivers 2x performance per rack versus x86 processors, and runs at 300W TDP compared to 500W for competing AMD EPYC or Intel Xeon parts. Meta is the lead design partner; OpenAI, Cloudflare, and SAP are among the first customers.

Why is the Arm AGI CPU significant?

Arm has licensed chip architecture for 35 years without making its own silicon. The AGI CPU marks a fundamental shift in its business model, directly competing with Intel and AMD in data centers. For AI specifically, its 2x performance per rack and 40% lower power draw could save up to $10 billion in capital expenditure per gigawatt of data center capacity.

When will the Arm AGI CPU be widely available?

The AGI CPU is available for order immediately through server partners including Lenovo, Supermicro, ASRock Rack, and Quanta Computer. Volume shipments are expected in the second half of 2026, with material revenue impact projected from 2028 onward.

How does the Arm AGI CPU affect AI tool costs?

More efficient data center infrastructure means lower inference costs for cloud AI providers over time. If providers like OpenAI can serve the same AI workloads from racks that use 40% less power, per-query pricing can fall. The savings compound at scale — Arm estimates $10 billion saved per gigawatt of AI data center capacity.

Start with Happycapy — AI agents, memory, and skills from $17/mo