Arm Breaks from Licensing-Only Model with First In-House Chip for AI Data Centers
In a significant shift from its longstanding business model, Arm Holdings has unveiled its first fully in-house designed and manufactured chip targeted specifically at AI data centers. Traditionally, Arm has operated as an intellectual property (IP) licensor, providing processor architectures to fabless companies such as Apple, Qualcomm, and Nvidia, which then design and produce their own silicon. This development marks Arm’s entry into the role of chip designer and vendor, offering a complete compute subsystem ready for deployment in hyperscale cloud environments.
The chip, known as the Arm Neoverse CSS V3 (Compute Subsystem V3), represents a turnkey solution that integrates Arm’s latest Neoverse V3 CPU cores with high-bandwidth memory (HBM3e), advanced networking fabrics, and AI acceleration capabilities. Unlike previous Neoverse offerings, which were modular IP blocks licensed to partners for custom integration, the CSS V3 is a pre-validated, production-ready platform. Arm plans to collaborate with manufacturing partners to produce and sell these chips directly to data center operators, streamlining deployment and reducing design risks for customers.
At the heart of the Neoverse CSS V3 are Arm’s Neoverse V3 cores, based on the Armv9-A architecture. These cores deliver up to 30 percent higher performance per socket than the prior Neoverse V2 generation, with gains in single-threaded performance, multi-threaded throughput, and energy efficiency. Each core is paired with up to 192 KB of L1 instruction cache and 128 KB of L1 data cache, while each core cluster shares up to 4 MB of L2 cache. The design emphasizes scalability, supporting configurations from single-socket systems up to 128-socket domains.
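On a deployed system, claimed cache sizes are easy to check against what the kernel actually reports. The following is a minimal C sketch, assuming a Linux host that exposes the standard sysfs cacheinfo interface; the paths are generic Linux, nothing here is CSS V3-specific:

```c
/* Enumerate cpu0's cache hierarchy via Linux sysfs (cacheinfo).
 * Minimal sketch; assumes the standard /sys cache layout. */
#include <stdio.h>

int main(void) {
    char path[128], level[16], type[32], size[32];
    for (int idx = 0; idx < 8; idx++) {   /* index0..index7 covers typical hierarchies */
        snprintf(path, sizeof path,
                 "/sys/devices/system/cpu/cpu0/cache/index%d/level", idx);
        FILE *f = fopen(path, "r");
        if (!f) break;                    /* no more cache levels exposed */
        fscanf(f, "%15s", level); fclose(f);

        snprintf(path, sizeof path,
                 "/sys/devices/system/cpu/cpu0/cache/index%d/type", idx);
        if (!(f = fopen(path, "r"))) break;
        fscanf(f, "%31s", type); fclose(f);

        snprintf(path, sizeof path,
                 "/sys/devices/system/cpu/cpu0/cache/index%d/size", idx);
        if (!(f = fopen(path, "r"))) break;
        fscanf(f, "%31s", size); fclose(f);

        printf("L%s %-12s %s\n", level, type, size);
    }
    return 0;
}
```

On a typical Neoverse-based server this prints one line per cache level (for example, `L1 Data 64K`), making it straightforward to compare shipped parts against the datasheet.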
A key differentiator for AI workloads is the integration of HBM3e memory, providing up to 16 stacks per socket with aggregate bandwidth exceeding 5 TB/s. This high-bandwidth memory is crucial for memory-intensive AI training and inference tasks, such as large language models and generative AI applications. The subsystem also incorporates Arm’s Confidential Compute Architecture (CCA), enabling secure enclaves for trusted execution that protect sensitive data in multi-tenant cloud settings.
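Aggregate figures in the terabytes per second only materialize across many cores and memory channels; any single thread sees far less. Still, a simple STREAM-style triad gives a feel for how such numbers are measured. This is a rough sketch, not the official STREAM benchmark, and the array size here is illustrative:

```c
/* Rough STREAM-style triad to estimate sustained memory bandwidth.
 * Not the official STREAM benchmark; single-threaded, so it reports
 * only a fraction of a socket's aggregate bandwidth. Build: gcc -O2 */
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define N (1UL << 27)                  /* 128M doubles = 1 GiB per array */

int main(void) {
    double *a = malloc(N * sizeof *a);
    double *b = malloc(N * sizeof *b);
    double *c = malloc(N * sizeof *c);
    if (!a || !b || !c) return 1;
    for (size_t i = 0; i < N; i++) { b[i] = 1.0; c[i] = 2.0; }

    struct timespec t0, t1;
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (size_t i = 0; i < N; i++)
        a[i] = b[i] + 3.0 * c[i];      /* triad: two reads, one write */
    clock_gettime(CLOCK_MONOTONIC, &t1);

    double secs  = (t1.tv_sec - t0.tv_sec) + (t1.tv_nsec - t0.tv_nsec) * 1e-9;
    double bytes = 3.0 * N * sizeof(double);   /* bytes moved by the triad */
    printf("triad: %.1f GB/s (check %.1f)\n", bytes / secs / 1e9, a[N / 2]);
    free(a); free(b); free(c);
    return 0;
}
```

Scaling this across threads pinned to every core, with allocations interleaved across HBM stacks, is what turns per-thread readings into the socket-level figures vendors quote.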
Networking is another focus area, with the CSS V3 featuring CMN-S3 (Coherent Mesh Network-Series 3) interconnects that support PCIe Gen5, CXL 3.0, and ultra-low-latency Ethernet options up to 800 Gbps. This allows seamless scaling across racks, enabling disaggregated compute and memory pools in AI clusters. For AI acceleration, the platform includes slots for GPU or DPU integration, with native support for Arm’s Scalable Vector Extension 2 (SVE2), which accelerates the vector and matrix operations at the core of deep learning.
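To make the SVE2 point concrete, here is a minimal vector-length-agnostic dot product written with ACLE intrinsics. The intrinsics used are part of the base SVE interface that SVE2 inherits; this is a sketch of the programming model, not a CSS V3-tuned kernel:

```c
/* Vector-length-agnostic dot product using ACLE SVE intrinsics.
 * Build: gcc -O2 -march=armv9-a+sve2 dot.c
 * A programming-model sketch, not production-tuned code. */
#include <arm_sve.h>
#include <stdio.h>

float dot(const float *a, const float *b, int n) {
    svfloat32_t acc = svdup_f32(0.0f);            /* per-lane accumulators */
    for (int i = 0; i < n; i += (int)svcntw()) {  /* svcntw(): 32-bit lanes per vector */
        svbool_t pg = svwhilelt_b32(i, n);        /* predicate masks the final partial vector */
        svfloat32_t va = svld1_f32(pg, a + i);
        svfloat32_t vb = svld1_f32(pg, b + i);
        acc = svmla_f32_m(pg, acc, va, vb);       /* acc += va * vb on active lanes */
    }
    return svaddv_f32(svptrue_b32(), acc);        /* horizontal sum of all lanes */
}

int main(void) {
    float a[7] = {1, 2, 3, 4, 5, 6, 7}, b[7] = {1, 1, 1, 1, 1, 1, 1};
    printf("%.1f\n", dot(a, b, 7));               /* 1+2+...+7 = 28.0 */
}
```

Because the loop advances by `svcntw()` and lets the predicate handle the tail, the same binary runs unmodified on whatever vector length the silicon implements.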
Arm’s move into in-house chip production is driven by the explosive growth of AI infrastructure demands. Hyperscalers like AWS, Google Cloud, and Microsoft Azure require optimized, power-efficient silicon to handle the computational loads of training trillion-parameter models. By offering a complete subsystem, Arm reduces the time-to-market for these operators from years to months, while maintaining compatibility with its vast ecosystem of software tools, including Linux distributions optimized for Arm64 and frameworks like PyTorch and TensorFlow.
Performance benchmarks shared by Arm highlight the CSS V3’s prowess. In SPECint2017 tests, it achieves up to 2.5 times the integer performance per watt of competitors. For AI-specific workloads, simulations using MLPerf benchmarks show 40 percent better inference throughput on Llama 2 models compared to Armv8-based systems. Power efficiency is a standout, with the platform targeting under 500W per socket in high-density configurations, critical for sustainable data centers facing energy constraints.
This initiative builds on Arm’s prior forays into reference designs, such as the Neoverse N2 platform used in AWS Graviton4 processors. However, the CSS V3 is Arm’s boldest step yet, positioning it as a direct rival to Intel’s Xeon, AMD’s EPYC, and Nvidia’s Grace CPU in the AI server market. Arm executives emphasize that this does not abandon the licensing model—over 99 percent of Arm-based chips will still come from licensees—but complements it by providing an “Arm-native” option for customers seeking fully optimized, end-to-end solutions.
Manufacturing details remain under wraps, but Arm has indicated partnerships with leading foundries like TSMC, likely on 3nm or 2nm processes for volume production starting in 2025. Early access programs are already underway with select hyperscalers, promising rapid adoption. This evolution reflects broader industry trends, in which IP providers such as Arm and RISC-V design houses are verticalizing to capture more value in the $100 billion-plus data center CPU market.
For developers and system integrators, the CSS V3 offers robust software support. It is fully compatible with standard Armv9 toolchains, including GCC, LLVM, and ACLE intrinsics. Operating systems like Ubuntu, Red Hat Enterprise Linux, and SUSE Linux Enterprise Server have validated support, with kernel patches for CCA and SVE2 already upstreamed. Arm’s SystemReady compliance program ensures interoperability with BMCs and orchestration platforms like Kubernetes.
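Code that ships SVE2-optimized paths typically gates them behind a runtime check rather than assuming the feature is present. A minimal Linux sketch, assuming arm64 kernel headers that define the standard hwcap bits:

```c
/* Runtime detection of SVE/SVE2 on Linux/arm64 via auxiliary-vector hwcaps.
 * Minimal sketch; the HWCAP names come from arm64 kernel headers. */
#include <stdio.h>
#include <sys/auxv.h>
#include <asm/hwcap.h>

int main(void) {
    unsigned long hw  = getauxval(AT_HWCAP);
    unsigned long hw2 = getauxval(AT_HWCAP2);
#ifdef HWCAP_SVE
    printf("SVE : %s\n", (hw & HWCAP_SVE) ? "yes" : "no");
#endif
#ifdef HWCAP2_SVE2
    printf("SVE2: %s\n", (hw2 & HWCAP2_SVE2) ? "yes" : "no");
#endif
    return 0;
}
```

Higher-level stacks follow the same pattern under the hood, selecting SVE2 kernels only when the corresponding hwcap bit is present.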
Challenges ahead include competition from established players and the need to prove real-world scalability. Nonetheless, Arm’s deep expertise in energy-efficient architectures, honed over decades, gives it a strong foothold in the AI era, where performance-per-watt reigns supreme.
What are your thoughts on this? I’d love to hear about your own experiences in the comments below.