A Cloud-Native Solution with the NVIDIA GH200 System, Supermicro System Management Tools, and aarna.ml GPU Cloud Management Software.
The telecom industry is undergoing a rapid transformation, driven by the convergence of Artificial Intelligence (AI), cloud-native technologies, and the accelerating demands of 5G and emerging 6G networks.
This joint whitepaper by Supermicro and aarna.ml introduces a comprehensive Reference Architecture for AI RAN Distributed Inference, built on the powerful NVIDIA GH200 Grace Hopper Platform and a flexible multi-tenant cloud management layer. Designed to help Communication Service Providers (CSPs) unlock new monetization streams, optimize RAN performance, and scale AI applications at the network edge, this future-proof architecture brings together high-performance hardware, intelligent software orchestration, and dynamic resource allocation.
The result: a scalable, low-latency, secure, and efficient solution with hard isolation between RAN and AI tenants, serving both workload types across distributed telco edge environments.
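The whitepaper describes how this isolation is realized in the reference architecture. As a rough illustration only, the sketch below shows one common way to enforce hard GPU partitioning between a RAN tenant and an AI tenant on a shared Kubernetes cluster, using the official `kubernetes` Python client and the `nvidia.com/gpu` extended resource exposed by the NVIDIA device plugin. This is a hypothetical example, not aarna.ml's actual implementation; the tenant names and quota values are assumptions for illustration.

```python
# Hypothetical sketch: hard GPU isolation between a RAN tenant and an AI
# tenant via Kubernetes namespaces and ResourceQuotas. Assumes the official
# `kubernetes` Python client and the NVIDIA device plugin exposing GPUs as
# the extended resource `nvidia.com/gpu`. Not aarna.ml's implementation.
from kubernetes import client, config

def create_tenant(api: client.CoreV1Api, name: str, gpu_limit: str) -> None:
    """Create an isolated tenant namespace with a hard GPU quota."""
    api.create_namespace(
        client.V1Namespace(metadata=client.V1ObjectMeta(name=name))
    )
    quota = client.V1ResourceQuota(
        metadata=client.V1ObjectMeta(name=f"{name}-gpu-quota", namespace=name),
        spec=client.V1ResourceQuotaSpec(
            # Quotas on extended resources use the `requests.` prefix.
            hard={"requests.nvidia.com/gpu": gpu_limit},
        ),
    )
    api.create_namespaced_resource_quota(namespace=name, body=quota)

if __name__ == "__main__":
    config.load_kube_config()  # or load_incluster_config() inside a pod
    core = client.CoreV1Api()
    create_tenant(core, "ran-tenant", "1")  # illustrative: one GPU for RAN
    create_tenant(core, "ai-tenant", "3")   # illustrative: remaining GPUs for AI
```

A namespace-scoped ResourceQuota gives each tenant a hard ceiling that the scheduler enforces, so a bursty AI inference workload cannot take GPU capacity away from the latency-sensitive RAN stack.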
What to Expect in This Whitepaper
- A detailed hardware and software reference architecture for AI-RAN distributed inference, leveraging the NVIDIA GH200 Grace Hopper System and NVIDIA Spectrum-X networking.
- Insights into aarna.ml’s GPU Cloud Management Software (AI-RAN Edition) that enables dynamic multi-tenancy, resource scaling, and AI/RAN workload orchestration.
- Real-world topologies and rack-level diagrams for both central and edge site deployments.
Download the Supermicro and aarna.ml whitepaper, "AI RAN Distributed Inference Solution - Reference Architecture," for further details.