This joint whitepaper from Supermicro and Aarna.ml introduces a comprehensive Reference Architecture for AI-RAN Distributed Inference, built on the NVIDIA GH200 Grace Hopper platform and a flexible multi-tenant cloud management layer. The architecture brings together high-performance hardware, intelligent software orchestration, and dynamic resource allocation to help Communication Service Providers (CSPs) unlock new monetization streams, optimize RAN performance, and scale AI applications at the network edge.
The result: a scalable, low-latency, secure, and efficient solution for distributed telco edge environments that serves both RAN and AI workloads.
Inside, you’ll find:
- A detailed hardware and software reference architecture for AI-RAN distributed inference, leveraging the NVIDIA GH200 Grace Hopper system and NVIDIA Spectrum-X networking.
- Insights into Aarna.ml’s GPU Cloud Management Software (AI-RAN Edition), which enables dynamic multi-tenancy, resource scaling, and AI/RAN workload orchestration.
- Real-world topologies and rack-level diagrams for both central and edge site deployments.
Schedule a demo for a tailored walkthrough.