Upscale AI argues that traditional cloud and front-end networks, which are largely based on a client-server architecture, are fundamentally ill-suited for the unique demands of AI workloads. While standard web traffic is connection-oriented and tolerant of latency, AI clusters rely on collective communication where GPUs perform synchronized all-to-all data exchanges. This shift results in a move from north-south traffic patterns to intense east-west traffic, where a single request triggers massive bursts of data across the fabric. The presentation establishes that to maintain efficiency, the network must evolve from a reactive system to an architected substrate that treats the entire cluster as a single, coordinated engine.
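The shift from north-south to east-west traffic can be made concrete with a quick flow-count comparison. The sketch below is illustrative only (the function names and cluster sizes are made up, not from the presentation): a client-server pattern generates one flow per client, while an all-to-all collective generates a flow from every GPU to every other GPU, so the flow count grows quadratically with cluster size.

```python
# Illustrative flow-count comparison (hypothetical helper names):
# north-south traffic scales linearly with client count, while an
# all-to-all collective scales quadratically with GPU count.

def north_south_flows(n_clients: int) -> int:
    # one request/response flow per client to a single server
    return n_clients

def all_to_all_flows(n_gpus: int) -> int:
    # every GPU sends a data shard to every other GPU
    return n_gpus * (n_gpus - 1)

if __name__ == "__main__":
    for n in (8, 64, 1024):
        print(f"{n} nodes: {north_south_flows(n)} north-south flows, "
              f"{all_to_all_flows(n)} east-west flows")
```

At 1,024 GPUs the all-to-all pattern produces over a million simultaneous flows from what is logically a single request, which is the burstiness the fabric has to absorb.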
AI networking requires a radical departure from the traditional OSI seven-layer processing model. In a standard network, packets traverse the full stack and are processed by the CPU. AI traffic, however, uses RDMA (Remote Direct Memory Access) to bypass the kernel and CPU entirely, performing zero-copy memory transactions directly between GPUs. This creates a different packet profile in which the payload is memory itself rather than application data. Furthermore, while cloud networks handle congestion reactively through TCP retransmits, AI clusters require a lossless environment: a single dropped packet can stall thousands of GPUs, producing computational head-of-line blocking that halts progress across the entire token factory.
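The cost of that head-of-line blocking follows from simple arithmetic: a synchronized collective completes only when its slowest participant arrives, so one retransmit delay on a single link idles every other GPU in the step. The sketch below is a toy model, not the presenters' math; the GPU count, step time, and retransmit timeout are assumed values for illustration.

```python
# Toy model of computational head-of-line blocking in a synchronized
# collective: the step finishes only when the last GPU arrives, so a
# single straggler's delay is multiplied across the whole cluster.

def step_time(per_gpu_times_us):
    # a barrier/collective completes when the slowest GPU finishes
    return max(per_gpu_times_us)

def total_stall(per_gpu_times_us):
    # GPU-microseconds wasted waiting on the straggler
    slowest = max(per_gpu_times_us)
    return sum(slowest - t for t in per_gpu_times_us)

# 4,096 GPUs each finish their shard in 100 us (assumed numbers)...
times = [100] * 4096
# ...but one dropped packet adds a ~4 ms retransmit delay on one GPU
times[0] += 4000
```

With these assumed numbers, one lost packet turns a 100-microsecond step into a 4,100-microsecond step and burns over 16 GPU-seconds of aggregate compute, which is why the fabric must be lossless rather than rely on reactive retransmission.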
To solve these challenges, Upscale AI advocates for a purpose-built network stack that optimizes every layer from silicon to software. Traditional data center switches are often burdened by bloated feature sets and complex pipelines designed for general-purpose routing, which increases power consumption and latency. By stripping away unnecessary protocols and focusing on AI-specific requirements like microsecond-level telemetry and adaptive load balancing to prevent hash collisions, the company aims to deliver a more efficient fabric. The speakers conclude that achieving a 100% success rate for collective communication is necessary to maximize tokens per watt, moving beyond the tuning of existing hardware toward a clean-sheet architecture designed for the next decade of AI scale.
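The hash-collision problem mentioned above can be sketched in a few lines. This is a generic illustration of static ECMP versus least-loaded placement, not Upscale AI's actual load-balancing mechanism; the flow tuples, link count, and hash choice are all assumptions made for the example.

```python
# Hedged sketch: static ECMP hashes each flow's identifier to an uplink,
# which can pile several flows onto one link while others sit idle.
# An adaptive scheme places each flow on the currently least-loaded link.
import hashlib

NUM_LINKS = 4

def ecmp_link(flow_tuple, num_links=NUM_LINKS):
    # deterministic hash of the flow identifier always picks the same uplink
    digest = hashlib.md5(repr(flow_tuple).encode()).digest()
    return digest[0] % num_links

# eight long-lived flows (made-up addresses; 4791 is the RoCEv2 UDP port)
flows = [("10.0.0.%d" % i, "10.0.1.%d" % i, 4791) for i in range(8)]

static_load = [0] * NUM_LINKS
for f in flows:
    static_load[ecmp_link(f)] += 1

adaptive_load = [0] * NUM_LINKS
for f in flows:
    adaptive_load[adaptive_load.index(min(adaptive_load))] += 1

# adaptive placement of 8 equal flows over 4 links is perfectly even
```

With eight flows and four links, the pigeonhole principle guarantees at least one collision under static hashing, and the elephant flows typical of collective traffic make such collisions expensive; adaptive placement keeps the links evenly loaded.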
Personnel: Aravind Srikumar