Tech Field Day

The Independent IT Influencer Event


Taking the Keysight AI Data Center Test Platform for a Test Drive



AI Field Day 5


This video is part of the appearance “Keysight Presents at AI Field Day 5”. It was recorded at AI Field Day 5 from 8:00 to 9:30 on September 11, 2024.



This demonstration of the AI Data Center Test Platform shows how network events impact completion times. The first demo shows the effects of congestion on completion times and how poor fabric utilization hurts performance. The platform also shows how increasing the parallelism of data transfer improves utilization and completion times.

In the presentation by Keysight Technologies at AI Field Day 5, Ankur Sheth, Director of AI Test R&D, demonstrated the AI Data Center Test Platform, focusing on how network events impact completion times. The setup emulated a server with eight GPUs connected to a two-tier fabric network, using a Keysight AresONE box to emulate the GPUs and network interface cards (NICs). The demonstration aimed to show the effects of network congestion on performance and how increasing the parallelism of data transfer can improve fabric utilization and completion times. The first scenario examined the impact of congestion on the network and revealed poor performance caused by misconfigured congestion control settings.

Sheth explained the configuration and results of running an AllReduce collective operation, which is commonly used during the backward pass of a training job. The initial test showed that the network’s poor configuration led to low utilization and high latency, with only 25% of the theoretical throughput achieved. Per-flow completion times and their cumulative distribution functions (CDFs) showed large discrepancies in data transfer times, pointing to a problem in the network configuration. After the network settings were adjusted, particularly the Priority Flow Control (PFC) settings, performance improved dramatically, reaching 95% utilization and significantly shorter completion times.
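The utilization and CDF views described above can be approximated from raw flow records. The following is a minimal Python sketch, not the platform's API: the function names, the 400 Gb/s line rate, and the sample completion times are all hypothetical and chosen only for illustration.

```python
import math
from bisect import bisect_right


def utilization(bytes_transferred, completion_seconds, line_rate_gbps):
    """Achieved throughput as a fraction of the link's line rate."""
    achieved_gbps = bytes_transferred * 8 / completion_seconds / 1e9
    return achieved_gbps / line_rate_gbps


def empirical_cdf(samples):
    """Return a function F where F(x) is the fraction of samples <= x."""
    ordered = sorted(samples)

    def cdf(x):
        return bisect_right(ordered, x) / len(ordered)

    return cdf


def percentile(samples, p):
    """Nearest-rank percentile, for p in (0, 100]."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(p / 100 * len(ordered)))
    return ordered[rank - 1]


# Hypothetical flow completion times in milliseconds (illustrative only).
fct_ms = [10, 11, 11, 12, 12, 13, 40, 45]
cdf = empirical_cdf(fct_ms)
print(f"share of flows done within 13 ms: {cdf(13):.2f}")
print(f"p50 = {percentile(fct_ms, 50)} ms, p99 = {percentile(fct_ms, 99)} ms")
# Hypothetical transfer: 1 GB moved in 80 ms over an assumed 400 Gb/s link.
print(f"utilization: {utilization(1e9, 0.08, 400):.2f}")
```

A wide gap between the median and the high percentiles, as in the made-up sample above, is the kind of discrepancy a CDF makes visible: most flows finish quickly while a few stragglers hold up the whole collective.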

In a second experiment, Sheth demonstrated the impact of using different algorithms and increasing the number of queue pairs (QPs), the connections used by the RDMA over Converged Ethernet (RoCE) protocol. The halving-doubling algorithm initially showed average performance with significant tail latencies. Increasing the number of queue pairs from one to eight improved the network’s performance, with more data transferred in parallel and more consistent transfer times. This change allowed the network to load balance the traffic better, resulting in more efficient utilization. The presentation concluded with a demonstration of how the platform’s metrics and data can be integrated into automated test cases and analyzed using tools like Jupyter notebooks, giving network designers and engineers useful insight.
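To make the second experiment concrete, here is a rough Python sketch of a halving-doubling exchange schedule and of how splitting each transfer across several queue pairs gives a hash-based load balancer more flows to spread. It rests on assumptions not stated in the presentation: a power-of-two rank count, one common ordering of partner distances, every flow crossing an uplink, and a fabric that picks uplinks by hashing a per-flow key. All names and the SHA-256 hash are hypothetical stand-ins.

```python
import hashlib


def halving_doubling_steps(num_ranks, message_bytes):
    """List (phase, partner_distance, bytes_per_exchange) for each step."""
    if num_ranks < 2 or num_ranks & (num_ranks - 1):
        raise ValueError("num_ranks must be a power of two >= 2")
    steps = []
    distance, chunk = num_ranks // 2, message_bytes // 2
    while distance >= 1:  # reduce-scatter: exchanged data halves each step
        steps.append(("reduce-scatter", distance, chunk))
        distance //= 2
        chunk //= 2
    distance, chunk = 1, message_bytes // num_ranks
    while distance < num_ranks:  # all-gather: exchanged data doubles each step
        steps.append(("all-gather", distance, chunk))
        distance *= 2
        chunk *= 2
    return steps


def uplink_loads(num_ranks, distance, chunk_bytes, queue_pairs, num_uplinks):
    """Hash each (src, dst, qp) flow of one step onto an uplink.

    Returns the bytes carried by each uplink. The hash stands in for
    whatever per-flow load balancing the real fabric applies.
    """
    loads = [0.0] * num_uplinks
    for src in range(num_ranks):
        dst = src ^ distance  # exchange partner for this step
        for qp in range(queue_pairs):
            key = f"{src}-{dst}-{qp}".encode()
            digest = hashlib.sha256(key).digest()
            link = int.from_bytes(digest[:4], "big") % num_uplinks
            loads[link] += chunk_bytes / queue_pairs
    return loads


steps = halving_doubling_steps(8, 8000)
for phase, distance, chunk in steps:
    print(f"{phase:>14}: partner = rank XOR {distance}, {chunk} bytes")

_, first_distance, first_chunk = steps[0]
for qps in (1, 8):
    loads = uplink_loads(8, first_distance, first_chunk, qps, num_uplinks=4)
    print(f"{qps} queue pair(s): busiest uplink carries {max(loads):.0f} bytes")
```

With a single queue pair, each rank contributes one large flow per step, so a few hash collisions can overload one uplink. More queue pairs produce more, smaller flows, which tends to flatten the per-uplink load, although the exact distribution depends on the hash and the topology.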

Personnel: Ankur Sheth
