|
This video is part of the appearance, “Keysight Presents at AI Field Day 5“. It was recorded as part of AI Field Day 5 at 8:00-9:30 on September 11, 2024.
Watch on YouTube
Watch on Vimeo
AI deployment is growing rapidly and the race to train and deliver new AI models quickly and efficiently is a top priority. The Keysight AI Data Center Test Platform is designed to accelerate innovation in AI network fabric validation and optimization, enabling you to test today’s AI networks with confidence. This presentation introduces Keysight, the challenges our customers face and why realistic emulation and testing of AI workloads is critical.
In the presentation by Ankur Sheth from Keysight Technologies, the focus is on the rapid growth of AI deployment and the critical need for effective testing of AI network infrastructures. Keysight, with its rich history stemming from Hewlett Packard, has established itself as a leader in test and measurement solutions across various technology sectors. The company has evolved through acquisitions and innovations, positioning itself to address the unique challenges posed by the increasing complexity of AI networks. As AI technologies proliferate, particularly in hyperscale environments, the demand for robust testing solutions becomes paramount to ensure that the underlying infrastructure can support the high bandwidth, low latency, and reliability required for optimal performance.
Sheth highlights the significant role that network failures play in the inefficiencies of AI training jobs, noting that 20% of failures can be attributed to network issues. With GPUs being the most expensive resources in AI infrastructures, it is crucial to minimize their idle time caused by data transfer delays. The challenges of testing at scale are compounded by the high costs and limited availability of GPUs, making it impractical to create large test environments. As a result, the need for realistic emulation and testing of AI workloads is emphasized, as it allows operators to identify and resolve potential network issues before deploying their systems in production.
To address these challenges, Keysight introduces its AI Data Center Test Platform, which combines advanced hardware and software solutions tailored for testing AI network fabrics. This platform enables testing without the need for physical GPUs, thereby alleviating some of the cost and resource constraints faced by operators. The presentation sets the stage for a deeper exploration of the specific tools and methodologies that Keysight offers, such as the ARIES-1 platform of traffic generators, which are designed to facilitate effective testing and validation of AI networks. By providing these innovative solutions, Keysight aims to empower its customers to accelerate their AI initiatives and ensure the reliability of their network infrastructures.
Personnel: Ankur Sheth