Alex Saroyan presented for Netris at AI Infrastructure Field Day 2
Introduction to Multi-Tenancy & Network Automation for AI Infrastructure Operators with Netris
Netris helps GPU-based AI infrastructure operators automate their networks, provide multi-tenancy and isolation, and offer essential cloud networking features like VPCs, internet gateways, and load balancers. Netris focuses on network software for AI and cloud infrastructure operators because demanding AI workloads call for specialized networking. Netris’s technology is particularly well aligned with NVIDIA’s networking offerings, which build on the foundations of Mellanox and Cumulus Networks.
The presentation highlights the importance of dynamic multi-tenancy for maximizing the utilization of expensive GPUs. Netris provides “cloud provider grade network automation software” that allows AI infrastructure operators to achieve security levels comparable to physical isolation while maintaining software-driven speed. This solves the problem of manual network configuration, which is time-consuming, error-prone, and doesn’t scale. Furthermore, Netris supports cloud networking functions like Internet gateways, NAT gateways, and load balancers, offering a complete solution that addresses the need for secure and flexible network management in AI environments.
Netris’s solution is built on three key pillars: VPCs for isolation, cloud networking functions for connectivity, and fabric management for network operations. They manage both Ethernet and InfiniBand fabrics, providing operators with a single pane of glass. For InfiniBand fabrics, Netris integrates with NVIDIA’s UFM controllers. On the Ethernet side, Netris acts as the fabric manager for several vendors, including NVIDIA, Dell, and Arista, automating the management of network switches and streamlining operations. The goal is to offer a comprehensive, integrated network automation platform tailored for the demands of AI infrastructure.
Personnel: Alex Saroyan
How it works. Multi-Tenancy & Network Automation for AI Infrastructure Operators with Netris
Netris, as presented by CEO Alex Saroyan, offers cloud-provider-grade network automation and multi-tenancy software tailored for AI infrastructure operators. The core of the solution is the Netris Controller, which acts as the centralized source of truth for network engineers. It supports modeling and simulating network infrastructure with tools like Terraform and CloudSim, and it provides APIs that can integrate into cloud provider platforms to create VPCs and manage network functions. A key component of the offering is SoftGate, a gateway that runs on Linux servers and provides functions such as elastic load balancing and NAT. It offers a streamlined, integrated alternative to separate third-party products.
The presentation details Netris’ approach to day-zero and day-one operations, highlighting the use of Terraform for infrastructure-as-code methodologies and how the controller facilitates the deployment and management of switches from various vendors. The system supports granular multi-tenancy through VXLANs and is designed to integrate with shared storage solutions. Netris handles both access and isolation between shared storage and tenants on the network, and it intends to integrate directly with storage vendors through their APIs. This setup gives AI infrastructure operators a cloud-like experience, streamlining the onboarding of tenants and the allocation of resources.
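To make the VXLAN-based multi-tenancy above more concrete, here is a minimal Python sketch of giving each tenant VPC its own VXLAN network identifier (VNI). It is an illustration only, not Netris code: the `VniAllocator` class, its method names, and the starting value of the tenant range are hypothetical. The one fixed fact it relies on is that a VNI is 24 bits wide.

```python
from dataclasses import dataclass, field

VNI_MAX = 2**24 - 1  # VXLAN network identifiers are 24 bits wide


@dataclass
class VniAllocator:
    """Hands out one VNI per tenant VPC (hypothetical helper, not a Netris API)."""
    first: int = 10_000          # assumed start of the tenant range
    last: int = VNI_MAX
    _by_vpc: dict = field(default_factory=dict)
    _free: list = field(default_factory=list)
    _next: int = 0

    def __post_init__(self):
        self._next = self.first

    def allocate(self, vpc_name: str) -> int:
        # Idempotent: asking twice for the same VPC returns the same VNI.
        if vpc_name in self._by_vpc:
            return self._by_vpc[vpc_name]
        if self._free:
            vni = self._free.pop()       # reuse a released identifier first
        elif self._next <= self.last:
            vni = self._next
            self._next += 1
        else:
            raise RuntimeError("VNI range exhausted")
        self._by_vpc[vpc_name] = vni
        return vni

    def release(self, vpc_name: str) -> None:
        # Return the VNI to the pool when a tenant is offboarded.
        self._free.append(self._by_vpc.pop(vpc_name))


allocator = VniAllocator()
vni_a = allocator.allocate("tenant-a")
vni_b = allocator.allocate("tenant-b")
```

Because each VPC maps to a distinct VNI, traffic from different tenants stays in separate overlay segments even when it crosses the same physical switches.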
Netris differentiates itself by being multi-vendor and providing cloud networking constructs not typically found in traditional network automation platforms. The presentation emphasized the efficiency and integration provided by SoftGate, which eliminates the complexity of connecting firewalls and load balancers while supporting InfiniBand through integration with NVIDIA UFM. Alex expressed confidence in Netris’ position, particularly given the growing demand for cloud-provider-like capabilities in the AI infrastructure space.
Personnel: Alex Saroyan
Multi-Tenancy & Network Automation for AI Infrastructure Operators Demonstrated with Netris
Netris CEO Alex Saroyan demonstrated the company’s multi-tenancy and network automation solution for AI infrastructure. The presentation began with a live demonstration of the Netris controller, showcasing how it facilitates the setup and management of AI infrastructure networking. Using Terraform modules and a “CloudSim” simulation, Saroyan illustrated the process of initializing the controller, generating network configurations from user-defined parameters, and creating a digital twin of the network for validation.
The core of the presentation focused on day-2 operations, specifically the creation and management of tenants and network isolation. Using templates, Saroyan showed how to establish isolated clusters (VPCs) for different tenants. These templates translate high-level server assignments into low-level switch port configurations, enabling a cloud-native approach to network management. The demo also highlighted the use of Elastic IPs to expose the internal clusters to the outside world.
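The translation from server assignments to switch port settings can be sketched as follows. This is a simplified illustration under stated assumptions, not the Netris template format: the `CABLING` inventory, the `PortConfig` type, the `expand_template` function, and all device and port names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PortConfig:
    switch: str
    port: str
    vni: int   # overlay segment the port is placed in


# Hypothetical cabling inventory: server name -> (switch, port) pairs.
CABLING = {
    "gpu-node-01": [("leaf-1", "swp1"), ("leaf-2", "swp1")],
    "gpu-node-02": [("leaf-1", "swp2"), ("leaf-2", "swp2")],
    "gpu-node-03": [("leaf-1", "swp3"), ("leaf-2", "swp3")],
}


def expand_template(servers, vni, cabling=CABLING):
    """Turn 'these servers belong to this VPC' into per-port settings."""
    configs = []
    for server in servers:
        if server not in cabling:
            raise KeyError(f"no cabling record for {server}")
        for switch, port in cabling[server]:
            configs.append(PortConfig(switch, port, vni))
    return configs


# Assign two servers to a tenant VPC identified by an assumed VNI.
tenant_ports = expand_template(["gpu-node-01", "gpu-node-03"], vni=10_001)
```

The operator only names servers; the lookup against the cabling inventory produces the switch-level detail, which is the part that is slow and error-prone to do by hand.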
Finally, Saroyan discussed monitoring features, which automate the configuration of monitoring tools and provide network health checks, including link validation. The presentation also touched on InfiniBand networking, demonstrating Netris’s capability to manage InfiniBand fabrics and integrate them with Ethernet networks. The key takeaways were automated network tasks, simpler configuration through templates, and comprehensive monitoring, all contributing to a more efficient and manageable AI infrastructure environment.
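The summary does not describe how link validation is implemented. One plausible approach, sketched here in Python, is to compare the intended cabling with what neighbor discovery reports on each port. The `validate_links` function, the data layout, and the device names are all hypothetical, and the sketch assumes observed neighbor data (for example from LLDP) has already been collected.

```python
def validate_links(expected, observed):
    """Compare intended cabling with discovered neighbors.

    Both arguments map (device, port) -> (peer_device, peer_port).
    Returns a list of problem descriptions; an empty list means healthy.
    """
    problems = []
    for local, peer in sorted(expected.items()):
        seen = observed.get(local)
        if seen is None:
            problems.append(f"{local[0]}:{local[1]} has no neighbor (link down?)")
        elif seen != peer:
            problems.append(
                f"{local[0]}:{local[1]} miswired: expected "
                f"{peer[0]}:{peer[1]}, saw {seen[0]}:{seen[1]}"
            )
    return problems


# Hypothetical intended topology and discovery results.
expected = {
    ("leaf-1", "swp1"): ("gpu-node-01", "eth0"),
    ("leaf-1", "swp2"): ("gpu-node-02", "eth0"),
    ("leaf-1", "swp3"): ("gpu-node-03", "eth0"),
}
observed = {
    ("leaf-1", "swp1"): ("gpu-node-01", "eth0"),   # correct
    ("leaf-1", "swp2"): ("gpu-node-03", "eth0"),   # cables swapped
}
issues = validate_links(expected, observed)
```

Here `issues` would hold one miswiring report for `swp2` and one missing-neighbor report for `swp3`, while the correctly cabled `swp1` produces nothing.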
Personnel: Alex Saroyan