Tech Field Day

The Independent IT Influencer Event

  • Home
    • The Futurum Group
    • FAQ
    • Staff
  • Sponsors
    • Sponsor List
      • 2025 Sponsors
      • 2024 Sponsors
      • 2023 Sponsors
      • 2022 Sponsors
    • Sponsor Tech Field Day
    • Best of Tech Field Day
    • Results and Metrics
    • Preparing Your Presentation
      • Complete Presentation Guide
      • A Classic Tech Field Day Agenda
      • Field Day Room Setup
      • Presenting to Engineers
  • Delegates
    • Delegate List
      • 2025 Delegates
      • 2024 Delegates
      • 2023 Delegates
      • 2022 Delegates
      • 2021 Delegates
      • 2020 Delegates
      • 2019 Delegates
      • 2018 Delegates
    • Become a Field Day Delegate
    • What Delegates Should Know
  • Events
    • All Events
      • Upcoming
      • Past
    • Field Day
    • Field Day Extra
    • Field Day Exclusive
    • Field Day Experience
    • Field Day Live
    • Field Day Showcase
  • Topics
    • Tech Field Day
    • Cloud Field Day
    • Mobility Field Day
    • Networking Field Day
    • Security Field Day
    • Storage Field Day
  • News
    • Coverage
    • Event News
    • Podcast
  • When autocomplete results are available use up and down arrows to review and enter to go to the desired page. Touch device users, explore by touch or with swipe gestures.
You are here: Home / Videos / Google Kubernetes Engine – The Container Platform for AI at Scale from Google Cloud

Google Kubernetes Engine – The Container Platform for AI at Scale from Google Cloud



Cloud Field Day 20


This video is part of the appearance, “Google Cloud Presents at Cloud Field Day 20“. It was recorded as part of Cloud Field Day 20 at 16:00-17:00 on June 13, 2024.


Watch on YouTube
Watch on Vimeo

Brandon Royal, a Product Manager at Google Cloud, describes how Kubernetes can be leveraged for AI applications, particularly focusing on model training and serving. He begins by emphasizing the growing importance of generative AI across many organizations, highlighting that Google Kubernetes Engine (GKE) provides a robust platform for integrating AI into products and services. The platform is designed to handle the increasing complexity and scale of AI models, which demand high efficiency and cost-effectiveness. Royal mentions that GKE, often referred to as the operating system of Google’s AI hypercomputer, orchestrates workloads across storage, compute, and networking to deliver optimal price performance.

Royal addresses the challenges of scaling AI workloads, noting that model sizes are growing and pushing the limits of infrastructure. To tackle these challenges, GKE offers several optimizations, such as dynamic workload scheduling and container preloading, which enhance the efficiency and utilization of AI resources like CPUs, GPUs, and TPUs. He introduces the concept of “good put,” a metric for measuring machine learning productivity, which includes scheduling good put, runtime good put, and program good put. These metrics help ensure that resources are utilized effectively, minimizing idle time and maximizing forward progress in model training. Royal also highlights the importance of leveraging open-source frameworks like Ray and Kubeflow, which integrate seamlessly with GKE to provide a comprehensive AI development and deployment environment.

The presentation includes a demo showcasing the optimization capabilities of GKE. Royal demonstrates how container preloading and persistent volume claims can significantly reduce the time required to deploy AI models. By preloading container images and sharing model weights across instances, GKE can cut down deployment times from several minutes to mere seconds. This optimization is crucial for large-scale AI deployments, where efficiency and speed are paramount. Royal concludes by encouraging the audience to explore the resources and tutorials available for building AI platforms on GKE, emphasizing that these optimizations can provide a competitive edge in the fast-evolving field of AI.

Personnel: Brandon Royal


  • Bluesky
  • LinkedIn
  • Mastodon
  • RSS
  • Twitter
  • YouTube

Event Calendar

  • Jun 10-Jun 11 — Tech Field Day Extra at Cisco Live US 2025
  • Jul 9-Jul 10 — Networking Field Day 38
  • Aug 19-Aug 20 — Tech Field Day Extra at SHARE Cleveland 2025
  • Sep 10-Sep 11 — AI Infrastructure Field Day 3
  • Sep 24-Sep 25 — Security Field Day 14
  • Oct 22-Oct 23 — Cloud Field Day 24
  • Oct 29-Oct 30 — AI Field Day 7

Latest Coverage

  • Codiac Refreshes Container Management Across SDLC: Some Key Features on the Platform
  • HPE Greenlake Paves a Path to a Hybrid SAP ERP Future
  • Scality Showcases Real-World Resilient Cloud Storage Solutions
  • OpsRamp Shows Unified IT Operations for Fast Incident Management
  • VIAVI: PCAPs and LLMs

Tech Field Day News

  • Have A Classy Time with Tech Field Day Extra at Cisco Live US 2025
  • Exploring Cloud Resilience, AI, and Data at Cloud Field Day 23

Return to top of page

Copyright © 2025 · Genesis Framework · WordPress · Log in