Tech Field Day

The Independent IT Influencer Event

  • Home
    • The Futurum Group
    • FAQ
    • Staff
  • Sponsors
    • Sponsor List
      • 2025 Sponsors
      • 2024 Sponsors
      • 2023 Sponsors
      • 2022 Sponsors
    • Sponsor Tech Field Day
    • Best of Tech Field Day
    • Results and Metrics
    • Preparing Your Presentation
      • Complete Presentation Guide
      • A Classic Tech Field Day Agenda
      • Field Day Room Setup
      • Presenting to Engineers
  • Delegates
    • Delegate List
      • 2025 Delegates
      • 2024 Delegates
      • 2023 Delegates
      • 2022 Delegates
      • 2021 Delegates
      • 2020 Delegates
      • 2019 Delegates
      • 2018 Delegates
    • Become a Field Day Delegate
    • What Delegates Should Know
  • Events
    • All Events
      • Upcoming
      • Past
    • Field Day
    • Field Day Extra
    • Field Day Exclusive
    • Field Day Experience
    • Field Day Live
    • Field Day Showcase
  • Topics
    • Tech Field Day
    • Cloud Field Day
    • Mobility Field Day
    • Networking Field Day
    • Security Field Day
    • Storage Field Day
  • About Tech Field Day
    • Coverage
    • Podcast
    • Bluesky
  • When autocomplete results are available use up and down arrows to review and enter to go to the desired page. Touch device users, explore by touch or with swipe gestures.
You are here: Home / Videos / Google Kubernetes Engine – The Container Platform for AI at Scale from Google Cloud

Google Kubernetes Engine – The Container Platform for AI at Scale from Google Cloud



Cloud Field Day 20


This video is part of the appearance, “Google Cloud Presents at Cloud Field Day 20“. It was recorded as part of Cloud Field Day 20 at 16:00-17:00 on June 13, 2024.


Watch on YouTube
Watch on Vimeo

Brandon Royal, a Product Manager at Google Cloud, describes how Kubernetes can be leveraged for AI applications, particularly focusing on model training and serving. He begins by emphasizing the growing importance of generative AI across many organizations, highlighting that Google Kubernetes Engine (GKE) provides a robust platform for integrating AI into products and services. The platform is designed to handle the increasing complexity and scale of AI models, which demand high efficiency and cost-effectiveness. Royal mentions that GKE, often referred to as the operating system of Google’s AI hypercomputer, orchestrates workloads across storage, compute, and networking to deliver optimal price performance.

Royal addresses the challenges of scaling AI workloads, noting that model sizes are growing and pushing the limits of infrastructure. To tackle these challenges, GKE offers several optimizations, such as dynamic workload scheduling and container preloading, which enhance the efficiency and utilization of AI resources like CPUs, GPUs, and TPUs. He introduces the concept of “good put,” a metric for measuring machine learning productivity, which includes scheduling good put, runtime good put, and program good put. These metrics help ensure that resources are utilized effectively, minimizing idle time and maximizing forward progress in model training. Royal also highlights the importance of leveraging open-source frameworks like Ray and Kubeflow, which integrate seamlessly with GKE to provide a comprehensive AI development and deployment environment.

The presentation includes a demo showcasing the optimization capabilities of GKE. Royal demonstrates how container preloading and persistent volume claims can significantly reduce the time required to deploy AI models. By preloading container images and sharing model weights across instances, GKE can cut down deployment times from several minutes to mere seconds. This optimization is crucial for large-scale AI deployments, where efficiency and speed are paramount. Royal concludes by encouraging the audience to explore the resources and tutorials available for building AI platforms on GKE, emphasizing that these optimizations can provide a competitive edge in the fast-evolving field of AI.

Personnel: Brandon Royal


  • Bluesky
  • LinkedIn
  • Mastodon
  • RSS
  • Twitter
  • YouTube

Event Calendar

  • May 7-May 9 — Mobility Field Day 13
  • May 13-May 15 — Tech Field Day Experience at Qlik Connect 2025
  • May 28-May 29 — Security Field Day 13
  • Jun 4-Jun 5 — Cloud Field Day 23
  • Jun 10-Jun 11 — Tech Field Day Extra at Cisco Live US 2025
  • Jul 9-Jul 10 — Networking Field Day 38
  • Jul 16-Jul 17 — Edge Field Day 4
  • Jul 23-Jul 24 — AppDev Field Day 3

Latest Links

  • NB525: Cisco, IBM Recruit AI for Threat Response; HPE Air-Gaps Private Clouds
  • Key Takeaways from AI Infrastructure Field Day 2
  • Techstrong Gang – April 29, 2025
  • Google Cloud Builds on Storage Portfolio to Fuel AI Hypercomputer
  • Nutanix: Working on the Easy Button for AI

Return to top of page

Copyright © 2025 · Genesis Framework · WordPress · Log in