Videos

Podcast – Emerging Kubernetes tools for AI and optimizing GPU workloads

Discover how Kubernetes is evolving to support AI/ML workloads in this interview with John Platt, CTO at StormForge (now part of CloudBolt) on KubeFM’s podcast series.

This episode will cover:

  • Notable new Kubernetes tools, such as in-place pod resizing for smoother workload rightsizing, EKS Auto Mode to streamline node management, and NOS for simplifying GPU virtualization.
  • The growing adoption of Kubernetes as the preferred platform for AI/ML workloads, delivering substantial cost savings (up to 90%) over managed services like OpenAI, along with enhanced scalability and flexibility.
  • Key obstacles to running AI on Kubernetes, including managing GPU drivers, ensuring CUDA version compatibility, and accommodating very large models that may exceed tens of gigabytes.
     

Sign up for our newsletter

Exclusive insights and strategies for cloud pros. Delivered straight to your inbox.


AUTHOR
CloudBolt
  Learn more

Related Blogs

 
thumbnail
Ask the Experts: Navigating the Hypervisor Shakeup

Enterprise infrastructure strategy is being shaped by two forces at once: accelerated public-cloud investment and AI-driven capacity demand, paired with…

 
thumbnail
Kubernetes Resource Optimization with Humans in the Loop, with Ray Chen | KubeFM

Watch this Kube FM interview of Ray Chen, Head of SRE at Trumid, as he demonstrates how StormForge helps platform…

 
thumbnail
From Aria to agility: Modern hybrid cloud automation

If your automation strategy has been anchored in VMware Aria (vRA), the last two years have introduced real operational uncertainty.…