Videos

Podcast – Emerging Kubernetes tools for AI and optimizing GPU workloads

Discover how Kubernetes is evolving to support AI/ML workloads in this interview with John Platt, CTO at StormForge (now part of CloudBolt) on KubeFM’s podcast series.

This episode will cover:

  • Notable new Kubernetes tools, such as in-place pod resizing for smoother workload rightsizing, EKS Auto Mode to streamline node management, and NOS for simplifying GPU virtualization.
  • The growing adoption of Kubernetes as the preferred platform for AI/ML workloads, delivering substantial cost savings (up to 90%) over managed services like OpenAI, along with enhanced scalability and flexibility.
  • Key obstacles to running AI on Kubernetes, including managing GPU drivers, ensuring CUDA version compatibility, and accommodating very large models that may exceed tens of gigabytes.
     

The Cloud ROI Newsletter

Exclusive insights and strategies for cloud and FinOps pros. Delivered straight to your inbox.

Related Blogs

 
thumbnail
Automating Kubernetes rightsizing and container-level cost allocation

Platform teams have spent years squeezing more efficiency out of Kubernetes. The real pressure hits when your AWS bill rises…

 
thumbnail
How Acquia cut web node infrastructure by 65% with continuous Kubernetes rightsizing

Acquia modernized a platform that previously ran on roughly 26,000 EC2 nodes by moving to Kubernetes. The goal wasn’t just…

 
thumbnail
Kubernetes cost allocation demo

Watch this demo to see how CloudBolt finally brings bill-accurate visibility to Kubernetes spend. Instead of guessing with list prices…