18-19 June

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Thursday June 18, 2026 11:03am - 11:13am IST
Two years ago, every platform team was building developer platforms. Today the same teams, and the AI Cloud providers selling to them, face a harder question: how do you safely share GPU infrastructure across multiple AI teams without those teams stepping on each other?

Kubernetes is already the production platform for AI inference. Containers, autoscaling, multi-tenancy, and RBAC are all solved. The remaining problem is the GPUs themselves. Today Kubernetes hands them out in whole numbers: one pod, one GPU, even when the workload uses 10% of it.

This keynote walks through how the CNCF ecosystem, and Kubernetes itself, has answered. HAMi virtualizes one physical GPU into multiple slices today, each with its own memory budget. DRA evolves the platform's resource model so Kubernetes finally understands GPUs as rich devices instead of opaque numbers.

Then, we move to a live demo. There will be a MacBook on stage, connected directly to an NVIDIA DGX Spark. A single Blackwell GPU will run two open source LLMs for two teams, generating answers simultaneously. There are no slides and no recordings.

That's the AI factory. And yes, Kubernetes has solved it, with a little help from its friends.
Speakers
Saiyam Pathak

Head of Developer Relations, vCluster
Saiyam is Head of DevRel at vCluster. He is the founder of Kubesimplify, focusing on simplifying cloud-native and AI infrastructure. He is a KubeCon co-chair and has worked on many facets of Kubernetes, including machine learning platforms, scaling, multi-cloud, and managed Kubernetes...
Jasmine 2 (Level 3)
