18-19 June

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Friday June 19, 2026 12:40pm - 1:10pm IST
This talk tackles the global GPU shortage and a critical cloud-native reality in batch workload scheduling: "Kueue Quota reserved" does not equal "Capacity available". For production inference, this distinction determines whether workloads succeed or remain stuck indefinitely.

The session explores transcending regional silos using MultiKueue to manage a global federation of worker clusters. Moving beyond basic setup, the speakers address the "Quota vs. Datacenter Capacity" dilemma. They reveal a critical gap discovered through experiments: workloads becoming stranded in regions with exhausted capacity despite available quotas.

The speakers share their collaboration with the community to resolve these scheduling gaps (Issue #8089). This talk demonstrates how to configure Admission Checks and provisioning classes in MultiKueue to force the scheduler to "hunt" for actual capacity in worker clusters across regions rather than relying solely on user-provided quota.
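
The gap between reserved quota and available capacity can be illustrated with a toy model. The sketch below is not Kueue or MultiKueue code: the `WorkerCluster` type, its fields, and both placement functions are hypothetical names invented for illustration, and the capacity gate only mimics in spirit what an admission check is described as doing above.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class WorkerCluster:
    """Toy stand-in for a regional worker cluster (hypothetical, not a Kueue type)."""
    name: str
    quota_gpus: int  # GPUs the queue is allowed to consume in this region
    free_gpus: int   # GPUs the datacenter can actually provide right now


def pick_by_quota(clusters: list[WorkerCluster], gpus: int) -> Optional[str]:
    """Quota-only placement: return the first cluster whose quota covers the request."""
    for cluster in clusters:
        if cluster.quota_gpus >= gpus:
            return cluster.name
    return None


def pick_with_capacity_check(clusters: list[WorkerCluster], gpus: int) -> Optional[str]:
    """Placement that also requires real capacity before accepting a cluster."""
    for cluster in clusters:
        if cluster.quota_gpus >= gpus and cluster.free_gpus >= gpus:
            return cluster.name
    return None


# Assumed scenario: region-a has quota left but no physical GPUs available.
clusters = [
    WorkerCluster("region-a", quota_gpus=8, free_gpus=0),
    WorkerCluster("region-b", quota_gpus=8, free_gpus=8),
]

stranded_in = pick_by_quota(clusters, gpus=4)
placed_in = pick_with_capacity_check(clusters, gpus=4)
print(stranded_in, placed_in)
```

In this assumed scenario the quota-only policy selects region-a, where the workload would wait on hardware that does not exist, while the capacity-aware policy moves on to region-b.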
Speakers
Kishore Jagannath

Cloud Engineer, Google
Kishore Jagannath serves as a Cloud Solutions Engineer at Google, where he focuses on cloud infrastructure, large-scale Kubernetes orchestration and AI Platform Infrastructure. Over the past year, he has been architecting global compute planes to address GPU scarcity for production-critical...
Ram J A

Solutions Architect, Google
Ram J A is an engineer who enjoys learning new things and using technology to solve problems. Their work primarily focuses on Cloud computing and Kubernetes, with a recent interest in AI Infrastructure and building LLM-based agents. Outside of work, Ram spends time catching up on...
Jasmine 2 (Level 3)
  AI + ML
