Loading…
18-19 June
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Type: Maintainer Track clear filter
Friday, June 19
 

12:40pm IST

KServe in Production: Scaling Generative Inference the Cloud Native Way - Johnu George, Nutanix
Friday June 19, 2026 12:40pm - 1:10pm IST
KServe has evolved into a powerful, production-ready model serving platform for a wide range of machine learning and generative AI use cases. This Maintainer Track session will provide a comprehensive overview of how KServe supports everything from deploying your first inference service to running advanced, large-scale generative inference workloads in production.

We will highlight the latest feature enhancements, including distributed inference support, KV cache optimized inference, token-based rate limiting, and integration with external model providers. The session will also cover architectural considerations, scaling strategies, and operational best practices for real-world deployments.

Finally, we will share the roadmap for generative inference in KServe and discuss how the community is shaping the future of scalable, efficient, and cloud-native AI serving.
Speakers
avatar for Johnu George

Johnu George

Technical Director, Nutanix
Johnu George is a Technical Director at Nutanix, where he leads the AI Systems team. He has driven several industry collaborations across projects such as Kubeflow, KServe, and Knative. His research interests include machine learning systems design, distributed learning infrastructure... Read More →
Friday June 19, 2026 12:40pm - 1:10pm IST
204 (Level 2)
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Content Experience Level
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.