Loading…
18-19 June
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Friday June 19, 2026 12:00pm - 12:30pm IST
The surge of Generative AI and large language models (LLMs) is introducing new operational challenges for Kubernetes platforms. Unlike traditional APIs, GenAI traffic is highly variable, token-driven, and cost-sensitive, requiring new approaches to routing, security, and observability.

This session, presented by the project maintainers, examines Envoy AI Gateway, a CNCF-hosted open source project, and its role as a Kubernetes-native control plane for GenAI workloads.. We’ll break down an end-to-end architecture that uses Envoy AI Gateway to manage and govern traffic across multiple LLM backends both self-hosted and cloud-based while enforcing policies such as token-aware rate limiting, authentication, and dynamic model selection.

Attendees will leave with practical insights into designing resilient, scalable GenAI platforms on Kubernetes, and an understanding of how AI-aware gateways fit into modern cloud-native infrastructure.
Speakers
avatar for Gavrish Prabhu

Gavrish Prabhu

Technical Lead, Nutanix
Gavrish Prabhu is a Founding ML Engineer on the Nutanix Enterprise AI team with a background in distributed systems. He is active in open-source projects and is a maintainer of KServe and Envoy AI Gateway Projects. His key interests are systems involving the next generation of AI... Read More →
Friday June 19, 2026 12:00pm - 12:30pm IST
Lotus 1 (Level 3)
  AI + ML

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link