18-19 June

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Friday, June 19
 

12:00pm IST

When LLMs Hit Production: Why You Need an AI Gateway - Gavrish Prabhu, Nutanix
Friday June 19, 2026 12:00pm - 12:30pm IST
The surge of Generative AI and large language models (LLMs) is introducing new operational challenges for Kubernetes platforms. Unlike traditional APIs, GenAI traffic is highly variable, token-driven, and cost-sensitive, requiring new approaches to routing, security, and observability.

This session, presented by the project maintainers, examines Envoy AI Gateway, a CNCF-hosted open source project, and its role as a Kubernetes-native control plane for GenAI workloads. We’ll break down an end-to-end architecture that uses Envoy AI Gateway to manage and govern traffic across multiple LLM backends, both self-hosted and cloud-based, while enforcing policies such as token-aware rate limiting, authentication, and dynamic model selection.

Attendees will leave with practical insights into designing resilient, scalable GenAI platforms on Kubernetes, and an understanding of how AI-aware gateways fit into modern cloud-native infrastructure.
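As a rough illustration of the token-aware rate limiting mentioned in the abstract, the sketch below budgets LLM tokens rather than request counts. It is a generic token-bucket example, not Envoy AI Gateway's implementation; the `TokenBudget` class, its field names, and the numbers are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TokenBudget:
    """Hypothetical per-client budget measured in LLM tokens, not requests."""
    capacity: int            # maximum tokens the client may hold at once
    refill_per_sec: float    # tokens restored per second
    available: float = 0.0   # current balance (set to capacity on creation)
    last: float = 0.0        # timestamp of the previous decision

    def __post_init__(self) -> None:
        self.available = float(self.capacity)

    def allow(self, cost_tokens: int, now: float) -> bool:
        """Refill for the elapsed time, then admit the request if it fits."""
        elapsed = max(0.0, now - self.last)
        self.available = min(self.capacity,
                             self.available + elapsed * self.refill_per_sec)
        self.last = now
        if cost_tokens <= self.available:
            self.available -= cost_tokens
            return True
        return False


budget = TokenBudget(capacity=1000, refill_per_sec=100)
print(budget.allow(800, now=0.0))  # a large prompt fits in the full budget
print(budget.allow(300, now=0.0))  # a second request exceeds what is left
print(budget.allow(300, now=1.0))  # one second of refill makes room again
```

Passing `now` explicitly keeps the sketch deterministic; a real gateway would typically charge the budget with token counts reported by the model backend.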
Speakers
Gavrish Prabhu

Technical Lead, Nutanix
Gavrish Prabhu is a Founding ML Engineer on the Nutanix Enterprise AI team with a background in distributed systems. He is active in open-source projects and is a maintainer of the KServe and Envoy AI Gateway projects. His key interests are systems involving the next generation of AI...
Lotus 1 (Level 3)
  AI + ML

12:00pm IST

Inference in Progress… Please Monitor Responsibly - Gaurav Sharma, NVIDIA
Friday June 19, 2026 12:00pm - 12:30pm IST
Running GPU inference on Kubernetes is no longer exotic; it’s becoming the default for modern AI workloads. But while teams obsess over model latency and throughput, the real problems usually hide deeper: GPU under-utilization, memory fragmentation, node-level contention, noisy neighbours, and observability gaps that make debugging feel like guesswork.
In this talk, we’ll walk through a practical, field-tested monitoring approach for GPU inference workloads on Kubernetes. Attendees will learn how to instrument GPU nodes, collect and correlate GPU-specific metrics, build alerting around inference SLOs, and detect performance regressions before they disrupt production. We’ll also cover common anti-patterns and what “good” looks like for GPU observability in 2025.
If you're running (or planning to run) GPU inference at scale, this session will help you monitor responsibly — and keep your cluster healthy, efficient, and fast.
Speakers
Gaurav Sharma

Senior Site Reliability Engineer, Nvidia
Currently working as a Senior SRE for NVIDIA AI.
In the past, I have been part of SRE teams for NVIDIA cloud gaming, Microsoft Azure Reliability, Adobe Analytics, and VMware Cloud Services.
Lotus 3 (Level 3)
  Observability

2:30pm IST

Service Networking Within Air-Gapped Environments: Deployment Strategies and Operational Management - Anirban Nandi, Google
Friday June 19, 2026 2:30pm - 3:00pm IST
During the development of an air-gapped cloud infrastructure, Google engineering teams encountered a major challenge: the heavy operational burden on each service owner to manage common networking functions like load balancing, authorization, mTLS, rate limiting, etc. This was further complicated by the prevalent use of open-source software stacks with limited customisations for such use cases. Consequently, a collective decision was made to implement an Istio service mesh and in-cluster gateways to abstract these operations. However, operating a mesh in an air-gapped environment introduces unique technical and logistical difficulties.

This talk presents methodologies for reliably scaling meshes and gateways, both zonally and globally, across diverse bare-metal and KubeVirt clusters spanning multiple networks. It does so by integrating with established OSS technologies such as MetalLB and Cilium ClusterMesh, along with tools such as OpenTelemetry, Fluent Bit, and Grafana for robust operational transparency.
Speakers
Anirban Nandi

Software Engineer, Google
Anirban is a Software Engineer at Google who has been involved in the Kubernetes ecosystem for the past 3 years, primarily working on Kubernetes networking technologies such as Istio, Envoy, xDS, and Gateway API.
Lotus 2 (Level 3)
  Connectivity

2:30pm IST

When the Edge Can’t Afford a Third Node: A Storage Solution for Two-Node Kubernetes Cluster - Parth Arora, IBM
Friday June 19, 2026 2:30pm - 3:00pm IST
Edge Kubernetes clusters run under strict constraints—limited hardware, power, and space—where deploying three nodes for storage quorum is often unrealistic. Still, these environments must support stateful workloads and recover safely from failures.

This session focuses on Rook-Ceph as a storage solution for two-node Kubernetes clusters at the edge. It explains why Ceph traditionally expects three nodes, what breaks in minimal deployments, and how recent design patterns make two-node setups practical when combined with fencing.

Attendees will learn how Rook-Ceph behaves in two-node clusters, how floating Ceph MONs and DRBD-backed monitor state help preserve quorum, and why fencing is critical to avoid split-brain during node crashes, power loss, or network partitions.
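The three-node expectation comes down to majority quorum: Ceph monitors must agree by strict majority, so the arithmetic below shows why two voters tolerate no failures while three tolerate one. This is a minimal sketch of that arithmetic only; the function names are hypothetical, and it does not model the floating MON or fencing techniques the session covers.

```python
def quorum_size(members: int) -> int:
    """Smallest strict majority among `members` voters."""
    return members // 2 + 1


def tolerated_failures(members: int) -> int:
    """How many voters can be lost while a majority remains."""
    return members - quorum_size(members)


for n in (2, 3):
    print(f"{n} monitors: quorum={quorum_size(n)}, "
          f"tolerated failures={tolerated_failures(n)}")
```

With two monitors, losing either one loses the majority, which is why a two-node design needs some way to preserve or relocate monitor state.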
Speakers
Parth Arora

Software Engineer, IBM
I am Parth Arora, a software developer specializing in distributed storage systems.
As a maintainer and developer, I have extensive experience working with Rook, a cloud-native storage orchestrator that enables storage systems like Ceph to be run as a service on Kubernetes. I have contributed to the development of Rook and have worked on projects involving Ceph...
Lotus 3 (Level 3)
  Data Processing + Storage

3:10pm IST

From Textbooks To YAML: Mapping Your CS Degree To the Cloud Native World - Parvathy Nair, Adi Shankara Institute of Engineering and Technology & Sudhish Nair, Red Hat
Friday June 19, 2026 3:10pm - 3:40pm IST
Students graduate with strong OS and Networking knowledge but often feel lost facing the "Wall of YAML" in Cloud Native. They understand "Processes" and "TCP/IP" but struggle to connect them to "Pods" and "Service Meshes."

This session features a university professor and an industry expert strictly mapping the standard Computer Science curriculum to Kubernetes primitives. We will translate three core domains live on stage:

Compute: Moving from OS-level fork()/exec() to Containers, and mapping Process Groups to Pods.

Networking: Translating the OSI Model and physical switching to CNIs, Services, and Ingress.

Storage: Mapping Block Devices and File Systems to Persistent Volumes (PVs), Claims (PVCs), and CSI.

Attendees will leave with a mental bridge connecting their academic foundation to modern cloud architecture, realizing they already possess the core knowledge required—they just need to learn the new vocabulary.
Speakers
Parvathy Nair

Assistant Professor, Adi Shankara Institute of Engineering and Technology
Research Scholar and Assistant Professor with 8+ years of experience in Python, C/C++, and R. Specializes in Theory of Computation, Machine Learning, Cloud Computing and Deep Learning. Committed to leveraging academic trends to enhance education and mentor students toward higher-order...

Sudhish Nair

Senior Software Maintenance Engineer, Red Hat India Pvt. Ltd.
Senior Software Maintenance Engineer at Red Hat with 12+ years of experience in cloud-native development and support. Specializing in OpenShift Pipelines (Tekton), GitOps (ArgoCD), Red Hat Developer Hub (Backstage), Serverless logic, OpenShift, Kubernetes, Quarkus & Java. I'm also...
Lotus 1 (Level 3)
  Cloud Native Novice

4:10pm IST

So You Want To Run AI Agents on Kubernetes: A 101 Guide - Rajas Kakodkar, Broadcom
Friday June 19, 2026 4:10pm - 4:40pm IST
Your platform team is trying to integrate AI workloads, your developers want to deploy agents, and your leadership expects 10x efficiency with AI. But when you dig into the specifics, the terminology becomes a maze: AI Agents, MCP Servers, and Dynamic Resource Allocation (DRA). Where do you even start?

This session is centered around what problem each of these solves. In this hands-on 101 guide, I will start by demystifying how AI agents operate on Kubernetes: how they interact with workloads, the role of MCP servers in enabling multi-agent coordination, and the fundamentals of DRA for intelligent resource management. Then, I will bring it all together with a live demo showing how AI agents can dynamically tune DRA drivers to optimize scheduling and resource usage in real time. Whether you’re an engineer, researcher, or just Kubernetes-curious, this session will equip you with the foundational knowledge and confidence to start experimenting with agentic and adaptive systems on Kubernetes.
Speakers
Rajas Kakodkar

Software Engineer, Broadcom
Rajas is a staff software engineer at Broadcom, where he focuses on low-level functions of Kubernetes nodes. He is a tech lead of the CNCF Technical Advisory Group, Workload Foundation and a Kubernetes contributor. He has been co-chairing Cloud Native AI Day, a co-located event at...
Jasmine 2 (Level 3)
  AI + ML

4:10pm IST

The Leapfrog Upgrade Playbook: Upgrading When You’re Years Behind - Yug Gupta, Walmart Global Tech
Friday June 19, 2026 4:10pm - 4:40pm IST
This is the story of how Walmart leapfrogged 11 Kubernetes releases without customer downtime.

If that sounds impossible, wait until you see the constraints: millions of k8s resources, an etcd database nearing 7GB, and a version gap so wide that “just upgrade” wasn’t a plan. We needed a method that was safe across API removals, predictable under extreme object counts, and repeatable enough to become a standard operating procedure.

Attendees will learn: (1) how to preflight deprecated APIs and CRDs using upstream guidance, (2) how to design restore ordering and “selective restore” boundaries to avoid bringing back broken objects, (3) how to tune API server concurrency so backup/restore doesn’t become a self-inflicted outage, and (4) how to validate correctness and plan rollbacks.

This is a field guide for anyone who’s fallen far behind and needs a safe way forward without betting production on a one-shot upgrade.

No secret sauce, just the hard-earned physics of Kubernetes at scale.
Speakers
Yug Gupta

Staff Software Engineer, Walmart Global Tech
Staff Software Engineer @ Walmart Global Tech India
An open source enthusiast, contributing to the Kubernetes ecosystem.
Lotus 3 (Level 3)
  Operations + Performance

4:50pm IST

LLMs Behind Bars: Sandboxes at Scale for AI on a Short Leash - Prashanth Pai, CodeRabbit
Friday June 19, 2026 4:50pm - 5:20pm IST
LLMs can write code - and sometimes running that code is the most direct way to deliver product value. The moment you do, you’ve effectively introduced a remote-code-execution surface: the code is untrusted by default, but the system still has to execute it to stay useful.

In this talk, we’ll share what it took to build and operate production sandboxes for LLM-generated code at scale. We’ll cover the isolation model (containers, least-privilege defaults, syscall/filesystem restrictions), the operational reality (startup latency, resource limits, cold starts, observability), and the guardrails that matter when code or users try to misbehave. We’ll also dig into data protection: locking down egress, blocking exfiltration paths, and keeping secrets out of reach.

We’ll cover what worked, what failed, and what we’d do differently - ending with a practical, vendor-agnostic mental model and checklist you can apply.
Speakers
Prashanth Pai

Principal Engineer, CodeRabbit
Prashanth Pai is a Principal Engineer at CodeRabbit, where he builds the infrastructure that powers safe, reliable execution for AI products in production.

He started his career at Red Hat and has been passionate about open source ever since.
205 (Level 2)
  AI + ML
 