Loading…
18-19 June
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon India 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in India Standard Time (UTC+5:30)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Thursday June 18, 2026 4:30pm - 4:50pm IST
Your AI coding bill is going to surprise you. Not because the models stopped working - because they're working too well, and you're running everything through the most expensive ones by default.

There's a 12× per-token cost gap between frontier and OSS models. On real coding benchmarks, the gap in output quality is almost nothing. The difference is the harness - the layer most teams never touch.

This talk is about that layer: how automatic task routing cuts spend without touching quality, why the harness matters more than the model you pick, and what the infrastructure underneath actually looks like (vLLM, prefix caching, Kubernetes-native GPU orchestration).

Based on Kimchi, the open-source coding agent we built after our own Anthropic bill went vertical. Practical, benchmarked, runs today.

**In order to facilitate networking and business relationships at the event, you may choose to visit a third party’s booth or access sponsored content. You are never required to visit third party booths or to access sponsored content. When visiting a booth or participating in sponsored activities, the third party will receive some of your registration data. This data includes your first name, last name, title, company, address, email, standard demographics questions (i.e. job function, industry), and details about the sponsored content or resources you interacted with. If you choose to interact with a booth or access sponsored content, you are explicitly consenting to receipt and use of such data by the third-party recipients, which will be subject to their own privacy policies.**
Speakers
avatar for Matas Kaminskas

Matas Kaminskas

Senior Engineering Manager, Cast AI
Matas is a Senior Engineering Manager at CAST AI, leading Kimchi - an open-source AI coding agent and inference platform built on open-source models. He has spent 10+ years building distributed systems and helping companies cut Kubernetes infrastructure costs at scale.
Thursday June 18, 2026 4:30pm - 4:50pm IST
205 (Level 2)

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link