Open Source Summit + Embedded Linux Conference North America...
May 18-20, 2026
Minneapolis, MN
Note: The schedule is subject to change.

The Sched app lets you build your schedule but is not a substitute for event registration. You must be registered for Open Source Summit North America 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Central Daylight Time (UTC -5). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.


Monday May 18, 2026 2:25pm - 3:05pm CDT
If a production incident hits on your first day, can you debug it? Or if you are a senior engineer, do you find it impossible to download your years of debugging intuition into a new hire’s head?
Kubernetes troubleshooting often depends on undocumented decision paths: where to look first, which signals to trust, and how to turn a sea of logs into a testable hypothesis.

In this talk, we introduce KUBE-RCA, an open-source incident assistant that plugs into your preferred external LLM and provides real-time cluster evidence (metrics, logs, events) as structured context. Using a ReAct loop, the agent proposes hypotheses, runs an allowlisted set of read-only commands/queries, ties each claim back to evidence, and publishes a concise RCA draft directly to your team channel.
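
The guardrailed loop described above can be illustrated with a minimal sketch. This is not KUBE-RCA's actual code: the allowlist contents and the names `is_allowed`, `react_loop`, `Step`, and `Evidence` are assumptions made for illustration, and the LLM (`propose`) and command runner (`run`) are passed in as plain callables so the sketch runs without a cluster.

```python
import shlex
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Assumption: a small set of read-only kubectl verbs. The abstract does not
# list the real allowlist, so this set is purely illustrative.
READ_ONLY_VERBS = {"get", "describe", "logs", "top", "events"}
SHELL_METACHARS = ";|&$`><\n"


def is_allowed(command: str) -> bool:
    """Accept only `kubectl <read-only verb> ...` with no shell metacharacters."""
    if any(ch in command for ch in SHELL_METACHARS):
        return False
    try:
        parts = shlex.split(command)
    except ValueError:  # e.g. unbalanced quotes
        return False
    return len(parts) >= 2 and parts[0] == "kubectl" and parts[1] in READ_ONLY_VERBS


@dataclass
class Step:
    hypothesis: str          # the agent's current best explanation
    command: Optional[str]   # next command to run, or None when finished


@dataclass
class Evidence:
    command: str
    output: str


def react_loop(
    propose: Callable[[List[Evidence]], Step],  # hypothetical stand-in for the LLM call
    run: Callable[[str], str],                  # hypothetical stand-in for the executor
    max_steps: int = 5,
) -> Tuple[str, List[Evidence]]:
    """Alternate between proposing a step and gathering evidence for it."""
    evidence: List[Evidence] = []
    hypothesis = ""
    for _ in range(max_steps):
        step = propose(evidence)
        hypothesis = step.hypothesis
        if step.command is None:
            break
        if not is_allowed(step.command):
            # Record the rejection so the next proposal can see it; never execute.
            evidence.append(Evidence(step.command, "REJECTED: not on the read-only allowlist"))
            continue
        evidence.append(Evidence(step.command, run(step.command)))
    return hypothesis, evidence
```

Returning the evidence list alongside the final hypothesis is one way to tie each claim in an RCA draft back to the command output that supports it. Rejected commands are logged instead of silently dropped, so the record shows what the agent attempted as well as what it ran.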

We’ll share the design decisions behind our guardrailed execution loop and how we encode SRE intuition into prompts and checks. You’ll walk away with an understanding of how to make incident response more systematic, so that engineers of any tenure can resolve issues faster, with fewer interruptions to senior engineers.
Speakers

Bohyun Choi

NVIDIA SW Engineer, UCLIX
Bohyun Choi builds Kubernetes platforms for GPU/AI workloads and NVIDIA orchestration. She architects and operates scalable GPU clusters on Kubernetes and focuses on production reliability and incident response.

She holds four CNCF Kubernetes certifications and is developing kube-rca, an open-source, guardrailed LLM-assisted tool that produces evidence-backed incident triage and RCA drafts from live cluster signals...

Woobin Hwang

DevOps Engineer, NEOWIZ Partners
DevOps for Web3 (Blockchain Validator Node Operator | DeFi Infra Operator)

"Engineering mission-critical validator nodes in zero-trust environments. Designing and operating 24/7 high-availability infrastructure for global DeFi protocols within the Web3 ecosystem."

TaeJi Kim

DevSecOps Engineer, Bungaejangter Inc.
DevSecOps Engineer at Bungaejangter Inc. and team lead of KUBE-RCA, an open-source Kubernetes incident assistant that pairs ReAct-based LLM agents with real-time cluster evidence for automated root cause analysis. Leads the project's architecture and guardrailed execution design to...
200F (Level Two)
  Cloud + Orchestration
