Loading…
Open Source Summit + Embedded Linux Conference North America...
May 18-20, 2026
Minneapolis, MN
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Central DaylightTime (UTC -5). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.


Venue: 200F (Level Two) clear filter
arrow_back View All Dates
Monday, May 18
 

11:20am CDT

QoD-Centric NaaS Strategy: Policy-Orchestrated Multi-Access Service - Daniel Kibler, EIS Visual & Niem Dang, NHD Consulting LLC
Monday May 18, 2026 11:20am - 12:00pm CDT
Delivering predictable, high‑quality network services in dense, multi‑access edge environments remains a central challenge for operators pursuing a Network‑as‑a‑Service (NaaS) strategy, where programmable APIs expose network capabilities as on‑demand services. Quality‑on‑Demand (QoD) APIs act as the intent interface in this model, enabling applications to request session‑level performance characteristics. QoD requires sophisticated new network control‑plane orchestration to address heterogeneous enforcement, multi‑access behavior, and session continuity issues across Wi‑Fi, private 5G, and public 5G domains.

QoD refers specifically to the CAMARA Project’s implementation, which provides standardized APIs for dynamic multi‑access orchestration, session‑level QoS enforcement, and integration with 3GPP control‑plane functions, including PCF, SMF, NEF, and NWDAF together with UE‑side ATSSS for traffic steering, switching, and splitting.

In this session, we will overview the strategic importance of QoD APIs, the global scale of the emerging NaaS domain, and detail Open Source technologies that are foundational to the industry.
Speakers
avatar for Daniel Kibler

Daniel Kibler

Senior Systems Engineer and Founder, EIS Visual
Principal-level engineer and founder of EIS Visual, Daniel designs and operates large-scale distributed platforms across communications, 5G, edge networks, and high-performance compute. He bridges architecture, execution, and operations to deliver measurable business impact. A former... Read More →
avatar for Niem Dang

Niem Dang

Founder & Principal Consultant, NHD Consulting LLC
Industry-recognized technology and thought leader with 20+ years of ground-breaking patents and accomplishments in the cable industry. Passion for delivering challenging projects through mastery of planning, strategy, technology enablement, and innovation.
Monday May 18, 2026 11:20am - 12:00pm CDT
200F (Level Two)
  Cloud + Orchestration

1:30pm CDT

OpenBao: Horizontally Scaling Secrets Management - Alexander Scheel, ControlPlane
Monday May 18, 2026 1:30pm - 2:10pm CDT
OpenBao is an OpenSSF project and a fork of HashiCorp Vault. It is an open-source secrets manager with support for static and dynamic secrets including identities and certificates.

The OpenBao community recently landed support for horizontal scalability, formerly a Vault Enterprise exclusive feature, in partnership between multiple organizations in the community.

This session will outline the design and development process of the feature, showcase how horizontal scalability improves performance on Kubernetes, and highlight future improvements the community is considering to offer write scalability.
Speakers
avatar for Alexander Scheel

Alexander Scheel

Head of OpenBao Development, ControlPlane
Alex Scheel is the Head of OpenBao Development at ControlPlane and a member of the OpenBao Technical Steering Committee. He has built a career on open source contributions, previously working at GitLab with OpenBao, at Keyfactor on Bouncy Castle, at HashiCorp on Vault, and at Canonical... Read More →
Monday May 18, 2026 1:30pm - 2:10pm CDT
200F (Level Two)
  Cloud + Orchestration

2:25pm CDT

Troubleshooting Like a Senior on Day 1: ReAct Agents With Real-Time Cluster Evidence - Bohyun Choi, UCLIX; Woobin Hwang, NEOWIZ Partners; TaeJi Kim, Bungaejangter Inc
Monday May 18, 2026 2:25pm - 3:05pm CDT
If a production incident hits on your first day, can you debug it? Or if you are a senior engineer, do you find it impossible to download your years of debugging intuition into a new hire’s head?
Kubernetes troubleshooting often depends on undocumented decision paths: where to look first, which signals to trust, and how to turn a sea of logs into a testable hypothesis.

In this talk, we introduce KUBE-RCA, an open-source incident assistant that plugs into your preferred external LLM and provides real-time cluster evidence (metrics, logs, events) as structured context. Using a ReAct loop, the agent proposes hypotheses, runs an allowlisted set of read-only commands/queries, ties each claim back to evidence, and publishes a concise RCA draft directly to your team channel.

We’ll share the design decisions behind our guardrailed execution loop and how we encode SRE intuition into prompts and checks. You’ll walk away with an understanding of how to make incident response more systematic. So engineers of any tenure can resolve issues faster, with less senior interruption.
Speakers
avatar for Bohyun Choi

Bohyun Choi

NVIDIA SW Engineer, UCLIX
Bohyun Choi builds Kubernetes platforms for GPU/AI workloads and NVIDIA orchestration. She architects and operates scalable GPU clusters on Kubernetes and focuses on production reliability and incident response.

She holds four CNCF Kubernetes certifications and is developing kube-rca, an open-source, guardrailed LLM-assisted tool that produces evidence-backed incident triage and RCA drafts from live cluster signals... Read More →
avatar for Woobin Hwang

Woobin Hwang

DevOps Engineer, NEOWIZ Partners
​DevOps for Web3 (Blockchain Validator Node Operator | DeFi Infra Operator)

​"Engineering mission-critical validator node in zero-trust environments. Designing and operating 24/7 high-availability infrastructure for global DeFi protocols within the Web3 ecosystem."
avatar for TaeJi Kim

TaeJi Kim

DevSecOps Engineer, Bungaejangter Inc.
DevSecOps Engineer at Bungaejangter Inc. and team lead of KUBE-RCA, an open-source Kubernetes incident assistant that pairs ReAct-based LLM agents with real-time cluster evidence for automated root cause analysis. Leads the project's architecture and guardrailed execution design to... Read More →
Monday May 18, 2026 2:25pm - 3:05pm CDT
200F (Level Two)
  Cloud + Orchestration

3:35pm CDT

BoF: DRA for AI Workloads: Where Does the Spec Need To Go Next? - Yahav Biran, Amazon
Monday May 18, 2026 3:35pm - 4:20pm CDT
Kubernetes Dynamic Resource Allocation is now being adopted by hardware vendors implementing drivers for accelerators and high-speed networking. But as real-world AI workloads hit the spec—disaggregated inference across multiple nodes, topology-aware co-location of compute and network devices, per-workload hardware configuration—gaps are emerging. This Birds of a Feather session invites platform engineers, infrastructure teams, and Kubernetes contributors to compare notes on where DRA works well and where the spec leaves teams filling in the blanks themselves. We will use the AWS Trainium Neuron DRA driver as a concrete reference point to ground the discussion, then open the floor: What attributes should be standardized across vendors? How should multi-device atomic allocation be handled? What does the community need to build so that DRA delivers on its promise across all accelerator hardware?
Speakers
avatar for Yahav Biran

Yahav Biran

Principal Architect, Amazon
Yahav Biran is a Principal Architect at AWS, focusing on large-scale AI workloads. He contributes to open-source projects and publishes in AWS blogs and academic journals, including the AWS compute and AI blogs and the Journal of Systems Engineering. He frequently delivers technical... Read More →
Monday May 18, 2026 3:35pm - 4:20pm CDT
200F (Level Two)
  Cloud + Orchestration

4:30pm CDT

Why Is It Always DNS?: Rethinking & Engineering Node-Level DNS Resolution in Kubernetes - Shaheen Sayyed & Ankur Singh, Red Hat
Monday May 18, 2026 4:30pm - 5:10pm CDT
Is it DNS or is it network? & why is answer always DNS? In K8s, most clusters quietly rely on /etc/resolv.conf on every node for all non-service name resolution. At scale, this dependency becomes a liability, reducing caching & observability while causing fragile forwarding, higher latency, upstream resolver overload, and host-level divergence in multi-cloud environments

This talk will explain how K8s actually consumes node DNS today & then present two solutions: CoreDNS for large, complex clusters; dnsmasq for medium scale that bring deterministic caching, controlled forwarding, traffic steering, and real visibility—turning DNS from black box into an engineered system component

Attendees will gain a framework to choose DNS implementation strategies by understanding trade-offs based on cluster scale, workload type and platform maturity. We’ll compare CoreDNS & dnsmasq, surface real failure modes, & show how different designs affect latency, reliability, & blast radius.
Speakers
avatar for Shaheen Sayyed

Shaheen Sayyed

Site Reliability Engineer, Red Hat
Coding Enthusiast with a keen interest in Building Scalable Cloud Applications
avatar for Ankur Singh

Ankur Singh

Senior Site Reliability Engineer, Red Hat
A diverse Software Engineer with experience as DevOps Engineer, Platform Engineer & Site Reliability Engineer.
Monday May 18, 2026 4:30pm - 5:10pm CDT
200F (Level Two)
  Cloud + Orchestration

5:25pm CDT

Monolithic To Cloud Native: Lessons From Migrating Heroku To EKS at Scale - Mateen Anjum, Phono Technologies Inc
Monday May 18, 2026 5:25pm - 6:05pm CDT
When our platform served a SaaS company growing from
9M$ to $100M ARR, we faced a decision every scaling team encounters: stay on Heroku or migrate to Kubernetes. We chose migration. This talk shares the real lessons from moving 47 microservices to EKS while maintaining 99.9% uptime.

I will cover the migration patterns that worked, the ones that failed spectacularly, and the operational changes that made the difference between success and rollback. You will learn how we reduced API latency from 700ms to 70ms, why our first three migration attempts failed, and what "production ready" actually means when your customers include enterprise clients with strict SLAs.

This is not a vendor pitch or theoretical framework. It is a practitioner's account of what happens when you bet your platform on Kubernetes and have to deliver.

Attendees will leave with a realistic migration checklist, common failure patterns to avoid, and honest metrics on what to expect during and after migration.
Speakers
avatar for Mateen Anjum

Mateen Anjum

Staff DevOps Engineer, Phono Technologies Inc
Mateen Ali Anjum is a Staff DevOps Engineer with 12 years of experience building infrastructure platforms. He has scaled systems, led platform migrations, and currently works in Canada. His work focuses on the intersection of reliability engineering and emerging AI tooling for infrastructure... Read More →
Monday May 18, 2026 5:25pm - 6:05pm CDT
200F (Level Two)
  Cloud + Orchestration
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Audience Experience Level
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -