DevConf.US 2025

Prithvi Raj

Prithvi Raj is working as a Community Manager & Developer Advocate at Mirantis and is leading the community efforts for the OSPO at Mirantis including the k0s project, k0smortron, and the other OSS projects Mirantis is contributing to. He previously led the LitmusChaos project community, and has helped scale a community of 3000+ folks from scratch.
He is also a CNCF Ambassador and has closely worked to co-organize KCD Bengaluru, Chaos Carnival and LitmusChaosCon
He was previously at Harness.


Job title

Senior Community Manager

Company or affiliation

Mirantis


Sessions

09-19
14:20
35min
The Juggernaut of AIOps Driven Platform Engineering at Scale: Acing Distributed Container Management
Prithvi Raj

Kubernetes has transformed container orchestration, but managing Kubernetes clusters and Internal Developer Platforms (IDPs) at scale remains a complex challenge for platform engineers. While ClusterAPI offers a declarative way to provision and operate clusters, it often brings complexity through verbose YAML and integration difficulties.

This talk introduces an AIOps-driven approach that leverages ClusterAPI, Sveltos, and templated automation to enable end-to-end lifecycle management—provisioning, upgrades, and teardown—across diverse environments.

Through a real-world, large-scale platform demo, we’ll explore intelligent multi-cluster management, with enhanced observability, automated scaling, and proactive anomaly detection. Attendees will gain practical insights into building adaptive, resilient, and developer-friendly platform experiences on Kubernetes.

DevOps and Automation
101 (Capacity 48)
09-20
14:00
35min
Cluster Provisioning in a New Avatar: An AI/ML-Driven Approach to Platform Lifecycle Management
Prithvi Raj

This talk explores how platform engineering teams can build a self-service AI/ML infrastructure on Kubernetes—enabling dynamic provisioning, policy enforcement, and full observability—while allowing data scientists to stay focused on model development.

While ClusterAPI simplifies cluster provisioning, managing AI/ML workloads demands a full-lifecycle approach. We demonstrate how to extend ClusterAPI with tools like Backstage, OpenFeature, k0s, Prometheus, Sveltos, and k0rdent to build a scalable, secure, and automated AI/ML platform.

Key takeaways include:

  • AI/ML cluster provisioning with ClusterAPI and k0s
  • Policy-driven multi-cluster automation
  • Controlled model rollouts with feature flags
  • Self-service ML environments via an Internal Developer Platform
  • Observability and performance monitoring at scale

Join me to discover how a Kubernetes-native approach can empower your AI/ML platform engineering journey.

Cloud, Hybrid Cloud, and Hyperscale Infrastructure
101 (Capacity 48)