Srinivas Anant
@jsrinivas_anant
Observability in Kubernetes: The good, the bad, and the ugly
Submitted Mar 30, 2025
Topic of your submission:
Observability
Type of submission:
30 mins talk
I am submitting for:
Rootconf Annual Conference 2025
Overview
With every new Kubernetes cluster, observability becomes a more daunting task. Understanding the performance, health, and behavior of complex, distributed workloads is critical for ensuring reliability and efficiency. This talk will explore best practices for observability in Kubernetes environments, covering logs, traces, metrics, events, and uptime.
We will discuss how we have consolidated observability data from multiple sources and streamlined the process of alerting teams in real time using Clutch and Temporal. We will also showcase how we have leveraged NATS and TimescaleDB for Incident Response Management and On-Call Alerting.
Takeaways
This presentation addresses the community’s need for standardized observability solutions by showcasing the practical applications of OpenSearch, VictoriaMetrics, ArgoCD, Jaeger, and OpenTelemetry in Kubernetes environments.
By sharing implementation strategies and lessons learned, the talk aims to help attendees enhance their monitoring capabilities and thereby improve the reliability and performance of cloud-native applications.
Audience
This session will benefit Platform Engineers, Infrastructure Engineers, and SREs who want to enhance their existing Kubernetes observability to improve reliability and operational visibility.
About Me
I work as a Senior Member of Technical Staff at Nutanix Technologies India Pvt Ltd. I have expertise in managing and maintaining distributed systems such as Kubernetes, VictoriaMetrics, and HashiStack, and in stitching together different tools to create a robust infrastructure. I love to tinker with distributed systems. Outside of work, I am an avid reader and love anything related to sci-fi.