S3 Ep 16: K8s Monitoring & Observability with Grafana's Vasil Kaftandzhiev

On the latest Livin’ on the Edge podcast (and my last episode for Season 3), I interviewed Vasil Kaftandzhiev, a fellow Product Manager at Grafana. We explored the importance of observability in IT systems, particularly in the context of cloud infrastructure and Kubernetes management. Observability is the continuous monitoring of a system and of business KPIs with the goal of understanding why something is happening. It goes beyond the here-and-now of monitoring (which is itself a key part of observability) and extends to analyzing and understanding broader problems, the underlying system, and root causes. Vasil’s team at Grafana has been focused on building opinionated observability solutions based on technology and observability best practices, and he shared a few of those best practices with us. We also dove into the difference between plain monitoring and true observability, the role of AI and ML within observability, and, of course, the importance of resource utilization and cost management in Kubernetes.

About the Podcast

Software developers, platform engineers, and sysadmins/operators listen to the biweekly Ambassador podcasts to learn how to build cloud platforms and create an effective developer experience (DevEx) for deploying container-based applications to Kubernetes. We also discuss best practices for releasing functionality via continuous delivery pipelines, and we investigate the latest developer tooling, API gateway technology (e.g., Envoy), and service mesh implementations. We interview practitioners and senior technical leaders from organizations such as HashiCorp, Lyft, TicketMaster, Headstart, and Buoyant.