Kubernetes monitoring: why it is difficult and how to improve it
04-17, 16:10–16:40 (Europe/Amsterdam), The Warehouse

Kubernetes has changed everything and has become a standard for software development and orchestration. But by making life for software engineers easier, Kubernetes had a severe impact on observability and monitoring systems. The talk will explain the consequences of Kubernetes popularity for various monitoring systems, the price of flexibility it provides, and how any software becomes more and more complex with time.


The presentation will walk the audience through the concepts of time series churn, ephemerality and high cardinality. It will elaborate on the effect Kubernetes has on monitoring systems and how users can reduce resource usage by optimizing configuration. As an example, I'll show how the amount of exposed metrics has doubled or tripled for a default k8s cluster or even for Prometheus over the last couple of years.
But the main aim of the presentation is to call on the problem of producing huge volumes of metrics that are never used by the end users. We believe, this problem should be discussed more and eventually the community will come up with better standardization for metrics design.

Roman is a software engineer with experience in distributed systems, databases, monitoring, and high-performance microservices. Roman's passion is open source and he's proud to have contributions to Prometheus, Grafana, and ClickHouse. Currently, Roman is working on the open source time series database and monitoring solution VictoriaMetrics.