I have worked with large scale distributed systems for the last 10+ years, from online gaming to data intensive applications. In the last couple years I have been focusing on building a Kubernetes platform to accelerate the development teams in Meltwater. In my spare time, while not riding my Ducati on a race track I practice the fine art of tsundoku.
Zero downtime upgrades of Kubernetes
The Kubernetes project releases a new version every 3 month as well as several bug fix releases in between. You need and want to upgrade your clusters. How do you do that with zero-downtime and no impact on your production workloads? In this lightning talk I will show how my team has come up with a procedure to upgrade a cluster and monitor the upgrade itself. In particular to avoid impact due to nodes becoming "Not Ready".