logo
Dates

Author


Conferences

Tags

Sort by:  

Authors: Maciek Pytel
2023-04-20

tldr - powered by Generative AI

The presentation discusses the reliability of running Cluster Autoscaler in production and provides insights on monitoring and debugging tools.
  • Cluster Autoscaler's primary job is to ensure that all pods can schedule
  • Metrics such as pending pod metrics are useful for monitoring Cluster Autoscaler's performance
  • Cluster Autoscaler should be run on dedicated nodes or on the control plane VMs to prevent issues with scaling down
  • Testing configurations before using them in production is recommended
  • Ignoring certain flags can have significant side effects
  • Auto scaling can vary significantly at scale and should be tested
Authors: William Denniss
2022-05-18

tldr - powered by Generative AI

Building a fully managed Kubernetes platform should still prioritize the power and flexibility of Kubernetes, while simplifying the operation of the cluster.
  • A fully managed Kubernetes platform should still prioritize the power and flexibility of Kubernetes
  • Simplifying the operation of the cluster should be the goal, not simplicity at the Kubernetes layer
  • The platform should allow for bursting and support continued usage
  • Node visibility should be maintained while hiding certain bits
Authors: John Skarbek
2021-10-15

tldr - powered by Generative AI

GitLab.com migrated to Kubernetes using multiple clusters to save costs and improve network traffic control.
  • GitLab.com needed to move from virtual machines to Kubernetes as they grew past 10 million projects hosted
  • They used GKE to migrate stateless services and split regional GKE clusters into multiple zonal clusters for better network traffic control
  • Multiple clusters allowed for more efficient maintenance procedures, testing cluster configurations, and mitigating incidents
  • The solution may not work for everyone and network egress problems can occur in other workloads outside of Kubernetes