Scaling Kubeflow for Multi-tenancy at Spotify
Conference: KubeCon + CloudNativeCon North America 2021
Authors: Keshi Dai, Jonathan Jin
2021-10-15
tldr - powered by Generative AI
Improving observability and reliability in a multi-cluster environment through infrastructure as code and custom metrics
Investing in observability and reliability before issues arise
Using infrastructure as code, specifically Terraform and Argo CD, to manage multi-cluster deployments and ensure consistency
Creating custom metrics, such as Kubeflow state metrics, to track specific product needs and enable effective SLOs and alerts
Tags: Kubeflow, Spotify, machine learning, Centralized, Product