logo
Dates

Author


Conferences

Tags

Sort by:  

Authors: Vladimir Kovacik, Greg Smith
2023-04-20

A brief story of how we came to use Vitess/Kubernetes to power some of the biggest entertainment franchises on the planet A few years ago we started thinking about: “What would it look like to run a database on Kubernetes?” We had just migrated most of our workloads from VMs to Linux system containers. This unlocked a lot of performance potential, while being a mostly drop-in replacement. As our fleet grew and the on-call burden started to rear its head, we did some requirements gathering for running these databases using our new Kubernetes-based platform. We ended up testing a parallel track using several open source technologies. Months into the testing there was a very clear winner which met our requirements: Vitess. We spent the last few months of the year building a proof of concept for one of our smaller services, and launched it with that year’s major titles. The success of this spurred an increased interest in Vitess across Demonware/Activision leading to many larger services adopting it for the following year. This talk will mainly be about the transitional phases of moving from our classic database stack to Vitess. We will give a high level overview of the experience, what we learned, and some interesting points worth sharing to the wider community.
Authors: Sunil Shah, Ramya Krishnan, Ashley Cutalo, Madhu C.S., Fabio Kung
2022-10-27

Kubernetes clusters are critical infrastructure at large, public companies, with large amounts of traffic, complex dependencies on 3rd party services, and constant change as developers release features and traffic scales up and down. In this panel discussion, engineers from Airbnb, Lyft, Netflix and Robinhood share their challenges, experiences and learnings when it comes to managing a sustainable on-call rotation that meets the needs of their internal users whilst maintaining a high uptime to serve business critical workloads. Topics covered will include: +Keeping on-call engineers happy + Balancing rapid response with alert fatigue + Strategies to proactively deal with production issues + Preparing engineers for on-call
Authors: Seok-yong Hong, Wanhae Lee
2022-10-27

tldr - powered by Generative AI

The presentation discusses the development of a detection tool for Kubernetes clusters to identify known issues and automate problem-solving.
  • The tool, called Detect, examines multiple factors to identify problems with Kubernetes clusters.
  • It uses a variety of sources, including Kubernetes, Prometheus, and SSH, to collect data and generate reports.
  • Detect is extensible, allowing users to add or remove rules as needed.
  • The presentation includes a demonstration of how to create a new detector rule for identifying clusters with more than 10,000 parts.
  • The tool is designed to help users manage upgrades and avoid common issues with Kubernetes clusters.