logo
Dates

Author


Conferences

Tags

Sort by:  

Authors: Andrew Newdigate
2022-10-28

tldr - powered by Generative AI

The importance of retaining long-term metric data and using Python data analytics ecosystem with Prometheus data for capacity planning and other purposes
  • Tamland is a tool used for capacity planning that relies on long-term metric data retention and Python data analytics ecosystem with Prometheus data
  • Retaining long-term metric data is important for answering future questions and can be done with tools like Thanos, Cortex, Mamir, and Timescale DB
  • Python libraries like Prometheus pandas, Prophet, Neural Prophet, and Great Kite can be used for analyzing data and forecasting
  • Timeline, an open-source project available on GitLab, can be used for capacity planning and other purposes like Cloud cost forecasting, security and abuse monitoring, and network monitoring
Authors: John-Paul Robinson, Camille Rodriguez
2022-10-27

tldr - powered by Generative AI

The presentation discusses the use of Kubernetes (K8s) in research computing, particularly in machine learning operations (mlOps) workflows. The speaker highlights the need for a K8s platform to handle the environmental configuration and workflow integration required by mlOps. The presentation also touches on the challenges of managing different CUDA versions and the need for generous resource provisioning to handle large models in containers.
  • Kubernetes is being used in research computing, particularly in mlOps workflows
  • A K8s platform is needed to handle the environmental configuration and workflow integration required by mlOps
  • Managing different CUDA versions can be challenging
  • Generous resource provisioning is needed to handle large models in containers
Authors: Dov Hershkovitch
2022-10-25

DevSecOps extends the DevOps ecosystem with the security aspect. Sensitive information is everywhere, be it passwords, secret tokens or exchanged IDs in order to gain access to tools and platforms. The problem has been addressed by many secret management solutions and frameworks, yet creating another problem: Which to choose from, and how to integrate best into your DevOps processes? Engineers started to workaround the security protocols, and often sensitive information is stored in insecure ways. A plaintext token can lead to security leaks and business incidents in a worst case scenario. JSON Web Token (JWT) aims to build the integration bridge as an open standard for security claims exchange. Join this session to learn how in GitLab we leverage JWT tokens to access different secret management solutions, including major cloud providers. Hear best practices on the challenges to retrieve sensitive data and how to enhance the DevSecOps security processes in your organization.
Authors: John Skarbek
2021-10-15

tldr - powered by Generative AI

GitLab.com migrated to Kubernetes using multiple clusters to save costs and improve network traffic control.
  • GitLab.com needed to move from virtual machines to Kubernetes as they grew past 10 million projects hosted
  • They used GKE to migrate stateless services and split regional GKE clusters into multiple zonal clusters for better network traffic control
  • Multiple clusters allowed for more efficient maintenance procedures, testing cluster configurations, and mitigating incidents
  • The solution may not work for everyone and network egress problems can occur in other workloads outside of Kubernetes