logo
Dates

Author


Conferences

Tags

Sort by:  

Conference:  Black Hat Asia 2023
Authors: Simon Scannell, Valentina Palmiotti, Juan José López Jaimez
2023-05-11

Extended Berkeley Packet Filter (eBPF) is a technology that provides capabilities to programmers seeking to make use of kernel layer performance and functionality. Fundamentally, eBPF allows users to load programs into kernel space and attach them to hook points. This allows for loading kernel code at runtime without needing to modify the kernel source code itself or develop a kernel module. eBPF programs are written in a high-level language and then compiled into assembly-like bytecode. At load time, the bytecode is JIT-compiled into the native architecture which allows for the program to be kernel and architecture-independent. The instruction set is minimal but allows programmers to call outside kernel functions, read and store data in various data structures and perform pointer arithmetic and operations.Programs that run in the kernel must be carefully analyzed to ensure that these programs follow rules to guarantee the integrity and security of the kernel running the program. A single code flaw in any of the components involved in program parsing, analysis, optimization, and compilation may lead to a compromise of the kernel running an eBPF implementation.As eBPF becomes more prevalent, the goal of our talk is to share the history of eBPF vulnerabilities, bug classes, mitigations and provide an outlook for the future. We will also share our insights into automated vulnerability discovery. We will introduce listeners to advanced concepts of structured fuzzing such as designing and implementing an Intermediate Language. We will also discuss identifying roadblocks such as bug detection and give practical examples of how to overcome them. This will enable anyone to apply these concepts to their own fuzzing campaigns. The source code of our fuzzer will also be made available.
Authors: Sonja Chevre, Ahmet Soormally
2023-04-21

tldr - powered by Generative AI

OpenTelemetry can help monitor GraphQL queries in production and improve troubleshooting for developers and SREs.
  • GraphQL is a query language and server-side runtime that provides a monolithic facade on top of complex microservice architecture
  • Using GraphQL introduces new challenges when isolating failures and troubleshooting performance issues
  • OpenTelemetry can help monitor and improve troubleshooting for GraphQL queries in production
  • The RED method can be used to monitor the health and performance of distributed systems
  • Instrumenting GraphQL services with OpenTelemetry can provide distributed traces for monitoring
Authors: Dan Garfield, Brandon Phillips
2023-04-20

tldr - powered by Generative AI

The presentation discusses the scalability of Argo CD and the importance of separating concerns to minimize blast radius. The speaker also introduces the Sig Scalability project and encourages audience participation.
  • Argo CD is scalable but it's important to separate concerns to minimize blast radius
  • Sig Scalability is a project aimed at improving the performance of Argo CD and the speaker encourages audience participation
  • The speaker demonstrates the scalability of Argo CD by syncing all apps from the internet
  • The speaker also discusses the Argo CD certification program
Authors: Erwin de Keijzer
2023-04-20

Erwin's house has an elevator, you might think it's an apartment, but no, it's a family home with an elevator. Since moving in Erwin has wanted to upgrade the elevator experience. In this talk Erwin explains how he used open source projects like NATS, Grafana, Prometheus, AlertManager and protocol buffers to track elevator performance and add awesome elevator music and floor announcements to an otherwise mundane elevator ride. Erwin will show how he made the system resilient and performant and show some epic dashboards with insights into the elevator performance. This talk will not feature a live demo, Erwin thought about bringing the elevator with him, but it was not accepted by his family.
Authors: Brandon Smith, Howard Hao
2023-04-20

tldr - powered by Generative AI

The presentation discusses the challenges of bringing legacy applications into the modern cloud while reducing costs and the importance of effectively tuning and monitoring Windows containers for optimal performance.
  • Legacy applications need to be brought into the modern cloud to reduce costs and improve business value
  • Windows containers are more efficient than traditional Windows Server VMS
  • Effective tuning and monitoring of Windows containers is essential for optimal performance
  • Performance analysis should be easily accessible and updated guidance should be provided
  • Collaboration between businesses and Microsoft can help improve Windows performance
Authors: Neeraj Poddar
2023-04-20

tldr - powered by Generative AI

The main theme of the conference presentation is the roadmap for Istio in 2023, which focuses on making the service mesh more predictable, stable, and valuable for users. The presentation also discusses the transition from sidecar proxies to the ambient architecture and the future of Istio.
  • The themes for Istio in 2023 are accelerating the time to value for the service mesh, making the transition to mtls easier, reducing the total cost of ownership, and continuing to be predictable and stable.
  • The focus areas for Istio in 2023 include getting the ambient mesh to production, participating in the Gateway API and Gamma API initiatives, promoting alpha and beta features and APIs, and integrating natively with other Cloud native projects.
  • The presentation explains the shift from sidecar proxies to the ambient architecture and how it makes Istio more flexible and scalable.
  • The future of Istio may involve a combination of sidecar proxies and the ambient architecture, depending on the use case and requirements.
Authors: David Meder-Marouelli, Marco Schröder
2023-04-20

tldr - powered by Generative AI

The presentation discusses the reserves and optimization strategies for bare metal servers in a Kubernetes cluster to minimize energy consumption and costs.
  • There are three types of reserves: scale out reserves for future growth, geo-resident density reserves, and peak performance reserves.
  • Automation is key to quickly re-enabling hardware and minimizing energy consumption.
  • Immutable infrastructure and active-active data centers help with geo-redundancy reserves.
  • Transparency and buy-in from management is important for balancing potential cost savings and risks.
  • The presentation emphasizes the importance of measuring and quantifying energy consumption to make informed decisions about optimization strategies.
Authors: Derek Cavanaugh, Sara Moore
2023-04-19

tldr - powered by Generative AI

The presentation discusses the challenges of managing logs in a distributed system and how Loki, a log aggregation system, can help address these challenges.
  • Loki is a log aggregation system that can help manage logs in a distributed system
  • Managing logs in a distributed system can be challenging due to the large number of logs and the need to optimize chunk size
  • Query parallelization and horizontal scaling can help improve query performance and reduce costs
  • Monitoring and auditing cardinality is important to ensure system health
  • Tools like Prometheus and Tempo can also help address similar challenges in observability
Authors: Wojciech Tyczyński
2023-04-19

tldr - powered by Generative AI

Tips for dividing workloads among multiple clusters in Kubernetes
  • Networking is the most stressing for the control plane and where the biggest number of issues are seen
  • Understanding the size of churn forward or observed services is a significant factor in workload division
  • The current scalability limit of 5000 nodes is not a hard limit and there are no plans to push it further in open source
  • External factors like third-party controllers and ecosystem improvements need to be addressed
  • Using the watch protocol for getting large collections of data can help with memory consumption and system throughput
  • Graceful shutdowns can prevent the control plane from being blown out by hundreds of thousands of watches
  • Optimizations should be balanced with complexity versus return on investment trade-off
Authors: Aldo Culquicondor, Kante Yin
2023-04-19

tldr - powered by Generative AI

The talk discusses the latest enhancements in SIG Scheduling in Kubernetes and opportunities for better support for services and batch type workloads.
  • Improvements in scheduler performance for higher scheduling throughput
  • Better support for rolling updates in deployments while maintaining high availability
  • Introduction of the SchedulingGates knob for external integrators to control pod scheduling
  • Development of sponsored projects such as Kueue, scheduling plugins, and the descheduler
  • Discussion on priority and pod scheduling policies
  • Importance of paying attention to machine availability and idle pods