The presentation discusses the importance of distributed tracing and the use of sampling strategies to manage the volume of data produced. It also highlights the challenges of implementing tail-based sampling using OpenTelemetry.
- Distributed tracing is important for understanding how the services in a system connect and for diagnosing problems.
- Traces are made up of spans, which represent logical units of work within a request.
- Sampling can be applied at different stages of span processing to reduce the number of spans that are created or exported.
- Tail-based sampling, which decides whether to keep a trace after it has completed, can be the most efficient way to get the desired data, but it raises performance and scalability concerns.
- OpenTelemetry requires a collector to implement tail-based sampling, and all spans of a given trace need to end up in the same collector instance for it to work properly.
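
The last two points can be illustrated with a minimal Python sketch. It is not OpenTelemetry's API or the collector's implementation. The `Span` fields, the `pick_collector` helper, the `TailSampler` class, and the 500 ms threshold are all hypothetical names and values chosen for illustration. The sketch assumes the caller knows when a trace is complete, which a real system usually has to approximate with a timeout.

```python
import hashlib
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Span:
    # Hypothetical, simplified span: real spans carry many more fields.
    trace_id: str
    name: str
    duration_ms: float
    is_error: bool = False


def pick_collector(trace_id: str, collectors: list[str]) -> str:
    """Route by trace ID so every span of a trace reaches the same collector."""
    digest = hashlib.sha256(trace_id.encode()).digest()
    return collectors[int.from_bytes(digest[:8], "big") % len(collectors)]


class TailSampler:
    """Buffers spans per trace and decides only once the trace is finished."""

    def __init__(self, slow_ms: float = 500.0):
        self.slow_ms = slow_ms  # assumed threshold for a "slow" span
        self.buffer: dict[str, list[Span]] = defaultdict(list)

    def add(self, span: Span) -> None:
        # Every buffered span costs memory until the decision is made,
        # which is where the performance and scalability concerns come from.
        self.buffer[span.trace_id].append(span)

    def finish(self, trace_id: str) -> list[Span]:
        """Return the trace's spans if it should be kept, else an empty list."""
        spans = self.buffer.pop(trace_id, [])
        keep = any(s.is_error or s.duration_ms >= self.slow_ms for s in spans)
        return spans if keep else []
```

Because the keep-or-drop decision looks at every span of a trace, a sampler instance that sees only part of a trace could drop it even though another instance holds the span with the error. Hashing the trace ID, as `pick_collector` does, is one simple way to keep a trace together. Note that a plain modulo reshuffles most traces whenever the list of collectors changes.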