logo
Dates

Author


Conferences

Tags

Sort by:  

Authors: Kevin Hoffman
2023-04-21

tldr - powered by Generative AI

WasmCloud is a distributed computing framework that allows for building microservices, functions, and fully distributed systems using simple WebAssembly modules. It provides capabilities such as a web server, key-value store, and event sourcing. The framework takes care of dispatching messages and communicating with state stores, allowing developers to focus on writing simple, testable functions.
  • WasmCloud allows for building microservices, functions, and fully distributed systems using simple WebAssembly modules
  • Capabilities such as a web server, key-value store, and event sourcing are provided
  • The framework takes care of dispatching messages and communicating with state stores
  • Developers can focus on writing simple, testable functions
Authors: Marek Siarkowicz
2023-04-19

tldr - powered by Generative AI

The presentation discusses model-based testing for verifying the correctness of distributed systems, using HCD as an example. The model is simplified but can get complicated, and the testing is fragile if there are bugs or optimizations. The presentation also mentions the possibility of generalizing the model-based testing beyond HCD.
  • Model-based testing is great for testing generic approaches to correctness and separates validation from execution
  • The model can be simplified but can get complicated, and the testing is fragile if there are bugs or optimizations
  • The state increases exponentially, making the test fragile
  • The model can be generalized beyond HCD
  • The testing can validate the operations or the model and generate a report
  • The presentation includes an anecdote about using fail points to test HCD and finding a durability issue
Authors: Rebecca Bilbro, Patrick Deziel
2022-10-26

tldr - powered by Generative AI

The presentation discusses the challenges and solutions in building and maintaining a distributed system for a global directory service that securely exchanges private information for auditing purposes.
  • Storing public certificates in a distributed system to avoid latency fees
  • Distributed systems allow for tolerance of failures and increased availability
  • Testing for concurrency bugs is crucial in building a distributed system
  • Reframing distributed systems as a flow of events across space and time can improve user experience
  • The use of Kafka in Tinder's app demonstrates the importance of strict ordering in distributed systems
Authors: Colin Douch
2022-05-19

tldr - powered by Generative AI

The presentation discusses the challenges of using Prometheus as the standard for collecting and storing time series in serverless architectures and the need for a better solution.
  • Prometheus assumes that the system lives long enough to be discovered and scraped, that the service is network-enabled, and that the user can do their own aggregation.
  • Prometheus uses a pull-based model for metrics collection, which requires the service to live for at least 5-15 seconds.
  • Exposing things over the network requires the ability to listen on a port, spin up a server, and secure communication with firewall rules and TLS certificates.
  • Prometheus assumes that the user can do their own aggregation, which can be problematic for metrics like request counts.
  • There is a need for a better solution that can handle the challenges of serverless architectures and provide more accurate metrics.
Authors: Frederick Kautz, Andres Vega
2022-05-18

SPIFFE aims to strengthen the identification of software components in a common way that can be leveraged across distributed systems by anyone, anywhere. The ability to maintain software security by standardizing how systems define, attest, and maintain software identity, regardless of where systems are deployed or who deploys those systems, confers many benefits. The use of SPIFFE can significantly reduce costs associated with the overhead of managing and issuing cryptographic identity documents and accelerate development by removing the need for developers to understand the complexity involved to secure service-to-service communication, but that is not the only outcome. Production identity can have a positive impact on many areas such as interoperability, compliance, audibility, and more. This presentation demonstrates the real world scenarios and outcomes of deploying SPIFFE across your infrastructure and also using it to bridge and integrate the infrastructure of others.Click here to view captioning/translation in the MeetingPlay platform!
Authors: Paul Burt, Betty Junod
2021-10-15

tldr - powered by Generative AI

The presentation discusses the challenges of distributed systems and how Kubernetes addresses them through its design choices. It also compares Kubernetes to other modern systems and explores real-world cases of failures.
  • Distributed systems are challenging because failure is inevitable and requires designing systems to handle it gracefully.
  • Kubernetes is designed to handle failure through fault tolerance and traffic routing.
  • Other modern systems, such as Docker Swarm, HashiCorp Nomad, and K3s, have different approaches to handling failure.
  • DistSys concepts such as CAP theorem, Gossip protocols, High Availability, and the RAFT consensus algorithm are discussed.
  • Real-world cases, such as Target's 2019 cascading failure, are explored to illustrate the challenges of distributed systems.
  • Understanding the problems confronting distributed systems and what 'correct' looks like is essential for designing and operating them effectively.