Authors: Andrii Rosa
2022-06-22

tldr - powered by Generative AI

Trino is an out-of-the-box solution for ETL that provides the execution and resource management capabilities needed to run queries of practically any size on a cluster with limited resources. The solution removes the streaming exchange limitation by introducing a distributed buffer between tasks, so each task in a query can execute completely independently of any other. This simple yet powerful improvement reduces the amount of memory a cluster must have available to execute a query successfully and allows fine-grained failure recovery. The solution has been well vetted and battle-tested under high concurrency, achieving cost savings of up to 60%.
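
The buffered-exchange idea in the summary can be illustrated with a toy model. The sketch below is not Trino code or Trino's API: `ExchangeBuffer`, `run_with_retry`, `make_scan_task`, and `run_query` are hypothetical names, and an in-memory dictionary stands in for the distributed buffer. It only shows why committing each task's output to a buffer lets a single failed task be retried without restarting the rest of the query.

```python
class ExchangeBuffer:
    """Hypothetical stand-in for a distributed buffer between stages."""

    def __init__(self):
        self._committed = {}

    def commit(self, stage, task_id, rows):
        # Output becomes visible to downstream tasks only after the
        # producing task has finished successfully.
        self._committed[(stage, task_id)] = list(rows)

    def read_stage(self, stage):
        # Downstream tasks read committed output, not a live stream.
        return [
            row
            for (s, _), rows in sorted(self._committed.items())
            if s == stage
            for row in rows
        ]


def run_with_retry(task, max_attempts=3):
    """Run one task in isolation; return (output, attempts used)."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task(attempt), attempt
        except RuntimeError:
            if attempt == max_attempts:
                raise


def make_scan_task(task_id, rows, fail_first_attempt=False):
    """Build a toy upstream task that doubles each input row."""

    def task(attempt):
        if fail_first_attempt and attempt == 1:
            raise RuntimeError(f"simulated failure of scan task {task_id}")
        return [r * 2 for r in rows]

    return task


def run_query():
    buffer = ExchangeBuffer()
    attempts = {}
    inputs = {0: [1, 2], 1: [3, 4], 2: [5, 6]}

    # Upstream stage: tasks run one at a time, so only one task's
    # working set is held in memory at any moment.
    for task_id, rows in inputs.items():
        task = make_scan_task(task_id, rows, fail_first_attempt=(task_id == 1))
        output, used = run_with_retry(task)
        buffer.commit("scan", task_id, output)
        attempts[task_id] = used

    # Downstream stage: starts only after upstream output is committed.
    total = sum(buffer.read_stage("scan"))
    return total, attempts


if __name__ == "__main__":
    print(run_query())
```

In this toy run only scan task 1 is executed twice; tasks 0 and 2 are never repeated, which is the fine-grained recovery the summary describes. A real deployment would keep the buffer in external storage rather than in process memory.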