Trino is an out-of-the-box solution for ETL that provides necessary execution and resource management capabilities to handle queries of practically any size at a cluster with limited resources. The solution breaks down the streaming exchange limitation and introduces a distributed buffer in between tasks, allowing each task in a query to be executed completely independently of any other. This simple yet powerful improvement reduces the amount of memory that must be available in a cluster to successfully execute a query and allows fine-grained failure recovery. The solution has been well vetted and battle-tested under high concurrency, achieving up to 60% of cost savings.
- Trino is an out-of-the-box solution for ETL that provides necessary execution and resource management capabilities to handle queries of practically any size at a cluster with limited resources
- The solution breaks down the streaming exchange limitation and introduces a distributed buffer in between tasks, allowing each task in a query to be executed completely independently of any other
- This simple yet powerful improvement reduces the amount of memory that must be available in a cluster to successfully execute a query and allows fine-grained failure recovery
- The solution has been well vetted and battle-tested under high concurrency, achieving up to 60% of cost savings