The presentation discusses the potential benefits of converged computing, which combines cloud and high-performance computing (HPC) technologies, and the challenges in achieving fully featured HPC scheduling in Kubernetes.
- Converged computing combines cloud and HPC technologies to enhance application performance, scalability, flexibility, and automation.
- Fully featured HPC scheduling in Kubernetes has not yet been achieved, and there are challenges in co-scheduling, throughput, job communication and coordination, portability, and resource heterogeneity.
- The Flux framework is an open-source project that solves the five key technical problems of converged computing.
- Cloud computing is becoming a dominant market force, and HPC needs to integrate research and development in software and hardware to avoid becoming isolated.
- LLNL is seeing demand for cloud technologies within HPC workflows, and there is potential to unite the two communities in a converged computing environment.
The American Heart Association Molecular Screening Workflow, also known as AHA Moles, and the Rapid COVID-19 Small Molecule Drug Design Workflow are examples of composite workflows that use Kubernetes. The 2020 Laboratory Application Survey found that fewer than 10% of applications are currently using cloud, but 73% may adopt cloud in the future.