The presentation discusses the use of Kubernetes (K8s) in research computing, particularly in machine learning operations (MLOps) workflows. The speaker argues that MLOps needs a K8s platform to handle its environment configuration and workflow integration. The presentation also touches on the difficulty of managing multiple CUDA versions and on the need for generous resource provisioning when large models run in containers.
- Kubernetes is being used in research computing, particularly in MLOps workflows
- A K8s platform is needed to handle the environment configuration and workflow integration required by MLOps
- Managing different CUDA versions can be challenging
- Generous resource provisioning is needed to handle large models in containers
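The last two points can be illustrated with a minimal sketch. It is not taken from the presentation: it assumes the cluster exposes GPUs through the NVIDIA device plugin's `nvidia.com/gpu` resource, and the helper `build_pod_manifest`, the pod names, the image tags, and the resource sizes are all hypothetical. The idea is that each workload pins its CUDA version through its container image, while requests and limits are set generously enough for a large model.

```python
import json


def build_pod_manifest(name, cuda_image, gpus=1, memory="64Gi", cpus="8"):
    """Return a Kubernetes Pod manifest (as a plain dict) for one GPU job.

    Hypothetical helper for illustration only. The CUDA version is pinned
    by the container image tag, so jobs needing different CUDA versions
    can share one cluster.
    """
    resources = {
        "cpu": cpus,
        "memory": memory,
        # Assumes the NVIDIA device plugin is installed on the cluster.
        "nvidia.com/gpu": gpus,
    }
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": "worker",
                    "image": cuda_image,
                    "command": ["nvidia-smi"],
                    # Equal requests and limits reserve the full amount
                    # for the container up front.
                    "resources": {
                        "requests": dict(resources),
                        "limits": dict(resources),
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    # Two jobs that need different CUDA versions (image tags are examples).
    jobs = {
        "job-cuda11": "nvidia/cuda:11.8.0-runtime-ubuntu22.04",
        "job-cuda12": "nvidia/cuda:12.2.0-runtime-ubuntu22.04",
    }
    for job_name, image in jobs.items():
        print(json.dumps(build_pod_manifest(job_name, image), indent=2))
```

Because Kubernetes accepts JSON as well as YAML, each printed manifest could be saved to a file and submitted with `kubectl apply -f`, assuming the cluster actually has the requested CPU, memory, and GPU capacity.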
The speaker mentions that their researchers account for about 30% of the research revenue at the University of Alabama at Birmingham. They also note that their mascot is a dragon, not an elephant, a reference to the mascot of the state's popular football team. Finally, the speaker discusses the use of high-performance computing clusters and the need to keep data as close to the CPU as possible, at the highest available speed.