The presentation discusses an AI-powered optimization methodology for improving cost efficiency and performance of digital services provided by a company.
- The challenge faced by the customer was to optimize their application while keeping on releasing application updates to introduce new business functionalities and align to new regulations.
- The tuning practice in place was manual and took almost two months to tune one single macro service.
- The AI-powered optimization methodology works in five steps: applying new configuration suggested by AI, applying workload to target system, collecting KPIs, analyzing results, and producing new configuration to be tested in the next iteration.
- The methodology allows setting constraints and goals, such as minimizing application cost and ensuring service reliability.
- The presentation provides an anecdote of how the methodology was used to optimize a customer's authentication service on Kubernetes, resulting in a 49% improvement on cost efficiency compared to the baseline configuration.
The presentation shows how the AI-powered optimization methodology was used to optimize a customer's authentication service on Kubernetes. The baseline configuration caused spikes in response time and higher memory usage, leading to a lack of performance, operational efficiency, and business agility. By experimenting with different Kubernetes and Java configurations suggested by AI, the methodology identified a new configuration that increased both memory and CPU request limits, adjusted JVM options, and sustained full load by one pod. This resulted in a 49% improvement on cost efficiency compared to the baseline configuration.