back button All Events

Autoscaling Generative AI Workloads

Autoscaling Generative AI Workloads

Speaker: Jiri Kremser

Event:     KCD Czech Slovak 2024

Details

June 06, 2024

Generative AI workloads are known for their resource-intensive nature, making them challenging to manage in a Kubernetes environment. In this session, we will explore how Kubernetes Event-Driven Autoscaling (KEDA) can be used to autoscale generative AI workloads efficiently. Attendees will learn how to leverage KEDA to dynamically scale resources based on the workload’s demand, ensuring optimal performance and cost-efficiency.

Kedify home screenshot

Reduce cloud costs and complexity.

Start autoscaling with KEDA today.

Get Started Free