Pricing for Kubernetes GPU & Cluster Autoscaling

Scale your Kubernetes clusters and GPU workloads
at the right speed and the right cost

Kedify delivers intelligent autoscaling for GPUs and Kubernetes clusters,
so your inference, training, and pipeline workloads run efficiently without paying for idle capacity.

Starter

Who It’s For

Teams running a few services (APIs, inference endpoints, GPU workloads) in production.

Included
Clusters

Up to 3

Support

Business hours

Security

Hardened KEDA build

Marketplace purchase options

AWS Marketplace, GCP Marketplace, Red Hat Marketplace

Usage
Pricing

Platform package + Usage pricing

Best for

Getting AI APIs production‑ready with intelligent autoscaling, without over‑provisioning.

Price

$800/mo + usage
(billed annually)

Professional

Who It’s For

Platform teams running multiple GPU workloads and clusters, with higher traffic or stricter SLOs.

Included
Clusters

Up to 10

Scalers

+ HTTP scaler (great for inference endpoints and latency SLOs)

Support

Business hours

Security

+ FIPS

Marketplace purchase options

AWS Marketplace, GCP Marketplace, Red Hat Marketplace

Usage
Pricing

Platform package + Usage pricing

Best for

Consolidating AI APIs, agents, and pipelines across teams with predictable P95 latency.

Price

$2,000/mo + usage
(billed annually)
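
As a rough illustration of what "billed annually" implies for budgeting, the sketch below multiplies the listed monthly platform prices by twelve. It assumes annual billing means twelve months of the platform package invoiced together, which this page does not spell out. The usage rate is not published here, so `assumed_monthly_usage_usd` is a hypothetical input you would supply yourself, and the function names are illustrative only.

```python
# Monthly platform prices in USD, taken from the Starter and Professional tiers.
PLATFORM_PRICE_PER_MONTH = {"Starter": 800, "Professional": 2000}


def annual_platform_fee(tier: str) -> int:
    """Platform package cost for 12 months, excluding usage charges."""
    return PLATFORM_PRICE_PER_MONTH[tier] * 12


def estimated_annual_total(tier: str, assumed_monthly_usage_usd: float) -> float:
    """Platform fee plus a caller-supplied usage estimate.

    The usage figure is an assumption: the page lists "+ usage" without a rate.
    """
    return annual_platform_fee(tier) + 12 * assumed_monthly_usage_usd


if __name__ == "__main__":
    for tier in PLATFORM_PRICE_PER_MONTH:
        print(f"{tier}: ${annual_platform_fee(tier):,} per year before usage")
```

Enterprise is omitted because its price is quoted by Sales rather than listed.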

Enterprise

Who It’s For

Enterprise‑scale AI platforms with advanced security/compliance and custom scaler needs.

Included
Clusters

Unlimited

Scalers

+ Cluster scaling and AI/GPU workload scalers

Support

24/7 on-call with escalation routing

Security

+ FIPS

Airgapped installation

Yes

Marketplace purchase options

AWS Marketplace, GCP Marketplace, Red Hat Marketplace

Usage
Pricing

Platform package + Usage pricing

Best for

Large‑scale inference, agent fleets, and mission‑critical AI services.

Price

Speak with Sales

Are you an engineer?
Try Kedify for free.