Pricing for Kubernetes & GPU Autoscaling

Scale your Kubernetes clusters and GPU workloads
at the right speed and the right cost

Kedify delivers intelligent GPU autoscaling and Kubernetes autoscaling
so your inference, training, and pipelines run efficiently without idle waste.

Who It’s For

Included Clusters

Scalers

Support

Security

Airgapped installation

Marketplace purchase options

Usage pricing

Best for

Price

Starter

Who It’s For

Teams running a few services (APIs, inference endpoints, GPU workloads) in production.

Included
Clusters

Up to 3

Scalers

70 pre-built scalers

Support

Business hours

Security

SOC 2 Type II + Hardened KEDA

N/A

Market-place purchase options

Usage
Pricing

Platform package + Usage pricing

Best for

Getting AI APIs production‑ready with intelligent autoscaling, without over‑provisioning.

Price

$800/mo + usage
(billed annually)

Professional

Who It’s For

Platform teams running multiple GPU workloads and clusters, with higher traffic or stricter SLOs.

Included
Clusters

Up to 10

Scalers

+ HTTP scaler (great for inference endpoints and latency SLOs)

Support

Business hours

Security

+ FIPS

N/A

Market-place purchase options

Usage
Pricing

Platform package + Usage pricing

Best for

Consolidating AI APIs, agents, and pipelines across teams with predictable P95s.

Price

$2,000/mo + usage
(billed annually)

Enterprise

Who It’s For

Enterprise‑scale AI platforms with advanced security/compliance and custom scaler needs.

Included
Clusters

Unlimited

Scalers

+ Cluster scaling & AI/GPU
workloads scalers

Support

24/7 on-call with escalation routing

Security

+ FIPS

Airgapped installation

Yes

Market-place purchase options

Usage
Pricing

Platform package + Usage pricing

Best for

Large‑scale inference, agent fleets, and mission‑critical AI services.

Price

Speak with Sales

Are you an engineer?
Try Kedify for free here.