kube_vpa

Stable

Direct

Kubernetes VPA

This module provides a deployment of the vertical-pod-autoscaler project.

Usage

Metrics History

Set history_length_hours to the number of hours of historical metrics that you want to use for the initial VPA recommendations.

Metrics are weighted based on an exponential decay algorithm so more recent data will be weighted more heavily than older data. Metrics older than history_length_hours will no longer impact calculations.

If using Prometheus, 100 samples will be taken from this range in order to seed the internal VPA database. Do not set history_length_hours to greater than 1 week as this will cause significant strain on the Prometheus instance.

Using the Prometheus Backend

Alpha Quality: Do not use

The VPA can use Prometheus as the source of its recommendations when setting pod resources. To enable this in the Panfactum stack:

Ensure you have deployed kube_monitoring. Note that kube_monitoring must have been deployed for at least history_length_hours in order for the VPA to work properly.
Set prometheus_enabled to true.
Add the thanos_query_frontend_url input from the kube_monitoring output.
Apply the module.
If you had previously deployed the VPA without Prometheus, delete all Verticalpodautoscalercheckpoints: kubectl delete -A verticalpodautoscalercheckpoints.autoscaling.k8s.io --all.

Providers

The following providers are needed by this module:

helm (2.12.1)
kubectl (2.1.3)
kubernetes (2.34.0)
pf (0.0.7)

Required Inputs

No required inputs.

Optional Inputs

The following input variables are optional (have default values):

burstable_nodes_enabled

Description: Whether to allow pods to schedule on burstable nodes

Type: bool

Default: true

controller_nodes_enabled

Description: Whether to allow pods to schedule on EKS Node Group nodes (controller nodes)

Type: bool

Default: true

history_length_hours

Description: The number of prior hours of metrics data that will be used for VPA recommendations

Type: number

Default: 24

log_verbosity

Description: The log verbosity (0-9) for the VPA pods

Type: number

Default: 0

monitoring_enabled

Description: Whether to allow monitoring CRs to be deployed in the namespace

Type: bool

Default: true

panfactum_scheduler_enabled

Description: Whether to use the Panfactum pod scheduler with enhanced bin-packing

Type: bool

Default: false

prometheus_enabled

Description: Whether to enable prometheus as the storage backend for the VPA recommender

Type: bool

Default: false

pull_through_cache_enabled

Description: Whether to use the ECR pull through cache for the deployed images

Type: bool

Default: true

sla_target

Description: The Panfactum SLA level for the module deployment. 1 = lowest uptime (99.9%), lowest cost — 3 = highest uptime (99.999%), highest Cost

Type: number

Default: 3

spot_nodes_enabled

Description: Whether to allow pods to schedule on spot nodes

Type: bool

Default: true

thanos_query_frontend_url

Description: The address of the thanos query frontend to be used by the VPA recommender

Type: string

Default: null

vertical_autoscaler_helm_version

Description: The version of VPA helm chart to deploy

Type: string

Default: "4.7.1"

vertical_autoscaler_image_version

Description: The version of VPA image deploy

Type: string

Default: "1.2.1"

vpa_enabled

Description: Whether the VPA resources should be enabled

Type: bool

Default: false

wait

Description: Wait for resources to be in a ready state before proceeding. Disabling this flag will allow upgrades to proceed faster but will disable automatic rollbacks. As a result, manual intervention may be required for deployment failures.

Type: bool

Default: true

Outputs

No outputs.

Usage

No notes