Skip to main content
This guide walks through installing the Lilac GPU operator in your cluster. The operator is deployed via a Helm chart hosted on AWS ECR.

Prerequisites

  • Kubernetes 1.28+
  • kubectl configured with cluster admin access
  • helm v3 installed
  • NVIDIA GPU nodes with device plugin installed
  • A Lilac supplier API key (generated during onboarding)

Install with Helm

1

Create the namespace

kubectl create namespace lilac-system
2

Install the operator

helm install lilac-gpu-operator \
  oci://public.ecr.aws/lilac/lilac-gpu-operator \
  --version 0.2.0 \
  --namespace lilac-system \
  --set apiKey="YOUR_SUPPLIER_API_KEY" \
  --set clusterName="my-gpu-cluster"
Replace my-gpu-cluster with a name that identifies this cluster in your dashboard.
The Helm chart automatically creates the API key secret, control plane config, CRDs, RBAC, and service accounts. No manual setup required.
3

Verify the installation

kubectl get pods -n lilac-system
You should see the operator pod running:
NAME                                              READY   STATUS    RESTARTS   AGE
lilac-gpu-operator-lilac-gpu-operator-...         1/1     Running   0          30s

Verify Control Plane Connection

Check the operator logs to confirm it connected to the Lilac control plane:
kubectl logs -n lilac-system deploy/lilac-gpu-operator-lilac-gpu-operator
Look for a log line like:
INFO  control plane sync successful  cluster_id=abc123
Your cluster should also appear as Connected in the Lilac dashboard within 30 seconds.

Helm Values

ValueRequiredDefaultDescription
apiKeyYesSupplier API key from the Lilac dashboard
clusterNameYesHuman-readable name for your cluster
controlPlaneUrlNohttps://api.getlilac.comControl plane URL
disconnectTimeoutNo10mTime before cluster is marked disconnected
image.tagNoChart app versionOverride the operator image tag
resources.limitsNocpu: 500m, memory: 128MiResource limits for the operator pod
resources.requestsNocpu: 10m, memory: 64MiResource requests for the operator pod

Upgrading

helm upgrade lilac-gpu-operator \
  oci://public.ecr.aws/lilac/lilac-gpu-operator \
  --version 0.2.0 \
  --namespace lilac-system \
  --set apiKey="YOUR_SUPPLIER_API_KEY" \
  --set clusterName="my-gpu-cluster"

Uninstalling

helm uninstall lilac-gpu-operator --namespace lilac-system
Uninstalling the operator will drain all Lilac inference workloads from your cluster. Your own workloads are not affected.

Next Steps

Configure GPU Pools

Define which GPUs Lilac can use and when.

How the Operator Works

Understand the sync loop and architecture.