Moving workload from the existing node group

The workload is moved automatically after you evict it from the existing node group. Before moving your workload, you need to:

Install jq if you don’t have it on your system. For example, on Debian or Ubuntu:
sudo apt-get install jq
Get your cluster ID, which is returned in the .metadata.id field of the cluster resource.
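
The prerequisites can be checked with a short preflight script. This is a minimal sketch, not part of the nebius CLI or kubectl: `require_tools` and `require_cluster_id` are hypothetical helper names, and the sketch assumes the cluster ID is exported as K8S_CLUSTER_ID, as the commands in this guide expect.

```shell
# Hypothetical helper: fail if any of the named commands is not on PATH.
require_tools() {
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "Missing required tool: $tool" >&2
      return 1
    fi
  done
}

# Hypothetical helper: fail if the cluster ID has not been exported.
require_cluster_id() {
  if [ -z "${K8S_CLUSTER_ID:-}" ]; then
    echo "K8S_CLUSTER_ID is not set" >&2
    return 1
  fi
}
```

For example, `require_tools jq kubectl nebius && require_cluster_id` returns a non-zero status and prints the first missing prerequisite.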

The CLI commands in this guide assume that the cluster ID is saved to the K8S_CLUSTER_ID environment variable.

To move your workload:

Create a node group.

For example, the following command creates a group of two nodes, each with one NVIDIA H100 GPU, and all drivers and components required for the GPU:

nebius mk8s node-group create \
  --parent-id $K8S_CLUSTER_ID \
  --name mk8s-node-group-test \
  --fixed-node-count 2 \
  --template-resources-platform gpu-h100-sxm \
  --template-resources-preset 1gpu-16vcpu-200gb \
  --template-gpu-settings-drivers-preset cuda12
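
Before evicting anything, it may help to confirm that the new nodes have joined the cluster. The following is a rough sketch under the assumption that the second column of `kubectl get nodes --no-headers` is the node status; `count_ready_nodes` is a hypothetical helper name, and it counts Ready nodes across the whole cluster rather than in one node group.

```shell
# Hypothetical helper: count nodes whose STATUS column starts with "Ready".
# "Ready,SchedulingDisabled" is counted; "NotReady" is not.
count_ready_nodes() {
  kubectl get nodes --no-headers \
    | awk '$2 ~ /^Ready/ { n++ } END { print n + 0 }'
}
```

Running `count_ready_nodes` before and after creating the node group should show the count grow by the number of nodes you requested, two in the example above.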

Save the ID of the node group from which you want to move the workload to an environment variable. For example, you can get the ID by the node group’s name (if you have set one and know it) and the parent cluster’s ID:
```
export K8S_NODE_GROUP_ID=$(nebius mk8s node-group get-by-name \
  --parent-id $K8S_CLUSTER_ID \
  --name node-group-name \
  --format jsonpath='{.metadata.id}')
```

Get the list of node names in the node group from which you want to move the workload:

export OLD_NODES=$(kubectl get nodes -o json \
  | jq -r --arg group "$K8S_NODE_GROUP_ID" '.items[].metadata
    | select(.annotations."cluster.x-k8s.io/owner-name" == $group)
    | .name')
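
If the node group ID is wrong, the list comes back empty and the loops in the following steps silently do nothing. A small guard can catch that early. This is a sketch only; `check_old_nodes` is a hypothetical helper name, and it assumes node names contain no whitespace.

```shell
# Hypothetical helper: abort if the node list is empty, otherwise report its size.
check_old_nodes() {
  if [ -z "${1:-}" ]; then
    echo "No nodes found; check K8S_NODE_GROUP_ID" >&2
    return 1
  fi
  # Unquoted on purpose: split the list on spaces and newlines.
  set -- $1
  echo "Found $# node(s) to drain"
}
```

Call it as `check_old_nodes "$OLD_NODES"` before cordoning.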

Cordon off the old nodes so that no new Pods are scheduled on them:

for node in $OLD_NODES; do
  kubectl cordon "$node"
done
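
To confirm that the cordon took effect, you can read the `.spec.unschedulable` field of each node, which is `true` on a cordoned node and empty otherwise. The sketch below is illustrative; `all_cordoned` is a hypothetical helper name.

```shell
# Hypothetical helper: succeed only if every node passed as an argument is cordoned.
all_cordoned() {
  for node in "$@"; do
    # Prints "true" for a cordoned node and nothing for a schedulable one.
    state=$(kubectl get node "$node" -o jsonpath='{.spec.unschedulable}') || return 1
    if [ "$state" != "true" ]; then
      echo "Node $node is still schedulable" >&2
      return 1
    fi
  done
}
```

Call it as `all_cordoned $OLD_NODES` (unquoted, so that each node name becomes a separate argument).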

Drain the old nodes so that the existing Pods can be evicted from them:
```
for node in $OLD_NODES; do
  kubectl drain --force --ignore-daemonsets --delete-emptydir-data "$node"
done
```
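Kubernetes automatically recreates evicted Pods that are managed by a controller (for example, a Deployment or StatefulSet) on suitable nodes. Pods that are not managed by a controller are deleted because of the --force flag and are not recreated.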
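
Before deleting the node group, you may want to check what is still running on the old nodes. After a successful drain, only DaemonSet-managed Pods should remain. The following is a minimal sketch that uses the `spec.nodeName` field selector; `pods_on_node` is a hypothetical helper name.

```shell
# Hypothetical helper: list Pods in all namespaces that are bound to the given node.
pods_on_node() {
  kubectl get pods --all-namespaces \
    --field-selector "spec.nodeName=$1" -o wide
}
```

For example, `for node in $OLD_NODES; do pods_on_node "$node"; done` prints the remaining Pods for each old node.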

Delete the old node group:

nebius mk8s node-group delete --id $K8S_NODE_GROUP_ID