Name
nebius mk8s node-group create
Synopsis
Options
--auto-repair-conditions (json)
Conditions that determine whether a node should be auto repaired.
(structure)
disabled -> (bool)
When true, disables the default auto-repair condition rules.
Mutually exclusive with: timeout.
status -> (string)
Node condition status.
A value must be one of:
"condition_status_unspecified", "true", "false", "unknown"
timeout -> (duration)[google.protobuf.Duration]
The duration after which the node is automatically repaired if the condition remains in the specified status.
Mutually exclusive with: disabled.
type -> (string)[required]
Node condition type.
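As a rough illustration, a value for --auto-repair-conditions could be assembled as in the sketch below. It rests on several assumptions that this page does not confirm: that the flag takes a JSON array of the structures described above, that timeout uses the protobuf JSON form for durations (seconds with an "s" suffix), and that "Ready" is an accepted condition type. The helper name repair_condition is hypothetical.

```python
import json

ALLOWED_STATUS = {"condition_status_unspecified", "true", "false", "unknown"}

def repair_condition(type_, status=None, timeout_seconds=None, disabled=None):
    """Build one auto-repair condition, enforcing the documented constraints."""
    if not type_:
        raise ValueError("type is required")
    if disabled is not None and timeout_seconds is not None:
        raise ValueError("disabled and timeout are mutually exclusive")
    if status is not None and status not in ALLOWED_STATUS:
        raise ValueError(f"unsupported status: {status}")
    cond = {"type": type_}
    if status is not None:
        cond["status"] = status
    if timeout_seconds is not None:
        # Assumption: protobuf JSON mapping for google.protobuf.Duration, e.g. "600s".
        cond["timeout"] = f"{timeout_seconds}s"
    if disabled is not None:
        cond["disabled"] = disabled
    return cond

# Assumption: the flag value is a JSON array of condition objects.
conditions = [repair_condition("Ready", status="false", timeout_seconds=600)]
flag_value = json.dumps(conditions)
print(flag_value)
```

The resulting string would then be passed as the single argument of --auto-repair-conditions, quoted for the shell in use.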
--autoscaling-max-node-count (int64)
Mutually exclusive with: --fixed-node-count.
--autoscaling-min-node-count (int64)
Mutually exclusive with: --fixed-node-count.
--fixed-node-count (int64)
Number of nodes in the group. Can be changed manually at any time.
Mutually exclusive with: --autoscaling-max-node-count, --autoscaling-min-node-count.
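The mutual exclusion between the fixed and autoscaling node-count flags can be checked before the command is run. The following is a minimal sketch; the function name node_count_flags is hypothetical and the check only mirrors the rule stated above, not any additional validation the service may perform.

```python
def node_count_flags(fixed=None, autoscale_min=None, autoscale_max=None):
    """Return CLI arguments for either a fixed-size or an autoscaled group."""
    autoscaled = autoscale_min is not None or autoscale_max is not None
    if fixed is not None and autoscaled:
        raise ValueError("--fixed-node-count excludes the autoscaling flags")
    if fixed is not None:
        return ["--fixed-node-count", str(fixed)]
    args = []
    if autoscale_min is not None:
        args += ["--autoscaling-min-node-count", str(autoscale_min)]
    if autoscale_max is not None:
        args += ["--autoscaling-max-node-count", str(autoscale_max)]
    return args

print(node_count_flags(fixed=3))
print(node_count_flags(autoscale_min=1, autoscale_max=5))
```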
--labels (string->string)
Labels associated with the resource.
--name (string)
Human readable name for the resource.
--parent-id (string) [required]
Identifier of the parent resource to which the resource belongs.
--resource-version (int64)
Version of the resource for safe concurrent modifications and consistent reads.
A positive value that increases monotonically on each change to the resource spec (but not on changes to the
resource’s container(s) or status).
The service accepts either a zero value or the current version.
--strategy-drain-timeout (duration: 2h30m10s)
Maximum amount of time that the service will spend attempting to gracefully drain a node (evicting its pods) before
falling back to pod deletion.
By default, there is no time limit on draining a node.
An important consequence is that if a PodDisruptionBudget does not allow a pod to be evicted,
a NodeGroup update that re-creates nodes will hang on that pod eviction.
Note that this is different from kubectl drain --timeout.
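Duration flags on this page show 2h30m10s as their example value. Assuming the CLI accepts strings in that hours/minutes/seconds form (this page does not spell out the grammar), a number of seconds could be converted as in this sketch. The helper name format_duration is hypothetical.

```python
def format_duration(total_seconds):
    """Format whole seconds as an h/m/s string such as 2h30m10s."""
    if total_seconds < 0:
        raise ValueError("duration must be non-negative")
    hours, rest = divmod(total_seconds, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours}h{minutes}m{seconds}s"

print(format_duration(9010))  # 2h30m10s
print(format_duration(900))   # 0h15m0s
```

A value produced this way could be supplied to --strategy-drain-timeout to bound how long a drain may wait on a restrictive PodDisruptionBudget.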
--strategy-max-surge-count (int64)
Mutually exclusive with: --strategy-max-surge-percent.
--strategy-max-surge-percent (int64)
Mutually exclusive with: --strategy-max-surge-count.
--strategy-max-unavailable-count (int64)
Mutually exclusive with: --strategy-max-unavailable-percent.
--strategy-max-unavailable-percent (int64)
Mutually exclusive with: --strategy-max-unavailable-count.
--template-boot-disk-block-size-bytes (int64)
--template-boot-disk-size-bytes (int64)
Mutually exclusive with: --template-boot-disk-size-gibibytes, --template-boot-disk-size-kibibytes, --template-boot-disk-size-mebibytes.
--template-boot-disk-size-gibibytes (int64)
Mutually exclusive with: --template-boot-disk-size-bytes, --template-boot-disk-size-kibibytes, --template-boot-disk-size-mebibytes.
--template-boot-disk-size-kibibytes (int64)
Mutually exclusive with: --template-boot-disk-size-bytes, --template-boot-disk-size-gibibytes, --template-boot-disk-size-mebibytes.
--template-boot-disk-size-mebibytes (int64)
Mutually exclusive with: --template-boot-disk-size-bytes, --template-boot-disk-size-gibibytes, --template-boot-disk-size-kibibytes.
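Because only one of the four boot disk size flags may be given, a size known in bytes has to be expressed in a single unit. The sketch below picks the largest binary unit that represents the size exactly; the helper name boot_disk_size_flag is hypothetical, and the unit factors are the standard binary ones (1 KiB = 1024 bytes, and so on).

```python
UNIT_BYTES = {
    "gibibytes": 1024 ** 3,
    "mebibytes": 1024 ** 2,
    "kibibytes": 1024,
    "bytes": 1,
}

def boot_disk_size_flag(size_bytes):
    """Return exactly one size flag, using the largest unit that divides evenly."""
    if size_bytes <= 0:
        raise ValueError("size must be positive")
    # Dict order is largest unit first; "bytes" always divides, so the loop returns.
    for unit, factor in UNIT_BYTES.items():
        if size_bytes % factor == 0:
            return [f"--template-boot-disk-size-{unit}", str(size_bytes // factor)]

print(boot_disk_size_flag(64 * 1024 ** 3))    # 64 GiB
print(boot_disk_size_flag(1536 * 1024 ** 2))  # 1.5 GiB, expressed in MiB
```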
--template-boot-disk-type (string)
A value must be one of:
unspecified, network_ssd, network_hdd, network_ssd_io_m3, network_ssd_non_replicated
--template-cloud-init-user-data (string)
Cloud-init user-data
Should contain at least one SSH key.
--template-filesystems (json)
Static attachments of Compute Filesystem.
Can be used as a workaround until CSI for Compute Disk and Filesystem is available.
(structure)
attach_mode -> (string)[required]
A value must be one of:
"unspecified", "read_only", "read_write"
existing_filesystem -> (structure)
id -> (string)[required]
mount_tag -> (string)[required]
A user-defined identifier that can be used as the device in a mount command.
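A value for --template-filesystems might be built as sketched below. The assumptions are that the flag takes a JSON array of the structures described above, and that on the node the tag is mounted as a virtiofs device; neither is confirmed on this page. The tag "shared-data", the mount point, and the "<filesystem-id>" placeholder are illustrative only.

```python
import json

MOUNT_TAG = "shared-data"  # hypothetical tag chosen for this example

filesystems = [{
    "attach_mode": "read_write",
    "mount_tag": MOUNT_TAG,
    "existing_filesystem": {"id": "<filesystem-id>"},  # placeholder, not a real ID
}]
flag_value = json.dumps(filesystems)
print(flag_value)

# Assumption: the filesystem is exposed to the guest through virtiofs,
# so the tag takes the place of the device in the mount command.
mount_command = f"mount -t virtiofs {MOUNT_TAG} /mnt/{MOUNT_TAG}"
print(mount_command)
```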
--template-gpu-cluster-id (string)
--template-gpu-settings-drivers-preset (string)
Identifier of the predefined set of drivers included in the ComputeImage deployed on ComputeInstances that are part of the NodeGroup.
Supported presets for different platform / Kubernetes version combinations:
gpu-l40s-a, gpu-l40s-d, gpu-h100-sxm, gpu-h200-sxm:
  version: 1.30 → "cuda12" (CUDA 12.4)
  version: 1.31 → "cuda12" (CUDA 12.4), "cuda12.4", "cuda12.8"
gpu-b200-sxm:
  version: 1.31 → "cuda12" (CUDA 12.8), "cuda12.8"
gpu-b200-sxm-a:
  version: 1.31 → "cuda12.8"
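The preset list above can be transcribed into a small lookup table to check a platform / version pair before creating a node group. This is only a sketch of the list as printed here; the table and the function name supported_presets are hypothetical, and the service remains the authority on what it accepts.

```python
# Transcribed from the list above: platforms -> Kubernetes version -> presets.
DRIVER_PRESETS = {
    ("gpu-l40s-a", "gpu-l40s-d", "gpu-h100-sxm", "gpu-h200-sxm"): {
        "1.30": ["cuda12"],
        "1.31": ["cuda12", "cuda12.4", "cuda12.8"],
    },
    ("gpu-b200-sxm",): {"1.31": ["cuda12", "cuda12.8"]},
    ("gpu-b200-sxm-a",): {"1.31": ["cuda12.8"]},
}

def supported_presets(platform, version):
    """Return the driver presets listed for a platform and Kubernetes version."""
    for platforms, by_version in DRIVER_PRESETS.items():
        if platform in platforms:
            return by_version.get(version, [])
    return []

print(supported_presets("gpu-h100-sxm", "1.31"))  # ['cuda12', 'cuda12.4', 'cuda12.8']
print(supported_presets("gpu-b200-sxm-a", "1.30"))  # []
```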
--template-local-disks-config-none (bool)
None: “do nothing” - local disks will be provisioned as on a regular compute instance.
--template-local-disks-passthrough-group-requested (bool)
Passthrough local disks from the underlying host.
Devices are expected to appear in the guest as NVMe devices (nvme0, nvme1, …),
but the exact number depends on the preset.
Enabled only when this field is explicitly set.
--template-metadata-labels (string->string)
Kubernetes Node labels.
Keys and values must follow Kubernetes label syntax:
https://kubernetes.io/docs/concepts/overview/working-with-objects/labels/
For now, changes are not propagated to existing nodes; they apply only to Kubernetes Nodes created after the field change.
This behavior may change later.
Until then, set the labels on existing nodes manually if needed.
System labels containing “kubernetes.io” and “k8s.io” will be ignored.
Changing this field will NOT trigger a NodeGroup rollout.
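To see in advance which labels would be dropped, the system-label rule can be approximated locally. The sketch below assumes the check is a substring match on the label key, which this page does not state precisely; the function name split_node_labels and the sample labels are hypothetical.

```python
SYSTEM_MARKERS = ("kubernetes.io", "k8s.io")

def split_node_labels(labels):
    """Separate labels expected to be kept from those expected to be ignored."""
    kept, ignored = {}, {}
    for key, value in labels.items():
        # Assumption: a label counts as a system label if its key contains a marker.
        target = ignored if any(marker in key for marker in SYSTEM_MARKERS) else kept
        target[key] = value
    return kept, ignored

kept, ignored = split_node_labels({
    "team": "ml",
    "node.kubernetes.io/instance-type": "custom",
})
print(kept)     # {'team': 'ml'}
print(ignored)  # {'node.kubernetes.io/instance-type': 'custom'}
```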
--template-network-interfaces (json)
(structure)
public_ip_address -> (structure)[meaningful_empty_value]
Parameters for Public IPv4 address associated with the interface.
Set to an empty value to enable it.
subnet_id -> (string)[non_empty_default]
Nebius VPC Subnet ID that will be attached to a node cloud instance network interface.
By default, the cluster control plane subnet_id is used.
The subnet should be located in the same network as the control plane.
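The "meaningful empty value" of public_ip_address is easiest to see in JSON: an empty object enables the public address, while leaving the key out does not. The sketch below assumes the flag takes a JSON array of interface structures; the helper name network_interface is hypothetical.

```python
import json

def network_interface(subnet_id=None, public_ip=False):
    """Build one interface entry; omit subnet_id to fall back to the default subnet."""
    iface = {}
    if subnet_id is not None:
        iface["subnet_id"] = subnet_id
    if public_ip:
        # An empty object is the "meaningful empty value" that enables a public IPv4 address.
        iface["public_ip_address"] = {}
    return iface

flag_value = json.dumps([network_interface(public_ip=True)])
print(flag_value)  # [{"public_ip_address": {}}]
```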
--template-os (string)
OS version that will be used to create the boot disk of Compute Instances in the NodeGroup.
Supported platform / Kubernetes version / OS / driver preset combinations:
gpu-l40s-a, gpu-l40s-d, gpu-h100-sxm, gpu-h200-sxm, cpu-e1, cpu-e2, cpu-d3:
  drivers_preset: ""
    version: 1.30 → "ubuntu22.04"
    version: 1.31 → "ubuntu22.04" (default), "ubuntu24.04"
gpu-l40s-a, gpu-l40s-d, gpu-h100-sxm, gpu-h200-sxm:
  drivers_preset: "cuda12" (CUDA 12.4)
    version: 1.30, 1.31 → "ubuntu22.04"
  drivers_preset: "cuda12.4"
    version: 1.31 → "ubuntu22.04"
  drivers_preset: "cuda12.8"
    version: 1.31 → "ubuntu24.04"
gpu-b200-sxm:
  drivers_preset: ""
    version: 1.30, 1.31 → "ubuntu24.04"
  drivers_preset: "cuda12" (CUDA 12.8)
    version: 1.30, 1.31 → "ubuntu24.04"
  drivers_preset: "cuda12.8"
    version: 1.31 → "ubuntu24.04"
gpu-b200-sxm-a:
  drivers_preset: ""
    version: 1.31 → "ubuntu24.04"
  drivers_preset: "cuda12.8"
    version: 1.31 → "ubuntu24.04"
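The combinations above can also be written as a lookup table, which makes it straightforward to check a platform, driver preset, and Kubernetes version before choosing --template-os. This is a sketch transcribed from the list as printed on this page; the names OS_MATRIX and supported_os are hypothetical.

```python
GPU = ("gpu-l40s-a", "gpu-l40s-d", "gpu-h100-sxm", "gpu-h200-sxm")
CPU = ("cpu-e1", "cpu-e2", "cpu-d3")
U22, U24 = "ubuntu22.04", "ubuntu24.04"

# Each row: (platforms, drivers_preset, {Kubernetes version: [OS values]}).
OS_MATRIX = [
    (GPU + CPU, "", {"1.30": [U22], "1.31": [U22, U24]}),
    (GPU, "cuda12", {"1.30": [U22], "1.31": [U22]}),
    (GPU, "cuda12.4", {"1.31": [U22]}),
    (GPU, "cuda12.8", {"1.31": [U24]}),
    (("gpu-b200-sxm",), "", {"1.30": [U24], "1.31": [U24]}),
    (("gpu-b200-sxm",), "cuda12", {"1.30": [U24], "1.31": [U24]}),
    (("gpu-b200-sxm",), "cuda12.8", {"1.31": [U24]}),
    (("gpu-b200-sxm-a",), "", {"1.31": [U24]}),
    (("gpu-b200-sxm-a",), "cuda12.8", {"1.31": [U24]}),
]

def supported_os(platform, drivers_preset, version):
    """Return the OS values listed for a platform, driver preset, and version."""
    for platforms, preset, by_version in OS_MATRIX:
        if platform in platforms and preset == drivers_preset:
            return by_version.get(version, [])
    return []

print(supported_os("cpu-e2", "", "1.31"))                # ['ubuntu22.04', 'ubuntu24.04']
print(supported_os("gpu-h100-sxm", "cuda12.8", "1.31"))  # ['ubuntu24.04']
```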
--template-preemptible (bool)
Configures whether the nodes in the group are preemptible.
Set to empty value to enable preemptible nodes.
--template-reservation-policy-policy (string)
A value must be one of:
auto, forbid, strict
--template-reservation-policy-reservation-ids (string array)
Capacity block groups, order matters.
--template-resources-platform (string) [required]
--template-resources-preset (string) [required]
--template-service-account-id (string)
The Nebius service account whose credentials will be available on the nodes of the group.
With these credentials, it is possible to make nebius CLI or public API requests from the nodes
without the need for extra authentication.
This service account is also used to make requests to the container registry.
The resource.serviceaccount.issueAccessToken permission is required to use this field.
--template-taints (json)
Kubernetes Node taints.
For now, changes are not propagated to existing nodes; they apply only to Kubernetes Nodes created after the field change.
This behavior may change later.
Until then, set the taints on existing nodes manually if needed.
Changing this field will NOT trigger a NodeGroup rollout.
(structure)
effect -> (string)[required]
A value must be one of:
"effect_unspecified", "no_execute", "no_schedule", "prefer_no_schedule"
key -> (string)[required]
value -> (string)[required]
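A value for --template-taints could be put together as in the following sketch. It assumes the flag takes a JSON array of taint structures and treats "required" as "must be non-empty", neither of which this page confirms. The helper name taint and the example key and value are hypothetical.

```python
import json

ALLOWED_EFFECTS = {"effect_unspecified", "no_execute", "no_schedule", "prefer_no_schedule"}

def taint(key, value, effect):
    """Build one taint entry; all three fields are marked required above."""
    if not key or not value or not effect:
        raise ValueError("key, value and effect are all required")
    if effect not in ALLOWED_EFFECTS:
        raise ValueError(f"unsupported effect: {effect}")
    return {"key": key, "value": value, "effect": effect}

# Assumption: the flag value is a JSON array of taint objects.
flag_value = json.dumps([taint("dedicated", "gpu", "no_schedule")])
print(flag_value)
```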
--version (string)
Desired Kubernetes version of the cluster. For now, the only acceptable format is
<major>.<minor>, such as “1.31”. An option for patch version updates will be added later.
By default, the cluster control plane <major>.<minor> version is used.
--async (bool)
If set, returns operation id. Otherwise, waits for the operation to complete and returns its resource.
-i, --interactive (bool)
If set, prompts for field values in interactive mode.
Global Options
-h, --help (bool)
Show this message.
-p, --profile (string)
Set a profile for interacting with the cloud.
--format (string)
Output format. Supported values: yaml|json|jsonpath|table|text.
-f, --file (string)
Input file. For ‘update’ commands, automatically sets --full=true.
-c, --config (string)
Provide path to config file.
--debug (bool)
Enable debug logs.
--color (bool)
Enable colored output.
--no-browser (bool)
Do not open browser automatically on auth.
--insecure (bool)
Disable transport security.
--auth-timeout (duration: 2h30m10s)
Set the timeout for the request, including the authentication process. Default is 15m0s.
--per-retry-timeout (duration: 2h30m10s)
Set the timeout for each retry attempt. Default is 20s.
--retries (uint)
Set the number of retry attempts; a value of 1 disables retries. Default is 3.
--timeout (duration: 2h30m10s)
Set the timeout for the main request. Default is 1m0s.
--no-check-update (bool)
Suppress check for updates.