Skip to main content

Name

nebius compute node set-unhealthy

SetUnhealthy marks the node underlying the Compute VM as unhealthy, which has the following effect:

  1. Scheduler makes the best effort not to assign new VMs to the unhealthy node,
    but in case of no capacity, the VM can be assigned to an unhealthy node.
  2. The existing VMs continue to work on the node, but after stop/start via
    Compute API they most probably will be assigned to a different node.

To use this rpc one needs to obtain compute.node.setUnhealthy permission
for the VM’s parent container. The permission is granted to the TSA inside the VM.

Synopsis

nebius compute node set-unhealthy
  --dry-run
  --health-check-info-check-id [required]
  --health-check-info-description [required]
  --health-check-info-observed-at [required]
  --instance-id [required]

Options

--dry-run (bool)

If set to true, the call performs the authorization and limit checks, but does not mark node unhealthy.
False by default.

--health-check-info-check-id (string) [required]

Identifies specific GPU check that failed in soperator (key for observability).

--health-check-info-description (string) [required]

Human-readable description of the error for further investigation.

--health-check-info-observed-at (timestamp: 1970-01-31T02:30:59Z) [required]

Time when the unhealthy node was observed.

--instance-id (string) [required]

Global Options

-h, --help (bool)

Show this message.

-p, --profile (string)

Set a profile for interacting with the cloud.

--format (string)

Output format. Supported values: yaml|json|jsonpath|table|text.

-f, --file (string)

Input file. For ‘update’ commands automatically set —full=true.

-c, --config (string)

Provide path to config file.

--debug (bool)

Enable debug logs.

--color (bool)

Enable colored output.

--no-browser (bool)

Do not open browser automatically on auth.

--insecure (bool)

Disable transport security.

--auth-timeout (duration: 2h30m10s)

Set the timeout for the request including authentication process, default is 15m0s.

--per-retry-timeout (duration: 2h30m10s)

Set the timeout for each retry attempt, default is 20s.

--retries (uint)

Set the number of retry attempts, 1 is disable retries, default is 3.

--timeout (duration: 2h30m10s)

Set the timeout for the main request, default is 1m0s.

--no-check-update (bool)

Suppress check for updates.