Documentation Index
Fetch the complete documentation index at: https://docs.nebius.com/llms.txt
Use this file to discover all available pages before exploring further.
To work with MLflow in Nebius AI Cloud, you need to create a Managed Service for MLflow cluster.
Each medium cluster includes 8 vCPUs, 32 GiB RAM and a 200 GiB Network SSD disk.
To create a cluster:
Web console
CLI
Terraform
-
In the sidebar, go to
AI Services → MLflow.
-
Click the
Create cluster button.
-
On the page that opens, enter the cluster name and optional description.
-
Select the
mlflow-sa service account. It was created for Managed MLflow by default after you signed up for Nebius AI Cloud.
-
In the Access field, select configuration for tracking endpoints:
- Public and private: The cluster has both public and private tracking endpoints. You can access the public endpoint from the internet and the private endpoint from the network where the cluster is located (for example, via a virtual machine in this network).
- Private only: The cluster only has a private tracking endpoint.
URIs of public and private tracking endpoints start with public-tracking and private-tracking, respectively.
-
Select a network where the cluster should be located.
-
Select a
medium cluster size.
Each medium cluster includes 8 vCPUs, 32 GiB RAM and a 200 GiB Network SSD disk.
-
In the Bucket for artifacts block, select a bucket to store logs and metrics of machine learning experiments launched in the cluster.
Alternatively, select the Auto option to create the bucket automatically.
This bucket is not deleted automatically after you delete the cluster. You are still charged for the bucket until you delete it.
-
In the Admin credentials block, specify the username and password of the cluster administrator.
The password must be between 8 and 64 characters long and must contain at least the following:
- One uppercase letter
- One lowercase letter
- One digit
- One special character from -!@#$^&*_=+:;’”|/?,.~§±()[]<>`.
-
Click Create cluster.
-
Install and configure Nebius AI Cloud CLI.
-
Create a password for administrator access to the cluster and save it to an environment variable:
export PASSWORD=<password>
The password must be between 8 and 64 characters long and must contain at least the following:
- One uppercase letter
- One lowercase letter
- One digit
- One special character from -!@#$^&*_=+:;’”|/?,.~§±()[]<>`.
-
Get the ID of the
mlflow-sa service account. It was created by default after you signed up for Nebius AI Cloud. Save the ID to an environment variable:
export SA_ID=$(nebius iam service-account get-by-name \
--name mlflow-sa \
--format json | jq -r '.metadata.id')
-
Get the ID of the default network and save it to an environment variable:
export NETWORK_ID=$(nebius vpc network get-by-name \
--name default-network \
--format json | jq -r '.metadata.id')
-
Create a cluster:
nebius msp mlflow v1alpha1 cluster create \
--name <cluster_name> \
--description "<cluster_description>" \
--network-id $NETWORK_ID \
--size medium \
--public-access=<true|false> \
--admin-username <username> \
--admin-password $PASSWORD \
--service-account-id $SA_ID \
--storage-bucket-name <bucket_name>
The command contains the following parameters:
-
--name: The cluster name.
-
--description: The cluster description.
-
--network-id: The network ID.
-
--size: The size of resources allocated to the cluster. Only medium size is available.
Each medium cluster includes 8 vCPUs, 32 GiB RAM and a 200 GiB Network SSD disk.
-
--public-access: Configuration for tracking endpoints:
- If
true, the cluster has both public and private tracking endpoints. You can access the public endpoint from the internet and the private endpoint from the network where the cluster is located (for example, via a virtual machine in this network).
- If
false, the cluster only has a private tracking endpoint.
URIs of public and private tracking endpoints start with public-tracking and private-tracking, respectively.
-
--admin-username: The name of the cluster administrator, for example admin.
-
--admin-password: The administrator password that you set earlier.
-
--service-account-id: The ID of the mlflow-sa service account.
-
--storage-bucket-name: The name of the bucket to store logs and metrics from machine learning experiments launched in the cluster. If you do not specify this parameter, a bucket is created automatically.
This bucket is not deleted automatically after you delete the cluster. You are still charged for the bucket until you delete it.
-
Install and configure the Nebius AI Cloud provider for Terraform.
-
Create the following configuration file:
resource "nebius_msp_mlflow_v1alpha1_cluster" "<cluster_name>" {
name = "<cluster_name>"
description = "<cluster_description>"
network_id = "<network_ID>"
size = "medium"
parent_id = "<project_ID>"
public_access = <true|false>
admin_username = "<username>"
admin_password = "<password>"
service_account_id = "<service_account_ID>"
storage_bucket_name = "<bucket>"
}
The file contains the following parameters:
-
name: The cluster name.
-
description: The cluster description.
-
network_id: The network ID.
-
--size: The size of resources allocated to the cluster. Only medium size is available.
Each medium cluster includes 8 vCPUs, 32 GiB RAM and a 200 GiB Network SSD disk.
-
parent_id: Project ID.
-
public_access: Configuration for tracking endpoints:
- If
true, the cluster has both public and private tracking endpoints. You can access the public endpoint from the internet and the private endpoint from the network where the cluster is located (for example, via a virtual machine in this network).
- If
false, the cluster only has a private tracking endpoint.
URIs of public and private tracking endpoints start with public-tracking and private-tracking, respectively.
-
admin_username: The name of the cluster administrator, for example admin.
-
admin_password: The administrator password that you set earlier.
The password must be between 8 and 64 characters long and must contain at least the following:
- One uppercase letter
- One lowercase letter
- One digit
- One special character from -!@#$^&*_=+:;’”|/?,.~§±()[]<>`.
-
service_account_id: The ID of the mlflow-sa service account. It was created by default after you signed up for Nebius AI Cloud.
-
storage_bucket_name: The name of the bucket to store logs and metrics from machine learning experiments launched in the cluster. If you do not specify this parameter, a bucket is created automatically.
This bucket is not deleted automatically after you delete the cluster. You are still charged for the bucket until you delete it.
-
Check that the configuration is correct:
-
Apply the changes: