Documentation Index
Fetch the complete documentation index at: https://docs.nebius.com/llms.txt
Use this file to discover all available pages before exploring further.
You can use Serverless AI jobs to work with AI models and perform such operations as fine-tuning, scientific simulations, data processing or batch inference.
To get started with jobs, use the example below. It demonstrates how to run nvidia-smi in a Serverless AI job. As a result, you receive information about the GPUs of the container over a virtual machine in which the job is running. In the web console, Quick start Serverless job runs the same kind of workload with settings already filled in.
Prerequisites
The preparations for this guide depend on your preferred interface.
-
Make sure you are in a group that has the
admin role within your tenant; for example, the default admins group. You can check this in the Administration → IAM section of the web console.
-
On the Administration → Limits → Quotas page of the web console, check the Number of virtual machines (VMs) quota, under Compute: it should have at least one VM available. If necessary, increase the quota.
-
Make sure you are in a group that has the
admin role within your tenant; for example, the default admins group. You can check this in the Administration → IAM section of the web console.
-
On the Administration → Limits → Quotas page of the web console, check the Number of virtual machines (VMs) quota, under Compute: it should have at least one VM available. If necessary, increase the quota.
-
Install and configure the Nebius AI Cloud CLI.
Check that your project ID is saved in the Nebius AI Cloud CLI profile configuration:
cat ~/.nebius/config.yaml
Steps
Run a Serverless AI job
- In the sidebar, go to
AI Services → Jobs.
- Click Quick start Serverless job. The flow uses a verified container image and prefilled job settings.
- Click Create job.
-
Get the ID of the default subnet where you create a job:
nebius vpc subnet get-by-name --name default-subnet \
--format jsonpath='{.metadata.id}'
-
Create a job:
nebius ai job create \
--name my-job \
--image nvidia/cuda:13.1.1-runtime-ubuntu24.04 \
--container-command bash \
--args "-c nvidia-smi" \
--platform gpu-l40s-a \
--preset 1gpu-8vcpu-32gb \
--timeout 1h \
--subnet-id <subnet_ID>
The command contains the following parameters:
--name: Job name.
--image: Container image to run. In the given example, the nvidia/cuda:13.1.1-runtime-ubuntu24.04 is used. It contains the Ubuntu 24.04 operating system as well as drivers and other components for GPUs.
--container-command: Terminal shell for commands to run.
--args: Arguments for docker run to pass to the entrypoint command. The specified arguments require the container to run nvidia-smi.
--platform: VM platform for the job. As Serverless AI jobs are based on containers over VMs, every job uses Compute platforms and presets.
--preset: Number of GPUs, vCPUs and RAM allocated to the container. The preset must match the selected platform.
--timeout: Duration after which the job is canceled if it has not completed.
--subnet-id: Subnet ID.
The job takes several minutes to complete. While the job is running, you can check its status and logs.
Check the job results
- In the sidebar, go to
AI Services → Jobs.
- Next to the job, click View logs. Alternatively, select the job that you want to view the logs for and switch to the Logs tab.
-
Save the job ID to an environment variable:
export JOB_ID=$(nebius ai job get-by-name --name my-job \
--format jsonpath='{.metadata.id}')
-
View information about the job:
nebius ai job get $JOB_ID
The command output contains such data as the job status, start and finish times.
-
If the output doesn’t indicate that the job has started, wait a few seconds. Next, run the previous command again and check that the job is running now.
-
When the job has started, check its logs:
nebius ai job logs $JOB_ID
The logs contain information about the GPUs that nvidia-smi provides (in other words, the output of the nvidia-smi command).
You can run nebius ai job get $JOB_ID and nebius ai job logs $JOB_ID multiple times to determine whether the job is complete, and the full output of nvidia-smi is returned in logs.
(Optional) Delete the job
When the job is complete, Serverless AI automatically releases all allocated computing resources and the container disk of the job. When you delete a complete job, you remove it from the list of Serverless AI jobs.
If you delete a running job, all the resources are released as well.
- In the sidebar, go to
AI Services → Jobs.
- Locate the job and then click
→ Delete.
- In the window that opens, confirm deleting the job.
To delete the job, run the following command:nebius ai job delete $JOB_ID