Nebius AI Cloud’s capacity advisor provides insights into GPU capacity availability for launching virtual machines (VMs) with specific hardware presets. It helps you understand where you can launch VMs based on your quotas and the current physical capacity in Nebius AI Cloud regions.Documentation Index
Fetch the complete documentation index at: https://docs.nebius.com/llms.txt
Use this file to discover all available pages before exploring further.
Scope
The capacity advisor provides data for the following virtual machines with GPUs:-
By service:
- VMs that you create directly in Compute
- Managed Soperator nodes
- Managed Kubernetes® nodes
- VMs launched by Serverless AI for running jobs and endpoints
- By type:
Prerequisites
- Web console
- CLI
How to get data from the capacity advisor
- Web console
- CLI
In the sidebar, go to
Capacity dashboard under Manage.For each VM platform, preset, region, InfiniBand™ fabric and VM type (regular, preemptible or with reservations), the capacity advisor shows:
- How many VMs you can launch, based on your quotas and the current physical capacity (for regular and preemptible VMs) or on your active capacity reservations (for VMs with reservations).
-
How likely you are to be able to launch the displayed number of regular or preemptible VMs.
For example, if your quota for a certain kind of VMs is 10 VMs, you have a high chance of launching them all when the overall capacity is in the hundreds, but a low chance when the capacity is less than 10 VMs.
As reservations guarantee you a certain number of GPUs that aren’t subject to quotas, the chance of launch isn’t displayed for VMs with reservations.