Skip to main content
To reserve GPUs for virtual machines (VMs), you can use capacity block groups. They reserve a certain number of GPUs in a given region for a GPU platform that you choose. The GPUs are assigned to an InfiniBand™ fabric. To reserve GPUs by using capacity block groups, send a request to your Nebius manager. They create capacity block groups for you and prepare an addendum for commitment discounts billing. If you are not in contact with a Nebius manager, you can ask technical support to connect you with one. Once capacity block groups are prepared, you can add them to your VMs or your node groups in Managed Service for Kubernetes®. After that, you can examine information about the capacity block groups on the Limits page in the web console.
GPUs allocated from capacity block groups do not count towards quotas on the number of GPUs.

Intervals in capacity block groups

Every capacity block group consists of intervals. Each interval represents a period for which a specific number of GPUs is reserved. Different intervals within the same capacity block group can overlap, go consequently or have gaps between. For example, there can be the following intervals:
  1. 60 GPUs reserved from January 1 until January 31.
  2. 72 GPUs reserved from January 26 until February 15.
  3. 150 GPUs reserved from February 20 until March 31.
Intervals within a single capacity block group can only be based on the same GPU platform and the same InfiniBand fabric.

List of capacity block groups

If you go to the Limits page and open the Capacity block groups tab, you see a list of your capacity block groups. It shows the following:
  • Region that the capacity block group belongs to.
  • InfiniBand fabric of reserved GPUs.
  • GPU platform.
  • Status:
    • Active: The capacity block group is available for usage; it does not expire soon.
    • Ending in N days: The capacity block group contains one active interval remaining, which soon expires. Consider using an active capacity block group instead.
    • Inactive: The capacity block group does not have any active intervals.
  • Next interval. You can find out when the next interval starts and to what number of GPUs the capacity block group switches. If the last interval is currently running, the page shows No renewal instead of the next interval.
  • Number of GPUs consumed out of the total number of reserved GPUs. The total number is calculated based on the active intervals within the capacity block group.

Dashboard of a capacity block group

To open the dashboard that contains detailed information about a given capacity block group:
  1. In the web console, go to https://mintcdn.com/nebius-ai-cloud/1Ha0sWR6e1mnIaHS/_assets/sidebar/administration.svg?fit=max&auto=format&n=1Ha0sWR6e1mnIaHS&q=85&s=e6411dc023fd6972922c0a12a59ccf21 Administration → Limits.
  2. Open the Capacity block groups tab.
  3. Click on the capacity block group.
On the Intervals tab of the dashboard, you can get the following information about intervals that the capacity block group includes:
  • ID of an interval.
  • Status of an interval:
    • Active: Interval is available for usage.
    • Pending: Interval will start in the future.
    • Expired: Interval has expired. Instead of it, a new active interval should apply.
  • Period when an interval is active.
  • Number of GPUs that an interval reserves.
This information is displayed as a graph and as a table. You can select the period for which the information should be displayed. On the Usage tab of the dashboard, you can determine what VMs consume the current capacity block group. These VMs are grouped in projects. If you click on the project line, the dashboard displays what VMs consume capacity and how many GPUs they consume. As a result, the dashboard provides a detailed breakdown by VMs and projects. Some consumption can be marked as Forbidden on the Usage tab. These are projects that you don’t have access to.

See also


InfiniBand and InfiniBand Trade Association are registered trademarks of the InfiniBand Trade Association.