To reserve GPUs for virtual machines (VMs), you can use capacity block groups. They reserve a certain number of GPUs in a given region for a GPU platform that you choose. The GPUs are assigned to an InfiniBand™ fabric. To reserve GPUs by using capacity block groups, send a request to your Nebius manager. They create capacity block groups for you and prepare an addendum for commitment discounts billing. If you are not in contact with a Nebius manager, you can ask technical support to connect you with one. Once capacity block groups are prepared, you can add them to your VMs or your node groups in Managed Service for Kubernetes®. After that, you can examine information about the capacity block groups on the Limits page in the web console.Documentation Index
Fetch the complete documentation index at: https://docs.nebius.com/llms.txt
Use this file to discover all available pages before exploring further.
GPUs allocated from capacity block groups do not count towards quotas on the number of GPUs.
Intervals in capacity block groups
Every capacity block group consists of intervals. Each interval represents a period for which a specific number of GPUs is reserved. Different intervals within the same capacity block group can overlap, go consequently or have gaps between. For example, there can be the following intervals:- 60 GPUs reserved from January 1 until January 31.
- 72 GPUs reserved from January 26 until February 15.
- 150 GPUs reserved from February 20 until March 31.
List of capacity block groups
If you go to the Limits page and open the Capacity block groups tab, you see a list of your capacity block groups. It shows the following:- Region that the capacity block group belongs to.
- InfiniBand fabric of reserved GPUs.
- GPU platform.
-
Status:
- Active: The capacity block group is available for usage; it does not expire soon.
- Ending in N days: The capacity block group contains one active interval remaining, which soon expires. Consider using an active capacity block group instead.
- Inactive: The capacity block group does not have any active intervals.
- Next interval. You can find out when the next interval starts and to what number of GPUs the capacity block group switches. If the last interval is currently running, the page shows No renewal instead of the next interval.
- Number of GPUs consumed out of the total number of reserved GPUs. The total number is calculated based on the active intervals within the capacity block group.
Dashboard of a capacity block group
To open the dashboard that contains detailed information about a given capacity block group:- In the web console, go to
Administration → Limits.
- Open the Capacity block groups tab.
- Click on the capacity block group.
- ID of an interval.
-
Status of an interval:
- Active: Interval is available for usage.
- Pending: Interval will start in the future.
- Expired: Interval has expired. Instead of it, a new active interval should apply.
- Period when an interval is active.
- Number of GPUs that an interval reserves.
See also
- Capacity reservations for Compute virtual machines
- Capacity reservations for Managed Service for Kubernetes® node groups
InfiniBand and InfiniBand Trade Association are registered trademarks of the InfiniBand Trade Association.