Costs
Nebius AI Cloud only charges for running the virtual machines that make up your GPU cluster. For more details, see the Compute pricing.Prerequisites
- Create a GPU cluster if you do not already have one.
- Create virtual machines and add them to the cluster.
Steps
Install Open MPI on each VM in the cluster
For each VM in the GPU cluster:- Get the VM’s private IP address.
- Connect to the VM through SSH.
-
Install the Open MPI library on the VM:
Build the tests on one of the VMs
Choose one of the VMs as the main VM – you will run the tests from it. Build the tests on the main VM:-
Clone the NVIDIA repository with the tests:
-
Build the tests with Open MPI:
-
Copy the built binary file,
all_reduce_perf, to the same directory on other VMs.
Set up SSH connectivity between the VMs in the cluster
-
On the main VM, generate an SSH key pair without a passphrase:
-
Copy the generated pair,
~/.ssh/id_ed25519and~/.ssh/id_ed25519.pub, to the same directory on each other VM. -
On all other VMs, add the public key from the pair to the list of authorized keys:
Run the tests
Run the tests from the main VM with thempirun command:
IP_address_[1-4]: IP address of the VM where you want to run the test.:8: Amount of GPUs on the VM.-mca pml ucx: Instruction for MPI communications to go through InfiniBand™ using UCX. To use Ethernet instead, replace the option with-mca btl_tcp_if_include eth0. This does not affect InfiniBand data exchanges of the test itself.~/nccl-tests/build/all_reduce_perf: A path to the binary file that should be available on all VMs.
How to delete the chargeable resources
The virtual machines that make up your GPU cluster are chargeable. If you do not need the VMs, delete them, so Nebius AI Cloud does not charge for them:- Web console
- CLI
- In the sidebar, go to
Compute → Virtual machines.
- Next to the virtual machine’s name, click
→ Delete.
- Enter the VM name and confirm deletion.
InfiniBand and InfiniBand Trade Association are registered trademarks of the InfiniBand Trade Association.