Prerequisites
-
Connect to your cluster. You should see the SSH welcome message. For example:
-
Get the command to open monitoring dashboards from the instructions in the SSH welcome message. In the example above, it is
ssh -L 3000:metrics-grafana.monitoring-system.svc:80 -N <USER>@<LOGIN_IP>. The URL for your cluster might be different.
How to view metrics in Grafana
-
On your local machine, run the command to open monitoring dashboards that you got from the SSH welcome message. For example:
In this command, specify the
usernameandpublic_IP_addressthat you use to connect to the cluster. Optionally, change port3000if it is already in use on your local machine. -
Open
localhost:3000(orlocalhost:<port>) in your browser. - In the sidebar, select Dashboards. Review the metrics on these dashboards. For example, you can see the metrics of Slurm jobs and resource allocations.
How to view metrics for worker nodes
The nodes of your Soperator cluster are Compute virtual machines. You can view their metrics on Monitoring dashboards in the web console. To find out the ID of the virtual machine for a worker node:- Connect to a login node of your Soperator cluster.
-
Run the following command:
Output example:Get the link from the
monitoringparameter. - Open the link in your browser. There, you can view the dashboards for the virtual machine that runs the worker node.
The Grafana Labs Marks are trademarks of Grafana Labs, and are used with Grafana Labs’ permission. We are not affiliated with, endorsed or sponsored by Grafana Labs or its affiliates.