Skip to main content
This article provides pricing information for applications in Nebius AI Cloud. Pricing depends on the deployment option you choose.

How charges and prices work

Each group of chargeable items in this article has two time units associated with it:
  • Billing unit: The minimum unit of usage for which you can be charged.
  • Pricing unit: The unit of usage for which the prices are shown.
Charges for units smaller than the pricing unit are calculated proportionally.
For example, for GPUs on running VMs, the billing unit is 1 second, and the pricing unit is 1 hour (3600 seconds). For 30 minutes of usage, you will be charged half the hourly price.
Prices in US dollars (USD, $) apply to all customers except for companies from Israel, where prices in Israeli shekels (ILS, ₪) apply instead. All prices are shown without any applicable taxes, including VAT. Due to rounding errors, usage costs shown in the web console and final charges may slightly differ from calculations based on the prices in this article.

Unified billing for GPUs, vCPUs and RAM from October 1, 2025

Starting October 1, 2025, GPUs, vCPUs and RAM of each standalone application and Compute virtual machine with NVIDIA B200, H200 or H100 GPUs are charged together rather than separately. This change becomes visible in your usage details and prices on October 22, 2025, with all usage from October 1 onwards recalculated retroactively. This does not affect your costs. What you pay per hour stays the same. For more details, see billing documentation. The next section lists prices for both old and new, unified billing items.

Standalone Applications

Applications deployed using the Standalone deployment option are charged for computing resources, storage and third-party licenses.

Prices

Nebius AI Cloud charges you for the computing resources allocated to running applications and the storage size. For NVIDIA NIM™ microservices, Nebius AI Cloud also charges you for the NVIDIA AI Enterprise License. Public access to applications is free of charge.

Computing resources

You are charged for computing resources in your running applications. You are not charged for computing resources in stopped applications.
  • Billing unit: 1 second
  • Pricing unit: 1 hour (3600 seconds)
The platform is available in the eu-north1 and us-central1 regions.
Item — before October 1, 2025PricePer
Standalone application on NVIDIA® H200 NVLink with Intel Sapphire Rapids. GPU$2.6681 GPU hour
Standalone application on NVIDIA® H200 NVLink with Intel Sapphire Rapids. CPU$0.0121 CPU hour
Standalone application on NVIDIA® H200 NVLink with Intel Sapphire Rapids. RAM$0.00321 GiB hour
Item — from October 1, 2025PricePer
Standalone application on NVIDIA® H200 NVLink with Intel Sapphire Rapids$3.501 GPU hour
Charges are based on the resource preset that you are using. For example, using a 8gpu-128vcpu-1600gb application for 1 hour costs 8 × $2.668 + 128 × $0.012 + 1600 × $0.0032 = 8 × $3.50 = $28.00. For 730 hours (~ 1 month), this amounts to 730 × $28.00 = $20,440.00.
The platform is only available in the eu-north1 region.
Item — before October 1, 2025PricePer
Standalone application on NVIDIA® H100 NVLink with Intel Sapphire Rapids. GPU$2.1181 GPU hour
Standalone application on NVIDIA® H100 NVLink with Intel Sapphire Rapids. CPU$0.0121 CPU hour
Standalone application on NVIDIA® H100 NVLink with Intel Sapphire Rapids. RAM$0.00321 GiB hour
Item — from October 1, 2025PricePer
Standalone application on NVIDIA® H100 NVLink with Intel Sapphire Rapids$2.951 GPU hour
Charges are based on the resource preset that you are using. For example, using a 8gpu-128vcpu-1600gb application for 1 hour costs 8 × $2.118 + 128 × $0.012 + 1600 × $0.0032 = 8 × $2.95 = $23.60. For 730 hours (~ 1 month), this amounts to 730 × $23.60 = $17,228.00.

NVIDIA® L40S PCIe with Intel Ice Lake

The platform is only available in the eu-north1 region.
ItemPricePer
Standalone application on NVIDIA® L40S PCIe with Intel Ice Lake. GPU$1.351 GPU hour
Standalone application on NVIDIA® L40S PCIe with Intel Ice Lake. CPU$0.0121 CPU hour
Standalone application on NVIDIA® L40S PCIe with Intel Ice Lake. RAM$0.00321 GiB hour
Charges are based on the resource preset that you are using. For example, using a 1gpu-16vcpu-64gb application for 1 hour costs $1.35 + 16 × $0.012 + 64 × $0.0032 = $1.7468. For 730 hours (~ 1 month), this amounts to 730 × $1.7468 = $1275.164.

NVIDIA® L40S PCIe with AMD Epyc Genoa

The platform is only available in the eu-north1 region.
ItemPricePer
Standalone application on NVIDIA® L40S PCIe with AMD Epyc Genoa. GPU$1.351 GPU hour
Standalone application on NVIDIA® L40S PCIe with AMD Epyc Genoa. CPU$0.011 CPU hour
Standalone application on NVIDIA® L40S PCIe with AMD Epyc Genoa. RAM$0.00321 GiB hour
Charges are based on the resource preset that you are using. For example, using a 1gpu-16vcpu-96gb application for 1 hour costs $1.35 + 16 × $0.01 + 96 × $0.0032 = $1.8172. For 730 hours (~ 1 month), this amounts to 730 × $1.8172 = $1326.556.

Non-GPU AMD EPYC Genoa

The platform is available in the eu-north1 and us-central1 regions.
ItemPricePer
Standalone application on Non-GPU AMD Epyc Genoa. CPU$0.0251 CPU hour
Standalone application on Non-GPU AMD Epyc Genoa. RAM$0.0051 GiB hour
Charges are based on the resource preset that you are using. For example, using a 4vcpu-16gb application for 1 hour costs 4 × $0.025 + 16 × $0.005 = $0.18. For 730 hours (~ 1 month), this amounts to 730 × $0.18 = $131.40.

Non-GPU Intel Ice Lake

The platform is only available in the eu-north1 region.
ItemPricePer
Standalone application on Non-GPU Intel Ice Lake. CPU$0.0251 CPU hour
Standalone application on Non-GPU Intel Ice Lake. RAM$0.0051 GiB hour
Charges are based on the resource preset that you are using. For example, using a 4vcpu-16gb application for 1 hour costs 4 × $0.025 + 16 × $0.005 = $0.18. For 730 hours (~ 1 month), this amounts to 730 × $0.18 = $131.40.

Storage

You are charged for the disks used by deployed applications. Charges are based on disk sizes, regardless of the amount of used space.
  • Billing unit: 1 byte per 1 second
  • Pricing unit: 1 GiB per 730 hours (230 bytes per 2,628,000 seconds ~ 1 month)
ItemPrice per 1 GiB per 730 hours
Standalone application. Network SSD disk$0.071

Licenses

You are charged for third-party software licenses used by running applications. You are not charged for licenses for stopped applications.
  • Billing unit: 1 GPU per 1 second
  • Pricing unit: 1 GPU per 1 hour (3600 seconds)
ItemApplies to applicationsPrice per 1 GPU hour
NVIDIA AI Enterprise LicenseNVIDIA NIM microservices$0.90
Charges are based on the number of GPUs in the resource preset that you are using. For example, if an NVIDIA NIM microservice that uses the preset 8gpu-128vcpu-1600gb is running for 1 hour, a license for it costs 8 × $0.90 = $7.20. For 730 hours (~ 1 month), this amounts to 730 × $7.20 = $5256.00.

Kubernetes

Applications deployed by using the Kubernetes deployment option run on Managed Kubernetes clusters. Applications themselves are not subject to their own charges. Only Managed Kubernetes charges apply to clusters and nodes that host the applications.

Virtual machine

Applications deployed by using the Virtual machine deployment option run on Compute virtual machines. Applications themselves are not subject to their own charges. Only Compute charges apply to the virtual machines that host the applications.