Skip to main content
This article lists what has been added, updated, fixed or removed in Nebius AI Cloud services.
March 2–8, 2026
  • Added: Virtual machines now support secondary private IP addresses that you can use as a backup option in case of incidents.
  • Added: Nebius VPN Gateway, an open-source, VM-based IPsec gateway provided by Nebius Professional Services, is now available. You can deploy it to establish site-to-site VPN connectivity with external networks in other cloud providers or on-premises data centers.
  • Added: Nebius AI Cloud now calculates taxes for the residents of Rhode Island in accordance with the United States tax law.
  • Updated: For viewing ingested traces from Managed Kubernetes clusters in Grafana®, the connection URL is now https://read.tracing.api.nebius.cloud/projects/<project_ID>/tempo instead of https://read.tracing.api.nebius.cloud/tempo/<project_ID>.
February 23 – March 1, 2026
February 16–22, 2026
February 9–15, 2026
  • Added: In Serverless AI, you can now debug failed jobs by keeping the job’s VM alive to connect via SSH.
  • Added: Single sign-on with JumpCloud is now available. Users can sign in to Nebius AI Cloud with their JumpCloud credentials instead of a separate password.
  • Added: You can now export audit events to an Object Storage bucket for a specific time range. Use exported data for long-term retention, data portability or analysis in external tools.
  • Added: You can now collect traces with Nebius Observability Agent for Kubernetes and view traces in Grafana. Send traces from cluster applications via OTLP and explore them in Grafana with Tempo.
  • Updated: We now recommend GPU boot disk images with Ubuntu 24.04 and CUDA 12 or CUDA 13.0. The ubuntu22.04-cuda12 image is not recommended and will be deprecated.
  • Updated: When setting up GPUs in Managed Kubernetes, you can now choose between driver presets cuda12.8, cuda13.0 and cuda12.4. You can also change the driver preset by updating the node group when you need a different driver or CUDA version.
February 2–8, 2026
  • Added: Serverless AI jobs and endpoints are now available. You can use them to run AI workloads without any need to create and maintain infrastructure for them.
  • Added: Taxes are now included in invoices for individuals from India, Singapore, the United Arab Emirates and the United Kingdom.
January 26 – February 1, 2026
January 19–25
  • Added: Object Storage now supports logs of two types: control plane logs and data plane logs. By using them, you can get detailed information about actions on buckets and objects.
January 5–11, 2026
December 29–31, 2025
  • Added: You can now set up billing data export and automatically get usage data for cost analysis.
December 15–21, 2025
  • Updated: The me-west1 region is now public. Services available in this region are listed in Regions.
December 1–7, 2025
  • Updated: The default Compute quotas on the numbers of NVIDIA B200 and NVIDIA H200 GPUs in the us-central1 region per tenant are now zero, down from 32 and 16, respectively. This means that in tenants created after December 1, you need to request a quota raise before creating VMs with any number of these GPUs in us-central1 projects. The change does not affect existing tenants.
November 24–30, 2025
  • Added: Two new virtual machine platforms are now available for Compute virtual machines, Managed Kubernetes nodes and other resources:
    • NVIDIA® B300 NVLink with Intel Granite Rapids (ID gpu-b300-sxm) is now available in the private region uk-south1. It introduces NVIDIA B300 GPUs to Nebius AI Cloud.
    • NVIDIA® B200 NVLink with Intel Emerald Rapids (ID gpu-b200-sxm-a) is now available in the private region me-west1. This is the second platform with NVIDIA B200 GPUs; a platform with the same name (ID gpu-b200-sxm) is available in the public region us-central1.
  • Added: When creating a support ticket in the support center, you can now choose the topic of the ticket (technical, billing or digital rights). Available priority levels for technical tickets have also changed: the standard level is renamed medium, and new high and low levels are added to the existing levels. This helps us process your requests faster and more efficiently.
November 17–23, 2025
November 10–16, 2025
November 3–9, 2025
  • Updated: The default Compute quota on the number of NVIDIA H200 GPUs in the us-central1 region per tenant is now 16, down from 32. The change does not affect existing tenants.
  • Updated: When you manually install the NVIDIA GPU Operator on your InfiniBand-interconnected Managed Kubernetes nodes, GPUDirect RDMA is now enabled by default.
October 27–November 2, 2025
  • Added: Monitoring dashboards for Managed PostgreSQL clusters now include two new connection pooler metrics, Pooler clients active and Pooler clients waiting.
October 20–26, 2025
  • Added: Managed Kubernetes now supports Kubernetes 1.32. This version is selected for new clusters and node groups by default. See the overview of Kubernetes versions.
  • Added: The list of roles has been extended. In addition to general roles (admin, editor, viewer, auditor), you can now assign new service roles to custom groups, allowing for more granular access to your resources. The new roles control access to Object Storage and MysteryBox resources, as well as Data Subject Requests to support.
  • Updated: Starting October 22, usage of virtual machines and applications that use NVIDIA B200, H200 and H100 GPUs is recorded under new billing items that cover GPUs, vCPUs and RAM together rather than separately. Usage from October 1–21 is recalculated retroactively. This makes monitoring your usage simpler and does not affect your costs. Learn more.
September 29–October 5, 2025
September 22–28, 2025
  • Updated: Automatic security updates are now disabled by default in boot disk images for non-GPU virtual machines in Compute, in addition to GPU VMs, to prevent errors in running workloads. You can enable them back.
  • Updated: Computing resources of all Compute virtual machines are now eligible for committed usage discounts, as opposed to only VMs with NVIDIA H100 GPUs.
September 15–21, 2025
  • Added: The MysteryBox service is launched, allowing you to store sensitive data in an encrypted form to reuse them in your scripts, configuration files or applications.
  • Added: You can now create routing tables and routes to customize the routing of egress traffic and implement advanced networking scenarios, such as custom NAT gateways or segmented architectures.
  • Added: Your committed usage discounts are now grouped in orders that correspond to the agreement addendums that you sign.
  • Updated: Automatic security updates are now disabled by default in boot disk images for GPU virtual machines in Compute to prevent errors in running workloads. You can enable them back.
  • Updated: Default Compute quotas for GPU and non-GPU virtual machines are now separate, allowing new customers to create more VMs overall and control the remaining capacity in their projects more precisely.
September 8–14, 2025
  • Added: You can now create Compute virtual machines with container images deployed on them: containers over VMs. You can use images provided by Nebius AI Cloud or custom Docker images from public registries.
  • Added: Two new extensions for PostgreSQL are now available on all Managed Service for PostgreSQL clusters: RUM offers enhanced full-text search, and pg_repack allows you to remove bloat from tables and indexes.
  • Updated: If you pay for Nebius AI Cloud services by bank card, we can now suspend your account after one unsuccessful attempt to charge your card rather than after multiple attempts. Check regularly that your payment method is up to date.
September 1–7, 2025
  • Added: Managed Kubernetes now performs health checks on nodes, and automatically recovers nodes that are not ready or have issues with the boot disk or GPUs. You can enable and disable these health checks.
  • Added: Audit Logs now support filtering events by more fields, so you can find events that occurred in a specific region, are associated with operations that have a specific status, etc.
  • Updated: The navigation in the web console has undergone a major overhaul to group and present services and resources more clearly.
  • Updated: The upload and download speed for the Standard storage class in Object Storage is now limited to 20 GBps per tenant in all regions.