Skip to main content
This article lists what has been added, updated, fixed or removed in Nebius AI Cloud services.
April 20–26, 2026
  • Added: Compute disks and shared filesystems now support deletion protection. Enable it to prevent accidental deletion in automated environments such as CI/CD pipelines or Terraform configurations.
  • Added: The Nebius AI Cloud CLI now supports an interactive mode. To enable it, add -i or --interactive to a command. The CLI prompts for parameters step by step, validates each value and shows the resulting configuration before you apply it.
  • Added: The log export to Object Storage now supports the Nebius AI Cloud CLI.
  • Added: You can now get a list of public boot disk images available in a given region. The list includes image families, recommended platforms and unsupported platforms for each image.
  • Added: When Prometheus takes too long to retrieve metrics of large clusters, shard and split their scraping jobs.
  • Updated: The AWS CLI is now documented as a supported interface for configuring lifecycle rules in Object Storage, alongside the Nebius AI Cloud interfaces.
April 13–19, 2026
  • Added: The capacity advisor is now available in the web console. It shows GPU capacity availability across regions, platforms and presets so you can check where you can launch virtual machines before creating them.
  • Added: You can now monitor data plane events for Audit Logs in the web console. Graphs show processed events, quota usage and dropped events so you can detect when Object Storage activity approaches the quota.
  • Updated: You can use the web console to create, configure and delete Serverless AI endpoints.
  • Updated: You can manage authorized keys for service accounts from the web console and the Terraform provider, in addition to the CLI.
  • Updated: The SkyPilot integration tutorial now includes instructions to use the Managed SkyPilot API Server, a standalone application available in the Nebius AI Cloud console.
  • Updated: The Nebius Service Health Status page now displays all timestamps in UTC (GMT+00:00).
April 6–12, 2026
March 30 – April 5, 2026
  • Added: The NVIDIA® RTX PRO™ 6000 virtual machine platform is now available in the us-central1 region. It features 96 GB of GDDR7 memory per GPU and comes in single-GPU and eight-GPU presets.
  • Added: Serverless AI CLI commands now support MysteryBox secrets. You can pass secrets as environment variables, registry credentials or tokens when creating endpoints and jobs, without exposing sensitive values in plain text.
  • Updated: You can now manage capacity reservations for Compute virtual machines and Managed Kubernetes node groups in the web console. Previously, this required the CLI or Terraform provider.
  • Updated: You can now export logs directly from individual resource pages in the web console, in addition to the central Observability Logs page.
  • Updated: The default retention period for logs in Observability services is now 14 days.
March 23–29, 2026
  • Added: Object versioning is available in Object Storage. When enabled, each update to an object creates a new version, which protects data from loss and allows you to restore previous versions.
  • Added: You can now create custom boot disk images to tailor images to your needs and create multiple virtual machines with custom disk architecture.
  • Added: You can also create custom boot disk images by using the Nebius Packer plug-in. Use it to pre-install packages to a disk, prepare versioned images and standardize environments across teams or workloads.
  • Added: You can now use the Nebius Terraform provider with Pulumi to declare your Nebius AI Cloud resources in the preferred programming language instead of using Terraform configuration.
  • Added: You can now export logs from Observability to an Object Storage bucket in JSON or Parquet format for long-term retention or analysis in external tools.
  • Updated: You can now attach and detach disks on running Compute virtual machines without stopping them first.
March 16–22, 2026
March 9–15, 2026
March 2–8, 2026
  • Added: Virtual machines now support secondary private IP addresses that you can use as a backup option in case of incidents.
  • Added: Nebius VPN Gateway, an open-source, VM-based IPsec gateway provided by Nebius Professional Services, is now available. You can deploy it to establish site-to-site VPN connectivity with external networks in other cloud providers or on-premises data centers.
  • Added: Nebius AI Cloud now calculates taxes for the residents of Rhode Island in accordance with the United States tax law.
  • Updated: For viewing ingested traces from Managed Kubernetes clusters in Grafana®, the connection URL is now https://read.tracing.api.nebius.cloud/projects/<project_ID>/tempo instead of https://read.tracing.api.nebius.cloud/tempo/<project_ID>.
February 23 – March 1, 2026
February 16–22, 2026
February 9–15, 2026
  • Added: In Serverless AI, you can now debug failed jobs by keeping the job’s VM alive to connect via SSH.
  • Added: Single sign-on with JumpCloud is now available. Users can sign in to Nebius AI Cloud with their JumpCloud credentials instead of a separate password.
  • Added: You can now export audit events to an Object Storage bucket for a specific time range. Use exported data for long-term retention, data portability or analysis in external tools.
  • Added: You can now collect traces with Nebius Observability Agent for Kubernetes and view traces in Grafana. Send traces from cluster applications via OTLP and explore them in Grafana with Tempo.
  • Updated: We now recommend GPU boot disk images with Ubuntu 24.04 and CUDA 12 or CUDA 13.0. The ubuntu22.04-cuda12 image is not recommended and will be deprecated.
  • Updated: When setting up GPUs in Managed Kubernetes, you can now choose between driver presets cuda12.8, cuda13.0 and cuda12.4. You can also change the driver preset by updating the node group when you need a different driver or CUDA version.
February 2–8, 2026
  • Added: Serverless AI jobs and endpoints are now available. You can use them to run AI workloads without any need to create and maintain infrastructure for them.
  • Added: Taxes are now included in invoices for individuals from India, Singapore, the United Arab Emirates and the United Kingdom.
January 26 – February 1, 2026
January 19–25
  • Added: Object Storage now supports logs of two types: control plane logs and data plane logs. By using them, you can get detailed information about actions on buckets and objects.
January 5–11, 2026
December 29–31, 2025
  • Added: You can now set up billing data export and automatically get usage data for cost analysis.
December 15–21, 2025
  • Updated: The me-west1 region is now public. Services available in this region are listed in Regions.
December 1–7, 2025
  • Updated: The default Compute quotas on the numbers of NVIDIA B200 and NVIDIA H200 GPUs in the us-central1 region per tenant are now zero, down from 32 and 16, respectively. This means that in tenants created after December 1, you need to request a quota raise before creating VMs with any number of these GPUs in us-central1 projects. The change does not affect existing tenants.
November 24–30, 2025
  • Added: Two new virtual machine platforms are now available for Compute virtual machines, Managed Kubernetes nodes and other resources:
    • NVIDIA® B300 NVLink with Intel Granite Rapids (ID gpu-b300-sxm) is now available in the private region uk-south1. It introduces NVIDIA B300 GPUs to Nebius AI Cloud.
    • NVIDIA® B200 NVLink with Intel Emerald Rapids (ID gpu-b200-sxm-a) is now available in the private region me-west1. This is the second platform with NVIDIA B200 GPUs; a platform with the same name (ID gpu-b200-sxm) is available in the public region us-central1.
  • Added: When creating a support ticket in the support center, you can now choose the topic of the ticket (technical, billing or digital rights). Available priority levels for technical tickets have also changed: the standard level is renamed medium, and new high and low levels are added to the existing levels. This helps us process your requests faster and more efficiently.
November 17–23, 2025
November 10–16, 2025
November 3–9, 2025
  • Updated: The default Compute quota on the number of NVIDIA H200 GPUs in the us-central1 region per tenant is now 16, down from 32. The change does not affect existing tenants.
  • Updated: When you manually install the NVIDIA GPU Operator on your InfiniBand-interconnected Managed Kubernetes nodes, GPUDirect RDMA is now enabled by default.
October 27–November 2, 2025
  • Added: Monitoring dashboards for Managed PostgreSQL clusters now include two new connection pooler metrics, Pooler clients active and Pooler clients waiting.
October 20–26, 2025
  • Added: Managed Kubernetes now supports Kubernetes 1.32. This version is selected for new clusters and node groups by default. See the overview of Kubernetes versions.
  • Added: The list of roles has been extended. In addition to general roles (admin, editor, viewer, auditor), you can now assign new service roles to custom groups, allowing for more granular access to your resources. The new roles control access to Object Storage and MysteryBox resources, as well as Data Subject Requests to support.
  • Updated: Starting October 22, usage of virtual machines and applications that use NVIDIA B200, H200 and H100 GPUs is recorded under new billing items that cover GPUs, vCPUs and RAM together rather than separately. Usage from October 1–21 is recalculated retroactively. This makes monitoring your usage simpler and does not affect your costs. Learn more.
September 29–October 5, 2025
September 22–28, 2025
  • Updated: Automatic security updates are now disabled by default in boot disk images for non-GPU virtual machines in Compute, in addition to GPU VMs, to prevent errors in running workloads. You can enable them back.
  • Updated: Computing resources of all Compute virtual machines are now eligible for committed usage discounts, as opposed to only VMs with NVIDIA H100 GPUs.
September 15–21, 2025
  • Added: The MysteryBox service is launched, allowing you to store sensitive data in an encrypted form to reuse them in your scripts, configuration files or applications.
  • Added: You can now create routing tables and routes to customize the routing of egress traffic and implement advanced networking scenarios, such as custom NAT gateways or segmented architectures.
  • Added: Your committed usage discounts are now grouped in orders that correspond to the agreement addendums that you sign.
  • Updated: Automatic security updates are now disabled by default in boot disk images for GPU virtual machines in Compute to prevent errors in running workloads. You can enable them back.
  • Updated: Default Compute quotas for GPU and non-GPU virtual machines are now separate, allowing new customers to create more VMs overall and control the remaining capacity in their projects more precisely.
September 8–14, 2025
  • Added: You can now create Compute virtual machines with container images deployed on them: containers over VMs. You can use images provided by Nebius AI Cloud or custom Docker images from public registries.
  • Added: Two new extensions for PostgreSQL are now available on all Managed Service for PostgreSQL clusters: RUM offers enhanced full-text search, and pg_repack allows you to remove bloat from tables and indexes.
  • Updated: If you pay for Nebius AI Cloud services by bank card, we can now suspend your account after one unsuccessful attempt to charge your card rather than after multiple attempts. Check regularly that your payment method is up to date.
September 1–7, 2025
  • Added: Managed Kubernetes now performs health checks on nodes, and automatically recovers nodes that are not ready or have issues with the boot disk or GPUs. You can enable and disable these health checks.
  • Added: Audit Logs now support filtering events by more fields, so you can find events that occurred in a specific region, are associated with operations that have a specific status, etc.
  • Updated: The navigation in the web console has undergone a major overhaul to group and present services and resources more clearly.
  • Updated: The upload and download speed for the Standard storage class in Object Storage is now limited to 20 GBps per tenant in all regions.