Nebius AI Cloud offers AI services for containerized inference and training, experiment tracking, integrations with third-party software and orchestration with external tools. Different services can help you achieve different goals:Documentation Index
Fetch the complete documentation index at: https://docs.nebius.com/llms.txt
Use this file to discover all available pages before exploring further.
- Use Serverless AI to run containerized AI workloads either as endpoints that return results immediately, or as background jobs that run in the background and stop when the task is complete.
- Use Managed Service for MLflow to manage machine learning experiments by tracking runs, organizing models into versions, comparing metrics and deploying customized models.
- Use Applications in Nebius AI Cloud to work with third-party software and services that help streamline AI and ML workflows.
- Use third-party integrations to run and orchestrate AI workloads in the Nebius AI Cloud infrastructure.
Getting started with endpoints in Serverless AI
Deploy an interactive endpoint that listens for requests and returns results immediately
Getting started with jobs in Serverless AI
Run a non-interactive workload in the background and stop it automatically after completion
Getting started with MLflow
Create your first MLflow cluster and run an experiment in it
Deploying applications
Browse the marketplace and choose an application to deploy
Third-party integrations: SkyPilot
Run, manage and scale AI workloads on Nebius AI Cloud by using SkyPilot
Third-party integrations: Run:ai
Optimize your GPU resources for ML/AI workloads by using Run:ai and Managed Service for Kubernetes®
Tutorials and examples
Follow our Serverless AI tutorials for step-by-step instructions on setting up specific AI workloads, or explore tutorials that combine multiple Nebius AI Cloud services to build complete solutions.Deploying a large language model and chatting with it
Create your first Serverless AI endpoint with an open-source large language model and send a chat request
Fine-tuning a large language model
Fine-tune a large language model by using a Serverless AI job with Axolotl
Speech synthesis
Convert text to speech by creating a model, running a fine-tuning job and deploying an endpoint
Building solutions with AI services
Explore tutorials that show you how to combine Nebius AI Cloud services to build complete solutions
Serverless cookbook
Explore runnable examples of Serverless AI workloads on GitHub