> ## Documentation Index > Fetch the complete documentation index at: https://docs.nebius.com/llms.txt > Use this file to discover all available pages before exploring further. # Deploying a large language model and chatting with it by using Serverless AI endpoints Serverless AI lets you deploy and manage endpoints without handling infrastructure yourself. With endpoints, you can create an OpenAI-compatible model backend in a few minutes. This tutorial shows how to prepare your environment, create your first endpoint with an open-source large language model (LLM), and send a chat request. The endpoint is based on the `vllm/vllm-openai:latest` image. [vLLM](https://github.com/vllm-project/vllm) automatically downloads the model from [Hugging Face](https://huggingface.co) when the endpoint starts. The container exposes an OpenAI-compatible `/v1/chat/completions` API. For a quick walkthrough of the web console workflow, watch the video below. If you prefer other interfaces or written instructions, follow the steps further down.