Endpoints and jobs
You can deploy your workload as an endpoint that listens for requests and returns results immediately, or as a job that runs in the background and quits after completing its task. Here is the comparison between endpoints and jobs at a glance:| Endpoint | Job | |
|---|---|---|
| Workflow | Interactive, listens for requests until you terminate it | Non-interactive, terminates upon task completion or timeout |
| Stop/start | Yes | No |
| Public URL for requests | Yes | No |
| Typical lifetime | Hours to days | Minutes to days |
| Use cases | Persistent workloads: serving and A/B-testing models, real-time inference | Batch workloads: pre-processing data, training and fine-tuning models, batch inference and model evaluation, scientific simulations |
| Guides | Getting started with endpoints | Getting started with jobs |