Vast.ai vLLM hosting prices
OpenAI-compatible vLLM serving usually starts at 24GB GPUs and moves into 80GB+ rows for larger models, batching, and long prompts. The cheapest tracked Vast.ai row in this slice is RTX 4090 at $0.40/hr.
Current Vast.ai rows for vLLM hosting
Vast.ai has 11 qualifying GPU models across 11 current price rows for this workload. Vast.ai is currently at the tracked market floor for vLLM hosting.
| GPU | VRAM | Type | Median hourly | Monthly | Offers | Updated |
|---|---|---|---|---|---|---|
|
RTX 4090
Budget
|
24GB GDDR6X | on-demand | $0.40/hr | $293/mo | 69 | Jun 21, 2026 |
|
L40
Mid-Range
|
48GB GDDR6 | on-demand | $0.45/hr | $331/mo | 1 | Jun 21, 2026 |
|
RTX 5090
Budget
|
32GB GDDR7 | on-demand | $0.53/hr | $390/mo | 129 | Jun 21, 2026 |
|
RTX 6000Ada
Mid-Range
|
48GB GDDR6 | on-demand | $0.60/hr | $439/mo | 8 | Jun 21, 2026 |
|
A100 PCIE
Mid-Range
|
80GB HBM2e | on-demand | $1.00/hr | $731/mo | 2 | Jun 19, 2026 |
|
A100 SXM4
High Performance
|
80GB HBM2e | on-demand | $1.14/hr | $834/mo | 6 | Jun 21, 2026 |
|
H100 PCIE
High Performance
|
80GB HBM3 | on-demand | $1.64/hr | $1,199/mo | 1 | Jun 21, 2026 |
|
H100 NVL
High Performance
|
94GB HBM3 | on-demand | $2.27/hr | $1,656/mo | 9 | Jun 21, 2026 |
|
H100 SXM
Flagship
|
80GB HBM3 | on-demand | $2.40/hr | $1,752/mo | 15 | Jun 21, 2026 |
|
H200 NVL
Flagship
|
141GB HBM3e | on-demand | $3.46/hr | $2,529/mo | 3 | Jun 21, 2026 |
Vast.ai vLLM hosting pricing overview
This page narrows Vast.ai's GPU catalog to the rows most likely to matter for vLLM hosting. It is built for buyers who already know the provider they are evaluating and need a workload-specific price shortlist.
Open-weight inference APIs, chat completions, and model serving teams comparing provider-specific GPU options. Use the table below to move from the cheapest visible row into adjacent provider/GPU pages, broader guides, and market-floor comparisons.
Cheapest Vast.ai row for vLLM hosting
Vast.ai currently starts at $0.40/hr for vLLM hosting with RTX 4090 on on-demand pricing. Vast.ai is currently at the tracked market floor for vLLM hosting.
How this workload slice is computed
We filter the live compare payload to on-demand rows with at least 24GB of VRAM, then sort qualifying Vast.ai rows by current median hourly price.
Vast.ai vLLM hosting pricing FAQ
What is the cheapest Vast.ai option for vLLM hosting?
Vast.ai currently starts at $0.40/hr for vLLM hosting with RTX 4090 on on-demand pricing.
Is Vast.ai cheapest for vLLM hosting?
Vast.ai is currently at the tracked market floor for vLLM hosting.
What GPUs count toward this vLLM hosting page?
This page filters to 24GB+ GPUs in the RTX 4090, RTX 5090, L4, A10G, L40, and more set and uses on-demand pricing.
How fresh is this Vast.ai vLLM hosting page?
The rows are recalculated from the latest stored provider snapshot. The freshest qualifying row visible here is from Jun 21, 2026.
Next searches after Vast.ai vLLM hosting
These links move sideways into the full provider catalog, workload guide, market floor, and adjacent provider workload pages.