Provider workload page

Vast.ai vLLM hosting prices

OpenAI-compatible vLLM serving usually starts at 24GB GPUs and moves into 80GB+ rows for larger models, batching, and long prompts. The cheapest tracked Vast.ai row in this slice is RTX 4090 at $0.40/hr.

24GB+ VRAM on-demand pricing Updated Jun 21, 2026

View qualifying rows All Vast.ai GPUs

Cheapest qualifying row

$0.40/hr

RTX 4090 · on-demand

Monthly estimate

$293/mo

730 hours at current median price

Qualifying GPUs

11 current rows updated Jun 21, 2026

Market gap

At floor

Floor is Vast.ai RTX 4090

Live workload pricing

Current Vast.ai rows for vLLM hosting

Vast.ai has 11 qualifying GPU models across 11 current price rows for this workload. Vast.ai is currently at the tracked market floor for vLLM hosting.

Updated Jun 21, 2026

GPU	VRAM	Type	Median hourly	Monthly	Offers	Updated
RTX 4090 Budget	24GB GDDR6X	on-demand	$0.40/hr	$293/mo	69	Jun 21, 2026
L40 Mid-Range	48GB GDDR6	on-demand	$0.45/hr	$331/mo	1	Jun 21, 2026
RTX 5090 Budget	32GB GDDR7	on-demand	$0.53/hr	$390/mo	129	Jun 21, 2026
RTX 6000Ada Mid-Range	48GB GDDR6	on-demand	$0.60/hr	$439/mo	8	Jun 21, 2026
A100 PCIE Mid-Range	80GB HBM2e	on-demand	$1.00/hr	$731/mo	2	Jun 19, 2026
A100 SXM4 High Performance	80GB HBM2e	on-demand	$1.14/hr	$834/mo	6	Jun 21, 2026
H100 PCIE High Performance	80GB HBM3	on-demand	$1.64/hr	$1,199/mo	1	Jun 21, 2026
H100 NVL High Performance	94GB HBM3	on-demand	$2.27/hr	$1,656/mo	9	Jun 21, 2026
H100 SXM Flagship	80GB HBM3	on-demand	$2.40/hr	$1,752/mo	15	Jun 21, 2026
H200 NVL Flagship	141GB HBM3e	on-demand	$3.46/hr	$2,529/mo	3	Jun 21, 2026

Search guide

Vast.ai vLLM hosting pricing overview

This page narrows Vast.ai's GPU catalog to the rows most likely to matter for vLLM hosting. It is built for buyers who already know the provider they are evaluating and need a workload-specific price shortlist.

Open-weight inference APIs, chat completions, and model serving teams comparing provider-specific GPU options. Use the table below to move from the cheapest visible row into adjacent provider/GPU pages, broader guides, and market-floor comparisons.

Cheapest provider right now

Cheapest Vast.ai row for vLLM hosting

Vast.ai currently starts at $0.40/hr for vLLM hosting with RTX 4090 on on-demand pricing. Vast.ai is currently at the tracked market floor for vLLM hosting.

Methodology and freshness

How this workload slice is computed

We filter the live compare payload to on-demand rows with at least 24GB of VRAM, then sort qualifying Vast.ai rows by current median hourly price.

FAQ

Vast.ai vLLM hosting pricing FAQ

What is the cheapest Vast.ai option for vLLM hosting?

Vast.ai currently starts at $0.40/hr for vLLM hosting with RTX 4090 on on-demand pricing.

Is Vast.ai cheapest for vLLM hosting?

Vast.ai is currently at the tracked market floor for vLLM hosting.

What GPUs count toward this vLLM hosting page?

This page filters to 24GB+ GPUs in the RTX 4090, RTX 5090, L4, A10G, L40, and more set and uses on-demand pricing.

How fresh is this Vast.ai vLLM hosting page?

The rows are recalculated from the latest stored provider snapshot. The freshest qualifying row visible here is from Jun 21, 2026.

Next searches after Vast.ai vLLM hosting

These links move sideways into the full provider catalog, workload guide, market floor, and adjacent provider workload pages.