GPU workload page

L40 vLLM hosting price by provider

OpenAI-compatible vLLM serving usually starts on 24GB GPUs and moves to 80GB+ GPUs for larger models, heavier batching, and long prompts. For L40, the current tracked provider floor is $0.45/hr on Vast.ai.

48GB GDDR6 | vLLM hosting | 24GB+ floor | Updated Jun 21, 2026

Cheapest provider: Vast.ai at $0.45/hr

Monthly estimate: $331/mo (730 hours at current median price)

Provider coverage: 6 current rows, updated Jun 21, 2026

Market gap: +$0.05/hr (floor is Vast.ai RTX 4090)

Live provider pricing

Current L40 rows for vLLM hosting

L40 clears the 24GB+ filter for vLLM hosting. The broader vLLM hosting floor is $0.40/hr with RTX 4090 on Vast.ai, so the cheapest L40 row is $0.05/hr above that floor.

Updated Jun 21, 2026

Provider	Type	Median hourly	Monthly	Workload page	Offers	Updated
Vast.ai	on-demand	$0.45/hr	$331/mo	Vast.ai vLLM hosting	1	Jun 21, 2026
GCP	on-demand	$0.66/hr	$482/mo	GCP vLLM hosting	55	Jun 21, 2026
RunPod	on-demand	$0.91/hr	$661/mo	RunPod vLLM hosting	2	Jun 21, 2026
Lambda	on-demand	$1.29/hr	$942/mo	Lambda vLLM hosting	1	Jun 21, 2026
AWS	on-demand	$3.39/hr	$2,471/mo	AWS vLLM hosting	16	Jun 21, 2026
Oracle	on-demand	$3.50/hr	$2,555/mo	Oracle vLLM hosting	1	Jun 21, 2026
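
The Monthly column appears to follow the 730-hour convention used for the monthly estimate on this page. A minimal sketch of that arithmetic follows; the `monthly_estimate` helper and the half-up rounding to whole dollars are illustrative assumptions, not something the page documents.

```python
from decimal import Decimal, ROUND_HALF_UP

# From the page: monthly estimates use 730 hours at the median price.
HOURS_PER_MONTH = 730


def monthly_estimate(median_hourly: str) -> int:
    """Whole-dollar monthly cost for a displayed hourly price.

    Hypothetical helper: half-up rounding is an assumption.
    """
    cost = Decimal(median_hourly) * HOURS_PER_MONTH
    return int(cost.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


# Displayed median hourly prices copied from the table above.
displayed = {"GCP": "0.66", "Lambda": "1.29", "Oracle": "3.50"}

for provider, price in displayed.items():
    print(f"{provider}: ${monthly_estimate(price):,}/mo")
# GCP: $482/mo
# Lambda: $942/mo
# Oracle: $2,555/mo
```

With the displayed prices, this reproduces the GCP, Lambda, and Oracle rows exactly. The Vast.ai, RunPod, and AWS rows land a few dollars away (for example, $0.45 × 730 = $328.50 against the listed $331), which may mean the listed monthly figures are computed from unrounded medians; that explanation is a guess.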

Search guide

L40 for vLLM hosting: current provider pricing

This page starts with the GPU instead of the provider, then filters the live market to rows that match vLLM hosting. It is useful when you have already chosen L40 and need the cheapest provider path for the workload.

It is aimed at teams running open-weight inference APIs, chat completions, and model serving who are comparing provider-specific GPU options. The table keeps provider/GPU and provider/workload links close together so you can keep narrowing the shortlist without going back to the homepage.

Cheapest provider right now

Cheapest L40 provider for vLLM hosting

The cheapest tracked L40 row for vLLM hosting is $0.45/hr on Vast.ai with on-demand pricing. The broader vLLM hosting floor is $0.40/hr with RTX 4090 on Vast.ai, so the cheapest L40 row is $0.05/hr above that floor.
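
The gap figure is a plain subtraction of the two floors. A small sketch, using `Decimal` so the cents stay exact; the function names are hypothetical and only illustrate the arithmetic.

```python
from decimal import Decimal


def market_gap(gpu_floor: str, workload_floor: str) -> Decimal:
    """Difference between the cheapest row for one GPU and the workload-wide floor."""
    return Decimal(gpu_floor) - Decimal(workload_floor)


def format_gap(gap: Decimal) -> str:
    """Render the gap with an explicit sign, in the style of the page's '+$0.05/hr'."""
    sign = "+" if gap >= 0 else "-"
    return f"{sign}${abs(gap):.2f}/hr"


# L40 floor ($0.45/hr) versus the broader vLLM hosting floor ($0.40/hr, RTX 4090).
gap = market_gap("0.45", "0.40")
print(format_gap(gap))  # +$0.05/hr
```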

Methodology and freshness

How this GPU workload page is computed

We filter the provider comparison data to L40 rows, require at least 24GB of GPU memory for workload fit, keep current on-demand rows only, and sort them by median hourly price in ascending order.
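
The filter-and-sort step can be sketched as follows. This is an illustration under stated assumptions, not the site's actual pipeline: the `Row` fields, the `l40_vllm_rows` function, and the sample data layout are hypothetical, with prices taken from this page.

```python
from dataclasses import dataclass

# From the page: vLLM hosting requires a 24GB+ fit.
MIN_VRAM_GB = 24


@dataclass(frozen=True)
class Row:
    """Hypothetical shape of one provider row; field names are assumptions."""
    provider: str
    gpu: str
    vram_gb: int
    pricing_type: str
    median_hourly: float


def l40_vllm_rows(rows, gpu="L40"):
    """Keep on-demand rows for one GPU that meet the memory floor, cheapest first."""
    matches = [
        r for r in rows
        if r.gpu == gpu
        and r.vram_gb >= MIN_VRAM_GB
        and r.pricing_type == "on-demand"
    ]
    return sorted(matches, key=lambda r: r.median_hourly)


# Sample rows using prices shown on this page.
rows = [
    Row("Lambda", "L40", 48, "on-demand", 1.29),
    Row("Vast.ai", "RTX 4090", 24, "on-demand", 0.40),
    Row("GCP", "L40", 48, "on-demand", 0.66),
    Row("Vast.ai", "L40", 48, "on-demand", 0.45),
]

ranked = l40_vllm_rows(rows)
for r in ranked:
    print(f"{r.provider}: ${r.median_hourly:.2f}/hr")
# Vast.ai: $0.45/hr
# GCP: $0.66/hr
# Lambda: $1.29/hr
```

The RTX 4090 row passes the memory floor but is dropped by the GPU filter, which is why it sets the broader workload floor without appearing in the L40 table.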

FAQ

L40 vLLM hosting pricing FAQ

Is L40 good for vLLM hosting?

L40 has 48GB of GDDR6, so it clears the 24GB+ filter for vLLM hosting. The cheapest tracked L40 row for vLLM hosting is $0.45/hr on Vast.ai with on-demand pricing.

What is the cheapest L40 provider for vLLM hosting?

The cheapest tracked L40 row for vLLM hosting is $0.45/hr on Vast.ai with on-demand pricing.

How does L40 compare with the broader vLLM hosting market?

The broader vLLM hosting floor is $0.40/hr with RTX 4090 on Vast.ai, so the cheapest L40 row is $0.05/hr above that floor.

How fresh is this L40 vLLM hosting page?

The page is recalculated from the latest stored on-demand rows. The freshest qualifying row visible here is from Jun 21, 2026.
