H200 vLLM hosting price by provider
OpenAI-compatible vLLM serving usually starts at 24GB GPUs and moves into 80GB+ rows for larger models, batching, and long prompts. For H200, the current tracked provider floor is $2.29/hr on Lambda.
Current H200 rows for vLLM hosting
H200 clears the 24GB+ filter for vLLM hosting. The broader vLLM hosting floor is $0.40/hr with RTX 4090 on Vast.ai, so the cheapest H200 row is $1.89/hr above that floor.
| Provider | Type | Median hourly | Monthly | Workload page | Offers | Updated |
|---|---|---|---|---|---|---|
| Lambda | on-demand | $2.29/hr | $1,672/mo | Lambda vLLM hosting | 1 | Jun 21, 2026 |
| Vast.ai | on-demand | $3.68/hr | $2,690/mo | Vast.ai vLLM hosting | 27 | Jun 21, 2026 |
| RunPod | on-demand | $4.39/hr | $3,205/mo | RunPod vLLM hosting | 1 | Jun 21, 2026 |
| AWS | on-demand | $7.91/hr | $5,776/mo | AWS vLLM hosting | 2 | Jun 21, 2026 |
| Oracle | on-demand | $10.00/hr | $7,300/mo | Oracle vLLM hosting | 1 | Jun 21, 2026 |
H200 for vLLM hosting: current provider pricing
This page starts with the GPU instead of the provider, then filters the live market to rows that match vLLM hosting. It is useful when you have already chosen H200 and need the cheapest provider path for the workload.
Open-weight inference APIs, chat completions, and model serving teams comparing provider-specific GPU options. The table keeps provider/GPU and provider/workload links close together so you can keep narrowing the shortlist without going back to the homepage.
Cheapest H200 provider for vLLM hosting
The cheapest tracked H200 row for vLLM hosting is $2.29/hr on Lambda with on-demand pricing. The broader vLLM hosting floor is $0.40/hr with RTX 4090 on Vast.ai, so the cheapest H200 row is $1.89/hr above that floor.
How this GPU workload page is computed
We filter the compare payload to H200, require 24GB+ workload fit, and sort current on-demand provider rows by median hourly price.
H200 vLLM hosting pricing FAQ
Is H200 good for vLLM hosting?
H200 clears the 24GB+ filter for vLLM hosting. The cheapest tracked H200 row for vLLM hosting is $2.29/hr on Lambda with on-demand pricing.
What is the cheapest H200 provider for vLLM hosting?
The cheapest tracked H200 row for vLLM hosting is $2.29/hr on Lambda with on-demand pricing.
How does H200 compare with the broader vLLM hosting market?
The broader vLLM hosting floor is $0.40/hr with RTX 4090 on Vast.ai, so the cheapest H200 row is $1.89/hr above that floor.
How fresh is this H200 vLLM hosting page?
The page is recalculated from the latest stored on-demand rows. The freshest qualifying row visible here is from Jun 21, 2026.
Next searches after H200 vLLM hosting
Use these links to move into the base GPU page, the workload guide, provider workload slices, and adjacent GPU workload pages.