H200 NVL batch inference price by provider
Batch inference can use cheaper and less stable pricing modes when queues are flexible and retries are acceptable. For H200 NVL, the current tracked provider floor is $3.46/hr on Vast.ai.
Current H200 NVL rows for batch inference
H200 NVL clears the 24GB+ filter for batch inference. The broader batch inference floor is $0.13/hr with L40 on GCP, so the cheapest H200 NVL row is $3.33/hr above that floor.
| Provider | Type | Median hourly | Monthly | Workload page | Offers | Updated |
|---|---|---|---|---|---|---|
| Vast.ai | on-demand | $3.46/hr | $2,529/mo | Vast.ai batch inference | 3 | Jun 21, 2026 |
H200 NVL for batch inference: current provider pricing
This page starts with the GPU instead of the provider, then filters the live market to rows that match batch inference. It is useful when you have already chosen H200 NVL and need the cheapest provider path for the workload.
Offline inference queues, embedding jobs, catch-up workloads, and cost-sensitive internal processing. The table keeps provider/GPU and provider/workload links close together so you can keep narrowing the shortlist without going back to the homepage.
Cheapest H200 NVL provider for batch inference
The cheapest tracked H200 NVL row for batch inference is $3.46/hr on Vast.ai with on-demand pricing. The broader batch inference floor is $0.13/hr with L40 on GCP, so the cheapest H200 NVL row is $3.33/hr above that floor.
How this GPU workload page is computed
We filter the compare payload to H200 NVL, require 24GB+ workload fit, and sort current spot, community, and on-demand provider rows by median hourly price.
H200 NVL batch inference pricing FAQ
Is H200 NVL good for batch inference?
H200 NVL clears the 24GB+ filter for batch inference. The cheapest tracked H200 NVL row for batch inference is $3.46/hr on Vast.ai with on-demand pricing.
What is the cheapest H200 NVL provider for batch inference?
The cheapest tracked H200 NVL row for batch inference is $3.46/hr on Vast.ai with on-demand pricing.
How does H200 NVL compare with the broader batch inference market?
The broader batch inference floor is $0.13/hr with L40 on GCP, so the cheapest H200 NVL row is $3.33/hr above that floor.
How fresh is this H200 NVL batch inference page?
The page is recalculated from the latest stored spot, community, and on-demand rows. The freshest qualifying row visible here is from Jun 21, 2026.
Next searches after H200 NVL batch inference
Use these links to move into the base GPU page, the workload guide, provider workload slices, and adjacent GPU workload pages.