GPU workload page

H100 SXM batch inference price by provider

Batch inference can use cheaper, less stable pricing modes such as spot and community rows when queues are flexible and retries are acceptable. For H100 SXM, the current tracked provider floor is $2.27/hr on Azure.

80GB HBM3 | batch inference | 24GB+ floor | Updated Jun 21, 2026


Cheapest provider: Azure, $2.27/hr

Monthly estimate: $1,658/mo (730 hours at current median price)

Provider coverage: 10 current rows, updated Jun 21, 2026

Market gap: +$2.14/hr (floor is L40 on GCP)

Live provider pricing

Current H100 SXM rows for batch inference

H100 SXM clears the 24GB+ filter for batch inference. The broader batch inference floor is $0.13/hr with L40 on GCP, so the cheapest H100 SXM row is $2.14/hr above that floor.


Provider	Type	Median hourly	Monthly	Workload page	Offers	Updated
Azure	spot	$2.27/hr	$1,658/mo	Azure batch inference	6	Jun 21, 2026
Vast.ai	on-demand	$2.40/hr	$1,752/mo	Vast.ai batch inference	15	Jun 21, 2026
GCP	spot	$2.51/hr	$1,833/mo	GCP batch inference	49	Jun 21, 2026
RunPod	spot	$2.69/hr	$1,964/mo	RunPod batch inference	1	Jun 21, 2026
RunPod	community	$2.69/hr	$1,964/mo	RunPod batch inference	1	Jun 21, 2026
RunPod	on-demand	$3.29/hr	$2,402/mo	RunPod batch inference	1	Jun 21, 2026
Lambda	on-demand	$4.14/hr	$3,022/mo	Lambda batch inference	4	Jun 21, 2026
GCP	on-demand	$4.84/hr	$3,534/mo	GCP batch inference	178	Jun 21, 2026
AWS	on-demand	$6.88/hr	$5,022/mo	AWS batch inference	2	Jun 21, 2026
Oracle	on-demand	$10.00/hr	$7,300/mo	Oracle batch inference	1	Jun 21, 2026
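
The Monthly column follows the page's convention of 730 hours at the median hourly price. A minimal Python sketch of that arithmetic is below; the helper name `monthly_estimate` is hypothetical, and the inputs are the rounded hourly medians displayed in the table rather than whatever unrounded values the page stores.

```python
from decimal import Decimal, ROUND_HALF_UP

HOURS_PER_MONTH = 730  # convention stated on this page


def monthly_estimate(hourly_usd: str) -> Decimal:
    """Hypothetical helper: hourly median (USD) times 730 hours, in whole dollars."""
    total = Decimal(hourly_usd) * HOURS_PER_MONTH
    return total.quantize(Decimal("1"), rounding=ROUND_HALF_UP)


# Rounded hourly medians copied from the table above.
for provider, hourly in [("Vast.ai", "2.40"), ("RunPod", "2.69"), ("Oracle", "10.00")]:
    print(provider, monthly_estimate(hourly))
```

Recomputing from the displayed prices can land about a dollar away from some rows. For example, $2.27/hr gives $1,657 here against $1,658 in the table. That would be consistent with the page multiplying unrounded medians, but this is an inference and not something the page states.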

Search guide

H100 SXM for batch inference: current provider pricing

This page starts with the GPU instead of the provider, then filters the live market to rows that match batch inference. It is useful when you have already chosen H100 SXM and need the cheapest provider path for the workload.

Batch inference here covers offline inference queues, embedding jobs, catch-up workloads, and cost-sensitive internal processing. The table keeps provider/GPU and provider/workload links close together so you can keep narrowing the shortlist without going back to the homepage.

Cheapest provider right now

Cheapest H100 SXM provider for batch inference

The cheapest tracked H100 SXM row for batch inference is $2.27/hr on Azure with spot pricing. The broader batch inference floor is $0.13/hr with L40 on GCP, so the cheapest H100 SXM row is $2.14/hr above that floor.
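
The market gap is a plain subtraction: the cheapest H100 SXM row minus the cheapest row of any GPU that fits the workload. A small Python sketch, with the hypothetical function name `market_gap` and the two prices quoted above:

```python
from decimal import Decimal


def market_gap(gpu_floor_usd: str, workload_floor_usd: str) -> Decimal:
    """Hypothetical helper: how far a GPU's cheapest row sits above the workload floor."""
    return Decimal(gpu_floor_usd) - Decimal(workload_floor_usd)


# $2.27/hr is the H100 SXM floor (Azure spot); $0.13/hr is the broader floor (L40 on GCP).
gap = market_gap("2.27", "0.13")
print(f"+${gap}/hr")
```

Using `Decimal` instead of floats keeps the result at exactly 2.14 and avoids binary rounding noise in the printed figure.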

Methodology and freshness

How this GPU workload page is computed

We filter the compare payload to H100 SXM, require 24GB+ workload fit, and sort current spot, community, and on-demand provider rows by median hourly price.
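
The filter-and-sort step described above could look roughly like this in Python. This is a sketch under stated assumptions, not the site's actual pipeline: the `Row` fields, the `gpu_workload_rows` function, and the shape of the compare payload are all hypothetical, and only a few rows from the table are reproduced.

```python
from dataclasses import dataclass

MIN_VRAM_GB = 24  # the 24GB+ workload fit described above
ALLOWED_TYPES = {"spot", "community", "on-demand"}


@dataclass(frozen=True)
class Row:
    """Hypothetical shape of one provider row in the compare payload."""
    provider: str
    gpu: str
    vram_gb: int
    pricing_type: str
    median_hourly: float


def gpu_workload_rows(rows, gpu, min_vram_gb=MIN_VRAM_GB):
    """Keep rows for one GPU that meet the VRAM floor, cheapest median first."""
    matches = [
        r for r in rows
        if r.gpu == gpu
        and r.vram_gb >= min_vram_gb
        and r.pricing_type in ALLOWED_TYPES
    ]
    # sorted() is stable, so rows with equal prices keep their input order.
    return sorted(matches, key=lambda r: r.median_hourly)


sample = [
    Row("GCP", "H100 SXM", 80, "on-demand", 4.84),
    Row("Azure", "H100 SXM", 80, "spot", 2.27),
    Row("GCP", "L40", 48, "spot", 0.13),  # pricing type assumed for illustration
    Row("Vast.ai", "H100 SXM", 80, "on-demand", 2.40),
]

ranked = gpu_workload_rows(sample, "H100 SXM")
print([(r.provider, r.pricing_type, r.median_hourly) for r in ranked])
```

The L40 row passes the VRAM floor but is dropped by the GPU filter, which is why it sets the broader workload floor without appearing in the H100 SXM table.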

FAQ

H100 SXM batch inference pricing FAQ

Is H100 SXM good for batch inference?

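On memory fit, yes: with 80GB of HBM3, H100 SXM clears the 24GB+ filter for batch inference. On price, the cheapest tracked H100 SXM row for batch inference is $2.27/hr on Azure with spot pricing, which is $2.14/hr above the broader batch inference floor.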

What is the cheapest H100 SXM provider for batch inference?

The cheapest tracked H100 SXM row for batch inference is $2.27/hr on Azure with spot pricing.

How does H100 SXM compare with the broader batch inference market?

The broader batch inference floor is $0.13/hr with L40 on GCP, so the cheapest H100 SXM row is $2.14/hr above that floor.

How fresh is this H100 SXM batch inference page?

The page is recalculated from the latest stored spot, community, and on-demand rows. The freshest qualifying row visible here is from Jun 21, 2026.
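
The "freshest qualifying row" is simply the latest date in the Updated column. A brief Python sketch, assuming the labels use the `Jun 21, 2026` format shown on this page and an English locale for month abbreviations; the function name `freshest` and the older sample date are hypothetical:

```python
from datetime import datetime


def freshest(updated_labels):
    """Latest date among 'Updated' labels such as 'Jun 21, 2026'."""
    return max(datetime.strptime(label, "%b %d, %Y").date() for label in updated_labels)


# The last label is a made-up older date, included only to exercise max().
labels = ["Jun 21, 2026", "Jun 21, 2026", "Jun 19, 2026"]
print(freshest(labels).strftime("%b %d, %Y"))
```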
