Provider workload page

Azure batch inference GPU prices

Batch inference can use cheaper but less stable pricing modes, such as spot, when queues are flexible and retries are acceptable. The cheapest tracked Azure row in this slice is the A100 PCIE on spot pricing at $1.04/hr.

24GB+ VRAM · spot pricing · community pricing · on-demand pricing · Updated Jun 21, 2026

Cheapest qualifying row

$1.04/hr

A100 PCIE · spot

Monthly estimate

$758/mo

730 hours at current median price

Qualifying GPUs

8 current rows, updated Jun 21, 2026

Market gap

+$0.91/hr

Market floor is GCP L40 at $0.13/hr

Live workload pricing

Current Azure rows for batch inference

Azure has 4 qualifying GPU models across 8 current price rows for this workload. The overall tracked market floor is $0.13/hr on GCP L40, making Azure $0.91/hr above that floor.

GPU	Tier	VRAM	Type	Median hourly	Monthly	Offers	Updated
A100 PCIE	Mid-Range	80GB HBM2e	spot	$1.04/hr	$758/mo	9	Jun 21, 2026
A100 SXM4	High Performance	80GB HBM2e	spot	$1.06/hr	$777/mo	3	Jun 21, 2026
H100 SXM	Flagship	80GB HBM3	spot	$2.27/hr	$1,658/mo	6	Jun 21, 2026
A100 PCIE	Mid-Range	80GB HBM2e	on-demand	$3.67/hr	$2,681/mo	9	Jun 21, 2026
A100 SXM4	High Performance	80GB HBM2e	on-demand	$4.10/hr	$2,990/mo	3	Jun 21, 2026
H100 NVL	High Performance	94GB HBM3	spot	$6.28/hr	$4,586/mo	6	Jun 21, 2026
H100 NVL	High Performance	94GB HBM3	on-demand	$6.98/hr	$5,095/mo	6	Jun 21, 2026
H100 SXM	Flagship	80GB HBM3	on-demand	$12.29/hr	$8,972/mo	6	Jun 21, 2026
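The Monthly column follows the "730 hours at current median price" rule stated above. A minimal sketch of that arithmetic is shown here; the function name `monthly_estimate` is hypothetical and not taken from the site. Multiplying the displayed $1.04/hr gives a figure slightly above the listed $758/mo, which suggests (as an assumption, not a confirmed detail) that the page multiplies an unrounded median before rounding.

```python
HOURS_PER_MONTH = 730  # stated on this page: "730 hours at current median price"


def monthly_estimate(median_hourly_usd: float, hours: int = HOURS_PER_MONTH) -> float:
    """Estimated monthly cost in USD for one GPU running every hour of the month."""
    return median_hourly_usd * hours


# Uses the displayed (already rounded) A100 PCIE spot rate from the table,
# so the result can differ by a dollar or two from the listed monthly figure.
estimate = monthly_estimate(1.04)
print(f"${estimate:,.2f}/mo")
```

For batch jobs that run only part of the month, passing a smaller `hours` value gives a closer estimate than the always-on figure.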

Search guide

Azure batch inference pricing overview

This page narrows Azure's GPU catalog to the rows most likely to matter for batch inference. It is built for buyers who already know the provider they are evaluating and need a workload-specific price shortlist.

Typical batch inference workloads include offline inference queues, embedding jobs, catch-up workloads, and cost-sensitive internal processing. Use the table above to move from the cheapest visible row into adjacent provider/GPU pages, broader guides, and market-floor comparisons.

Cheapest provider right now

Cheapest Azure row for batch inference

Azure currently starts at $1.04/hr for batch inference with A100 PCIE on spot pricing. The overall tracked market floor is $0.13/hr on GCP L40, making Azure $0.91/hr above that floor.
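The gap quoted here is a plain subtraction of the market floor from the provider's cheapest row. The sketch below reproduces it from the two prices given on this page; `market_gap` and `floor_multiple` are hypothetical helper names used only for illustration.

```python
def market_gap(provider_floor_usd: float, market_floor_usd: float) -> float:
    """Hourly difference between a provider's cheapest row and the market floor."""
    return round(provider_floor_usd - market_floor_usd, 2)


def floor_multiple(provider_floor_usd: float, market_floor_usd: float) -> float:
    """How many times the market floor the provider's cheapest row costs."""
    return round(provider_floor_usd / market_floor_usd, 1)


azure_floor = 1.04   # A100 PCIE spot, from this page
market_floor = 0.13  # GCP L40, from this page

print(f"+${market_gap(azure_floor, market_floor):.2f}/hr")
print(f"{floor_multiple(azure_floor, market_floor)}x the market floor")
```

The two rows describe different GPUs (A100 PCIE versus L40), so the gap compares price floors rather than equivalent hardware.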

Methodology and freshness

How this workload slice is computed

We filter the live compare payload to spot, community, and on-demand rows with at least 24GB of VRAM, then sort qualifying Azure rows by current median hourly price.
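The filter-then-sort step described above can be sketched as follows. The shape of the compare payload is not documented on this page, so the `PriceRow` fields, the `qualifying_rows` function, and the sample rows marked hypothetical are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PriceRow:
    # Hypothetical row shape; the real payload fields are not shown on this page.
    provider: str
    gpu: str
    vram_gb: int
    pricing_type: str
    median_hourly_usd: float


ALLOWED_TYPES = {"spot", "community", "on-demand"}
MIN_VRAM_GB = 24


def qualifying_rows(rows: list[PriceRow], provider: str) -> list[PriceRow]:
    """Keep one provider's rows that pass the workload filter, cheapest first."""
    matches = [
        r for r in rows
        if r.provider == provider
        and r.pricing_type in ALLOWED_TYPES
        and r.vram_gb >= MIN_VRAM_GB
    ]
    return sorted(matches, key=lambda r: r.median_hourly_usd)


sample = [
    PriceRow("Azure", "H100 SXM", 80, "spot", 2.27),
    PriceRow("Azure", "A100 PCIE", 80, "on-demand", 3.67),
    PriceRow("Azure", "A100 PCIE", 80, "spot", 1.04),
    PriceRow("Azure", "ExampleGPU", 16, "spot", 0.50),      # hypothetical: under 24GB
    PriceRow("Azure", "A100 PCIE", 80, "reserved", 0.90),   # hypothetical: excluded type
    PriceRow("OtherCloud", "ExampleGPU", 48, "spot", 0.20),  # hypothetical: other provider
]

for row in qualifying_rows(sample, "Azure"):
    print(f"{row.gpu}\t{row.pricing_type}\t${row.median_hourly_usd:.2f}/hr")
```

The first element of the sorted result corresponds to the "cheapest qualifying row" shown at the top of the page.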

FAQ

Azure batch inference pricing FAQ

What is the cheapest Azure option for batch inference?

Azure currently starts at $1.04/hr for batch inference with A100 PCIE on spot pricing.

Is Azure cheapest for batch inference?

No. The overall tracked market floor is $0.13/hr on GCP L40, which puts Azure's cheapest row $0.91/hr above that floor.

What GPUs count toward this batch inference page?

This page filters to 24GB+ GPUs and uses spot, community, and on-demand pricing.

How fresh is this Azure batch inference page?

The rows are recalculated from the latest stored provider snapshot. The freshest qualifying row visible here is from Jun 21, 2026.
