Lambda batch inference GPU prices
Batch inference can tolerate cheaper, less stable pricing modes when queues are flexible and retries are acceptable. The cheapest tracked Lambda row in this slice is the RTX 6000Ada at $0.69/hr.
Current Lambda rows for batch inference
Lambda has 7 qualifying GPU models across 7 current price rows for this workload. The overall tracked market floor is $0.13/hr on GCP L40, which puts Lambda's cheapest row $0.56/hr above that floor.
| GPU | VRAM | Type | Median hourly | Monthly | Offers | Updated |
|---|---|---|---|---|---|---|
| RTX 6000Ada (Mid-Range) | 48GB GDDR6 | on-demand | $0.69/hr | $504/mo | 1 | Jun 21, 2026 |
| L40 (Mid-Range) | 48GB GDDR6 | on-demand | $1.29/hr | $942/mo | 1 | Jun 21, 2026 |
| A100 SXM4 (High Performance) | 80GB HBM2e | on-demand | $1.99/hr | $1,453/mo | 6 | Jun 21, 2026 |
| H200 (Flagship) | 141GB HBM3e | on-demand | $2.29/hr | $1,672/mo | 1 | Jun 21, 2026 |
| H100 PCIE (High Performance) | 80GB HBM3 | on-demand | $3.29/hr | $2,402/mo | 1 | Jun 21, 2026 |
| H100 SXM (Flagship) | 80GB HBM3 | on-demand | $4.14/hr | $3,022/mo | 4 | Jun 21, 2026 |
| B200 (Flagship) | 192GB HBM3e | on-demand | $6.84/hr | $4,993/mo | 4 | Jun 21, 2026 |
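The Monthly figures appear consistent with multiplying the median hourly price by roughly 730 hours (365 × 24 / 12) and rounding to the nearest dollar. That factor is inferred from the numbers shown, not stated by the page, and the helper name `monthly_cost` is hypothetical. A minimal sketch under that assumption:

```python
from decimal import Decimal, ROUND_HALF_UP

# Assumption: 730 billable hours per month (365 * 24 / 12), inferred from the table.
HOURS_PER_MONTH = Decimal(730)


def monthly_cost(hourly: str) -> int:
    """Estimate whole-dollar monthly cost for a GPU that runs continuously."""
    total = Decimal(hourly) * HOURS_PER_MONTH
    return int(total.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


# Median hourly prices copied from the Lambda rows above.
hourly_medians = {
    "RTX 6000Ada": "0.69",
    "L40": "1.29",
    "A100 SXM4": "1.99",
    "H200": "2.29",
    "H100 PCIE": "3.29",
    "H100 SXM": "4.14",
    "B200": "6.84",
}

for gpu, price in hourly_medians.items():
    print(f"{gpu}: ${monthly_cost(price):,}/mo")
```

Batch jobs rarely run around the clock, so scaling `HOURS_PER_MONTH` down to the hours a queue actually needs gives a closer budget estimate than the always-on figure.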
Lambda batch inference pricing overview
This page narrows Lambda's GPU catalog to the rows most likely to matter for batch inference. It is built for buyers who already know the provider they are evaluating and need a workload-specific price shortlist.
Typical batch inference workloads include offline inference queues, embedding jobs, catch-up workloads, and cost-sensitive internal processing. Use the table above to move from the cheapest visible row into adjacent provider/GPU pages, broader guides, and market-floor comparisons.
Cheapest Lambda row for batch inference
Lambda currently starts at $0.69/hr for batch inference with the RTX 6000Ada at on-demand pricing. The overall tracked market floor is $0.13/hr on GCP L40, which puts Lambda's cheapest row $0.56/hr above that floor.
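The gap to the market floor is plain subtraction, and a ratio can make the difference easier to judge. The sketch below uses only the two prices quoted on this page; the function name `gap_to_floor` is hypothetical, and the page does not say which pricing type the GCP L40 floor row uses, so the two rows may not be like-for-like.

```python
from decimal import Decimal


def gap_to_floor(provider_min: str, market_floor: str) -> tuple[Decimal, Decimal]:
    """Return (absolute gap in $/hr, provider price as a multiple of the floor)."""
    provider = Decimal(provider_min)
    floor = Decimal(market_floor)
    multiple = (provider / floor).quantize(Decimal("0.1"))
    return provider - floor, multiple


# Prices quoted on this page: Lambda RTX 6000Ada and the GCP L40 floor.
gap, multiple = gap_to_floor("0.69", "0.13")
print(f"${gap}/hr above the floor, about {multiple}x the floor price")
```

`Decimal` is used instead of `float` so the subtraction yields exactly 0.56 rather than a binary floating-point approximation.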
How this workload slice is computed
We filter the live compare payload to spot, community, and on-demand rows with at least 24GB of VRAM, then sort qualifying Lambda rows by current median hourly price.
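The filter-then-sort step described above can be sketched as follows. The `PriceRow` fields and the `batch_inference_slice` function are hypothetical stand-ins, since the actual payload schema is not shown on this page. The two rows marked as made up exist only to show the filter dropping something.

```python
from dataclasses import dataclass

# Thresholds taken from the description above.
ALLOWED_TYPES = {"spot", "community", "on-demand"}
MIN_VRAM_GB = 24


@dataclass(frozen=True)
class PriceRow:
    # Hypothetical field names; the real payload schema is not published here.
    provider: str
    gpu: str
    vram_gb: int
    pricing_type: str
    median_hourly: float


def batch_inference_slice(rows: list[PriceRow], provider: str) -> list[PriceRow]:
    """Keep qualifying rows for one provider, cheapest median hourly price first."""
    qualifying = [
        r for r in rows
        if r.provider == provider
        and r.pricing_type in ALLOWED_TYPES
        and r.vram_gb >= MIN_VRAM_GB
    ]
    return sorted(qualifying, key=lambda r: r.median_hourly)


rows = [
    PriceRow("Lambda", "H100 SXM", 80, "on-demand", 4.14),
    PriceRow("Lambda", "RTX 6000Ada", 48, "on-demand", 0.69),
    PriceRow("Lambda", "L40", 48, "on-demand", 1.29),
    # Made-up rows for illustration only; neither appears on this page.
    PriceRow("Lambda", "Example-16GB", 16, "on-demand", 0.40),  # dropped: under 24GB
    PriceRow("Lambda", "Example-Reserved", 80, "reserved", 1.50),  # dropped: type not in slice
]

shortlist = batch_inference_slice(rows, "Lambda")
for row in shortlist:
    print(f"{row.gpu}: ${row.median_hourly:.2f}/hr ({row.pricing_type})")
```

The first element of the returned list corresponds to the "cheapest Lambda row" quoted elsewhere on this page.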
Lambda batch inference pricing FAQ
What is the cheapest Lambda option for batch inference?
Lambda currently starts at $0.69/hr for batch inference with the RTX 6000Ada at on-demand pricing.
Is Lambda cheapest for batch inference?
No. The overall tracked market floor is $0.13/hr on GCP L40, which puts Lambda's cheapest row $0.56/hr above that floor.
What GPUs count toward this batch inference page?
This page includes GPUs with at least 24GB of VRAM and uses spot, community, and on-demand pricing rows.
How fresh is this Lambda batch inference page?
The rows are recalculated from the latest stored provider snapshot. The freshest qualifying row visible here is from Jun 21, 2026.