L40 batch inference price by provider
Batch inference can use cheaper and less stable pricing modes when queues are flexible and retries are acceptable. For L40, the current tracked provider floor is $0.13/hr on GCP.
Current L40 rows for batch inference
L40 clears the 24GB+ filter for batch inference. L40 is currently at the tracked market floor for batch inference.
| Provider | Type | Median hourly | Monthly | Workload page | Offers | Updated |
|---|---|---|---|---|---|---|
| GCP | spot | $0.13/hr | $96/mo | GCP batch inference | 39 | Jun 21, 2026 |
| Vast.ai | on-demand | $0.45/hr | $331/mo | Vast.ai batch inference | 1 | Jun 21, 2026 |
| GCP | on-demand | $0.66/hr | $482/mo | GCP batch inference | 55 | Jun 21, 2026 |
| RunPod | community | $0.74/hr | $540/mo | RunPod batch inference | 2 | Jun 21, 2026 |
| RunPod | spot | $0.89/hr | $650/mo | RunPod batch inference | 2 | Jun 21, 2026 |
| RunPod | on-demand | $0.91/hr | $661/mo | RunPod batch inference | 2 | Jun 21, 2026 |
| Lambda | on-demand | $1.29/hr | $942/mo | Lambda batch inference | 1 | Jun 21, 2026 |
| AWS | on-demand | $3.39/hr | $2,471/mo | AWS batch inference | 16 | Jun 21, 2026 |
| Oracle | on-demand | $3.50/hr | $2,555/mo | Oracle batch inference | 1 | Jun 21, 2026 |
L40 for batch inference: current provider pricing
This page starts with the GPU instead of the provider, then filters the live market to rows that match batch inference. It is useful when you have already chosen L40 and need the cheapest provider path for the workload.
Offline inference queues, embedding jobs, catch-up workloads, and cost-sensitive internal processing. The table keeps provider/GPU and provider/workload links close together so you can keep narrowing the shortlist without going back to the homepage.
Cheapest L40 provider for batch inference
The cheapest tracked L40 row for batch inference is $0.13/hr on GCP with spot pricing. L40 is currently at the tracked market floor for batch inference.
How this GPU workload page is computed
We filter the compare payload to L40, require 24GB+ workload fit, and sort current spot, community, and on-demand provider rows by median hourly price.
L40 batch inference pricing FAQ
Is L40 good for batch inference?
L40 clears the 24GB+ filter for batch inference. The cheapest tracked L40 row for batch inference is $0.13/hr on GCP with spot pricing.
What is the cheapest L40 provider for batch inference?
The cheapest tracked L40 row for batch inference is $0.13/hr on GCP with spot pricing.
How does L40 compare with the broader batch inference market?
L40 is currently at the tracked market floor for batch inference.
How fresh is this L40 batch inference page?
The page is recalculated from the latest stored spot, community, and on-demand rows. The freshest qualifying row visible here is from Jun 21, 2026.
Next searches after L40 batch inference
Use these links to move into the base GPU page, the workload guide, provider workload slices, and adjacent GPU workload pages.