Model provider cost

Host Qwen2.5 72B Instruct on Lambda

Qwen2.5 72B Instruct needs at least 1x 80GB+ GPU, and Lambda's cheapest qualifying row is currently the A100 SXM4 at $1.99/hr.

1x 80GB+ GPU | 72B params | Premium multilingual | Tongyi Qianwen License
Cheapest on this provider: $1.99/hr (A100 SXM4)
Monthly estimate: $1,453/mo (730 hours at the current median)
VRAM baseline: 80GB (1x 80GB+ GPU)
Qualifying rows: 5
Updated Jun 21, 2026

Lambda rows that can host Qwen2.5 72B Instruct

The cheapest tracked way to host Qwen2.5 72B Instruct on Lambda is A100 SXM4 at $1.99/hr. The overall tracked market floor is $1.00/hr on Vast.ai, so Lambda is $0.99/hr above the current floor.

GPU       | VRAM  | Per GPU  | Estimated hourly | Estimated monthly | Updated
A100 SXM4 | 80GB  | $1.99/hr | $1.99/hr         | $1,453/mo         | Jun 21, 2026
H200      | 141GB | $2.29/hr | $2.29/hr         | $1,672/mo         | Jun 21, 2026
H100 PCIe | 80GB  | $3.29/hr | $3.29/hr         | $2,402/mo         | Jun 21, 2026
H100 SXM  | 80GB  | $4.14/hr | $4.14/hr         | $3,022/mo         | Jun 21, 2026
B200      | 192GB | $6.84/hr | $6.84/hr         | $4,993/mo         | Jun 21, 2026
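
The monthly column appears to be the hourly price multiplied by the 730-hour month this page uses. The Python sketch below reproduces those figures from the table rows and picks the cheapest row that meets the 80GB baseline. The helper names `monthly_estimate` and `cheapest_qualifying` are hypothetical and are not part of any tracker API.

```python
from decimal import Decimal, ROUND_HALF_UP

HOURS_PER_MONTH = 730  # month length used by this page's monthly estimates

# (GPU, VRAM in GB, hourly price in USD), copied from the table above
ROWS = [
    ("A100 SXM4", 80, Decimal("1.99")),
    ("H200", 141, Decimal("2.29")),
    ("H100 PCIe", 80, Decimal("3.29")),
    ("H100 SXM", 80, Decimal("4.14")),
    ("B200", 192, Decimal("6.84")),
]


def monthly_estimate(hourly: Decimal) -> Decimal:
    """Hourly price times 730 hours, rounded to the nearest dollar."""
    return (hourly * HOURS_PER_MONTH).quantize(Decimal("1"), rounding=ROUND_HALF_UP)


def cheapest_qualifying(rows, min_vram_gb=80):
    """Cheapest row whose single-GPU VRAM meets the baseline."""
    qualifying = [row for row in rows if row[1] >= min_vram_gb]
    return min(qualifying, key=lambda row: row[2])


if __name__ == "__main__":
    for gpu, vram_gb, hourly in ROWS:
        print(f"{gpu} ({vram_gb}GB): ${hourly}/hr -> ${monthly_estimate(hourly):,}/mo")
    print("Cheapest qualifying row:", cheapest_qualifying(ROWS)[0])
```

Decimal arithmetic is used instead of floats so that rounding to whole dollars is not affected by binary floating-point error.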

Why this setup does or does not fit

VRAM floor

1x 80GB+ GPU

1x 80GB GPU minimum; 2x 80GB for healthier serving margins. Long prompts, batching, and KV cache can require extra headroom.
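
To see where the 1x versus 2x recommendation comes from, a rough weights-only estimate helps. The sketch below assumes the 72B parameter count from this page and standard bytes-per-parameter values for each precision. The page does not state which precision its baseline assumes, so treat the precision table as an illustration rather than the tracker's method.

```python
PARAMS = 72e9  # parameter count from this page ("72B params")

# Bytes per parameter for common weight precisions (illustrative set)
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "int8/fp8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in decimal GB (1e9 bytes)."""
    return params * bytes_per_param / 1e9


def weights_fit(params: float, bytes_per_param: float, gpus: int, vram_gb: float = 80) -> bool:
    """True if the weights alone fit in combined VRAM.

    KV cache, activations, and runtime overhead are NOT counted here.
    """
    return weight_memory_gb(params, bytes_per_param) <= gpus * vram_gb


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        gb = weight_memory_gb(PARAMS, nbytes)
        print(
            f"{name}: ~{gb:.0f} GB weights | "
            f"1x 80GB: {weights_fit(PARAMS, nbytes, 1)} | "
            f"2x 80GB: {weights_fit(PARAMS, nbytes, 2)}"
        )
```

Under these assumptions, 16-bit weights come to roughly 144 GB and need two 80GB GPUs, while 8-bit weights come to roughly 72 GB and fit on one 80GB GPU with little room left for KV cache.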

Model quality

Premium multilingual

Premium open-weight quality with wide language coverage, but it requires 80GB-class GPUs.

Operational note

Qwen 72B

Best suited to teams that want premium open-weight quality with wide language coverage.

Qwen2.5 72B Instruct on Lambda FAQ

Can I host Qwen2.5 72B Instruct on Lambda?

Yes. Lambda currently has 5 qualifying rows, and the cheapest tracked option is the A100 SXM4 at $1.99/hr.

What GPU memory does Qwen2.5 72B Instruct need?

Our baseline for Qwen2.5 72B Instruct is 1x 80GB+ GPU. The practical recommendation is 1x 80GB GPU minimum; 2x 80GB for healthier serving margins.
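
The extra headroom in that recommendation is mostly KV cache, which grows with the number of tokens held in memory. The sketch below estimates it from architecture values that are assumptions here (80 layers, 8 key/value heads under grouped-query attention, head dimension 128, 16-bit cache entries). Verify them against the model's published `config.json` before relying on the numbers.

```python
# Assumed architecture values for Qwen2.5 72B; check the model's config.json
NUM_LAYERS = 80
NUM_KV_HEADS = 8   # grouped-query attention: fewer KV heads than query heads
HEAD_DIM = 128
KV_BYTES = 2       # 16-bit cache entries


def kv_cache_bytes_per_token() -> int:
    """Bytes of KV cache per token: keys and values for every layer and KV head."""
    return 2 * NUM_LAYERS * NUM_KV_HEADS * HEAD_DIM * KV_BYTES


def kv_cache_gb(total_tokens: int) -> float:
    """KV cache size in decimal GB for all tokens held across the batch."""
    return total_tokens * kv_cache_bytes_per_token() / 1e9


if __name__ == "__main__":
    for tokens in (4_096, 32_768, 131_072):
        print(f"{tokens:>7} tokens -> ~{kv_cache_gb(tokens):.1f} GB KV cache")
```

`total_tokens` counts every token resident across all concurrent requests, so batching several long prompts multiplies the figure. This is why a second 80GB GPU gives a healthier serving margin.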

Is Lambda the cheapest provider for Qwen2.5 72B Instruct?

No. The overall tracked market floor is $1.00/hr on Vast.ai, so Lambda's cheapest row ($1.99/hr) is $0.99/hr above the current floor.
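
The gap to the floor can be expressed per hour, per month, and as a percentage. This is a small sketch using the two prices quoted above and the page's 730-hour month. The function name `premium_over_floor` is hypothetical.

```python
from decimal import Decimal

HOURS_PER_MONTH = 730  # month length used by this page's monthly estimates


def premium_over_floor(provider_hourly: Decimal, floor_hourly: Decimal):
    """Return (hourly gap, monthly gap, percent above the floor)."""
    gap = provider_hourly - floor_hourly
    monthly_gap = gap * HOURS_PER_MONTH
    percent = gap / floor_hourly * 100
    return gap, monthly_gap, percent


if __name__ == "__main__":
    gap, monthly_gap, percent = premium_over_floor(Decimal("1.99"), Decimal("1.00"))
    print(f"${gap}/hr above the floor, about ${monthly_gap:.0f}/mo, {percent:.0f}% higher")
```

Run around the clock, the $0.99/hr gap adds up to roughly $723 per month. Price is not the only factor when comparing a marketplace floor with a dedicated provider.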

How fresh is this Lambda Qwen2.5 72B Instruct cost page?

This page recalculates from the latest tracked on-demand rows. The freshest qualifying Lambda row shown here is from Jun 21, 2026.

Compare this setup