Model provider cost

Host Mistral Nemo Instruct 12B on AWS

Mistral Nemo Instruct 12B needs 1x 24GB+ GPU, and AWS's current cheapest qualifying row is A100 SXM4 at $3.09/hr.

1x 24GB+ GPU 12B params Strong long-context Apache 2.0
Cheapest on this provider
$3.09/hr
A100 SXM4
Monthly estimate
$2,254/mo
730 hours at the current median
VRAM baseline
24GB
1x 24GB+ GPU
Qualifying rows
4
Updated Jun 21, 2026

AWS rows that can host Mistral Nemo Instruct 12B

The cheapest tracked way to host Mistral Nemo Instruct 12B on AWS is A100 SXM4 at $3.09/hr. The overall tracked market floor is $0.40/hr on Vast.ai, so AWS is $2.69/hr above the current floor.

GPU VRAM Per GPU Estimated hourly Estimated monthly Updated
A100 SXM4 80GB $3.09/hr $3.09/hr $2,254/mo Jun 21, 2026
L40 48GB $3.39/hr $3.39/hr $2,471/mo Jun 21, 2026
H100 SXM 80GB $6.88/hr $6.88/hr $5,022/mo Jun 21, 2026
H200 141GB $7.91/hr $7.91/hr $5,776/mo Jun 21, 2026

Why this setup does or does not fit

VRAM floor

1x 24GB+ GPU

1x 24GB to 48GB GPU. Long prompts, batching, and KV cache can require extra headroom.

Model quality

Strong long-context

Good for summarization and retrieval-heavy work, though it is not a frontier reasoning model.

Operational note

Mistral 12B

Useful when long prompts matter more than absolute frontier reasoning quality.

Mistral Nemo Instruct 12B on AWS FAQ

Can I host Mistral Nemo Instruct 12B on AWS?

The cheapest tracked way to host Mistral Nemo Instruct 12B on AWS is A100 SXM4 at $3.09/hr.

What GPU memory does Mistral Nemo Instruct 12B need?

Our baseline for Mistral Nemo Instruct 12B is 1x 24GB+ GPU. The practical recommendation is 1x 24GB to 48GB GPU.

Is AWS the cheapest provider for Mistral Nemo Instruct 12B?

The overall tracked market floor is $0.40/hr on Vast.ai, so AWS is $2.69/hr above the current floor.

How fresh is this AWS Mistral Nemo Instruct 12B cost page?

This page recalculates from the latest tracked on-demand rows. The freshest qualifying AWS row shown here is from Jun 21, 2026.

Compare this setup