Crusoe Cloud
Pricing
Flexible pricing options to meet your needs. Choose reserved, on-demand, or spot pricing for GPU instances, and pay-as-you-go or provisioned throughput for Crusoe Managed Inference.

Built for value, scalability & speed
Cost-effective performance
Our AI-optimized hardware and lightweight virtualization cut waste and unlock performance, so you get more done with less.
Growth-aligned commitments
Avoid GPU lock-ins with tailored agreements that scale with your needs and budget.
Flexible consumption models
Access our full portfolio of LLMs and generative models with pay-as-you-go pricing, or use our GPU and CPU offerings with spot, on-demand, or reserved pricing options.
Pricing for compute instances and managed AI services
GPU instance pricing
Access the latest high-performance GPUs including NVIDIA GB200 NVL72 and AMD MI355X. Pay by the hour for maximum agility and unthrottled compute, or contact us to lock in guaranteed resources at our lowest rates.
CPU instance pricing
Ideal for data processing, model checkpointing, and orchestrating your GPU clusters. Choose from a variety of vCPU and RAM configurations.
General-purpose: $0.04/vCPU-hr
Storage-optimized: $0.09/vCPU-hr
Storage
Reliable, low-latency storage designed to handle the massive datasets and high-throughput demands of modern AI workloads.
Persistent disks: $0.08 per GiB/month
Shared disks: $0.07 per GiB/month
Managed Kubernetes
A fully managed cluster that simplifies deployment and scaling of your AI applications across GPU and CPU resources.
Cluster pricing: $0.10 per cluster hour
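Taken together, the instance, storage, and cluster rates above make rough monthly estimates straightforward. A minimal Python sketch, assuming a hypothetical configuration (8 general-purpose vCPUs, 500 GiB of persistent disk, one managed cluster, and a 730-hour month):

```python
# Hypothetical monthly estimate built from the published rates above.
# The configuration (8 vCPUs, 500 GiB, 1 cluster) is an illustrative assumption.
VCPU_RATE = 0.04       # $/vCPU-hr, general-purpose CPU instance
DISK_RATE = 0.08       # $/GiB-month, persistent disk
CLUSTER_RATE = 0.10    # $/cluster-hr, managed Kubernetes
HOURS_PER_MONTH = 730  # average hours in a month

def monthly_estimate(vcpus: int, disk_gib: int, clusters: int = 1) -> float:
    compute = vcpus * VCPU_RATE * HOURS_PER_MONTH
    storage = disk_gib * DISK_RATE
    k8s = clusters * CLUSTER_RATE * HOURS_PER_MONTH
    return round(compute + storage + k8s, 2)

print(monthly_estimate(vcpus=8, disk_gib=500))  # → 346.6
```

GPU instance rates are quoted separately (and vary by reservation type), so they are deliberately left out of this sketch.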
Managed inference (pay as you go)
Seamlessly integrate the industry's leading Large Language Models (LLMs) and generative models into your applications with flexible pay-as-you-go pricing.
Prices per 1M tokens: Input / Output / Cached

DeepSeek R1 0528:               $1.35 / $5.40 / $0.68
DeepSeek V3 0324:               $0.50 / $1.50 / $0.25
GPT-OSS 120B:                   $0.15 / $0.60 / $0.08
Llama 3.3 70B Instruct:         $0.25 / $0.75 / $0.13
Qwen3 235B A22B Instruct 2507:  $0.22 / $0.80 / $0.11
Gemma 3 12B:                    $0.08 / $0.30 / $0.04
Kimi-K2 Thinking:               $0.60 / $2.50 / $0.30
Managed inference (provisioned throughput)
Ensure guaranteed throughput for your generative AI applications. Provisioned throughput is transacted via AI Model Units (AMUs). The longer your commitment, the lower your cost. Contact sales to learn more about provisioned throughput.
Frequently asked questions


What is the difference between on-demand and spot pricing?
On-demand pricing is our most flexible option: instances are priced hourly, billed by the minute, and carry no minimum commitment, making it ideal for workloads where uptime, predictability, and stability are critical. Spot pricing offers significant discounts, but is better suited to fault-tolerant workloads that can be stopped and restarted without major disruption.
Do you offer discounts for reserved capacity?
Yes. Reserved capacity is a custom agreement where you commit to a specific resource volume for a defined period, resulting in our deepest possible discounts and guaranteed resource availability. Contact our sales team to discuss options.
Are there setup fees for on-demand instances?
No. There are no upfront setup fees for on-demand GPU or CPU instances. Our billing is transparent; you only pay for the resources you consume.
How are instances billed?
All Crusoe Cloud GPU and CPU instances are billed by the minute.
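As a concrete illustration of per-minute billing, the hourly rate is simply prorated to the minute. A small sketch, using the general-purpose CPU rate listed above and a hypothetical 4-vCPU, 150-minute run:

```python
# Per-minute billing: the hourly rate is prorated to the minute.
# The workload (4 vCPUs for 150 minutes) is a hypothetical example.
VCPU_RATE_PER_HOUR = 0.04  # $/vCPU-hr, general-purpose CPU instance

def instance_cost(vcpus: int, minutes: int) -> float:
    return round(vcpus * VCPU_RATE_PER_HOUR * minutes / 60, 4)

print(instance_cost(vcpus=4, minutes=150))  # → 0.4
```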
Do you charge for network ingress or egress?
At this time, Crusoe Cloud does not charge for network ingress or egress, either within a VPC or to/from the public internet.
How do I get the best rates?
The best rates are achieved through our reserved capacity option. These are custom, tailored contracts designed to align with your project’s timeline and budget. Contact our sales team to secure guaranteed discounts compared to on-demand rates.
Is there a minimum commitment for reserved capacity?
While longer-term commitments naturally offer deeper savings, our commitment structures are tailored to your needs. We avoid rigid, one-size-fits-all contracts so that modern AI teams keep the agility they require. Speak with sales to design the minimum commitment that works for your roadmap.
What makes Crusoe’s infrastructure AI-optimized?
Our infrastructure is purpose-built for AI. We deploy the latest high-interconnect NVIDIA GPUs, high-performance networking built on industry best practices for RDMA, and low-latency storage, and we proactively monitor infrastructure to detect and remediate issues before they impact your workloads. This combination eliminates data bottlenecks and improves reliability, so your models train faster and more efficiently at scale.
How does Managed Inference billing work?
Crusoe Managed Inference uses a usage-based, pay-as-you-go model billed per 1 million tokens. Input tokens are the text your application sends to the model; output tokens are the text the model generates in response. Cached tokens apply when the model reuses previous context or prompts, and are typically billed at a much lower rate.
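The token arithmetic can be sketched as follows. The rates are the GPT-OSS 120B pay-as-you-go rates from the table above; treating cached tokens as a discounted subset of the input tokens is an assumption, not a confirmed billing rule:

```python
# Pay-as-you-go inference billing: per 1M tokens, split across
# input / output / cached rates (GPT-OSS 120B rates from the table above).
# Assumption: cached tokens replace the input rate for the cached portion.
INPUT_RATE = 0.15   # $ per 1M input tokens
OUTPUT_RATE = 0.60  # $ per 1M output tokens
CACHED_RATE = 0.08  # $ per 1M cached tokens

def inference_cost(input_tokens: int, output_tokens: int,
                   cached_tokens: int = 0) -> float:
    m = 1_000_000
    billable_input = input_tokens - cached_tokens
    return round(
        billable_input / m * INPUT_RATE
        + cached_tokens / m * CACHED_RATE
        + output_tokens / m * OUTPUT_RATE,
        4,
    )

# 2M input tokens (0.5M served from cache) and 1M output tokens:
print(inference_cost(2_000_000, 1_000_000, cached_tokens=500_000))  # → 0.865
```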
Are you ready to build something amazing?
