ABOUT US
BUY TOKENS FORWARD.
SECURE YOUR CAPACITY.
BUY TOKENS FORWARD.
SECURE YOUR CAPACITY.
Procure inference tokens in advance and tap them over a term of up to six months. Lock unit economics, commit capacity terms, and secure supply ahead of demand — across leading open-weight models.
Procure inference tokens in advance and tap them over a term of up to six months. Lock unit economics, commit capacity terms, and secure supply ahead of demand — across leading open-weight models.
HOW IT WORKS
COMIT. LOCK. TAP.
01
COMMIT
Specify the open model, token volume (input / cached / output), term up to six months, and whether batch processing is acceptable. We aggregate committed-use quotes across the provider network.
01
COMMIT
Specify the open model, token volume (input / cached / output), term up to six months, and whether batch processing is acceptable. We aggregate committed-use quotes across the provider network.
02
LOCK
Lock a committed per-token rate for the full term. Settlement (upfront, milestone, or monthly), priority allocation, and rollover terms surface per quote so you can compare bilaterally.
02
LOCK
Lock a committed per-token rate for the full term. Settlement (upfront, milestone, or monthly), priority allocation, and rollover terms surface per quote so you can compare bilaterally.
03
TAP
Tap tokens against your committed balance over the term, against your real demand curve. Usage reconciles per the agreed settlement schedule.
03
TAP
Tap tokens against your committed balance over the term, against your real demand curve. Usage reconciles per the agreed settlement schedule.
PRINCIPLES
FORWARD VS ON-DEMAND
FORWARD VS ON-DEMAND
BUDGET CERTAINTY
Lock per-token unit economics for the full term. Forecast inference spend with confidence across the commitment.
BUDGET CERTAINTY
Lock per-token unit economics for the full term. Forecast inference spend with confidence across the commitment.
PRIORITY ALLOCATION
Quotes surface each provider's priority and reservation terms for committed balances during demand spikes.
PRIORITY ALLOCATION
Quotes surface each provider's priority and reservation terms for committed balances during demand spikes.
SUPPLY SECURITY
Secure token supply ahead of anticipated demand growth or open-model availability constraints.
SUPPLY SECURITY
Secure token supply ahead of anticipated demand growth or open-model availability constraints.
FLEXIBLE TAP
Tap your committed balance against real usage over the term, with rollover terms surfaced per quote.
FLEXIBLE TAP
Tap your committed balance against real usage over the term, with rollover terms surfaced per quote.
COVERAGE
OPEN MODELS ONLY
OPEN MODELS ONLY
LARGE OPEN-WEIGHT
Flagship open models — Llama, DeepSeek, Qwen class — served across the provider network at committed volume.
LARGE OPEN-WEIGHT
Flagship open models — Llama, DeepSeek, Qwen class — served across the provider network at committed volume.
SMALL & EFFICIENT
Distilled and small open models for high-volume, latency-sensitive, or cost-optimized inference.
SMALL & EFFICIENT
Distilled and small open models for high-volume, latency-sensitive, or cost-optimized inference.
MULTIMODAL & VISION
Open vision, speech, and multimodal models for document, image, and audio inference workloads.
MULTIMODAL & VISION
Open vision, speech, and multimodal models for document, image, and audio inference workloads.
EMBEDDING & SPECIALIZED
Open embedding, reranking, and classification models priced per standard inference billing units.
EMBEDDING & SPECIALIZED
Open embedding, reranking, and classification models priced per standard inference billing units.
Frequently Asked Questions
TOKEN FORWARDS, EXPLAINED
What is a token forward?
What term lengths are available?
What happens if I do not use all my committed tokens?
Which models can I procure tokens for?
How is this different from Reserved GPU Rental?
How are token forwards settled?
SECURE YOUR TOKEN SUPPLY
SECURE YOUR TOKEN SUPPLY
Submit a commitment request and Compute Exchange returns a token forward quote across the verified open-model provider network.
COMPUTE
EXCHANGE
The transparent GPU marketplace for AI infrastructure. Built for builders.
ALL SYSTEMS OPERATIONAL
COMPUTE
EXCHANGE
The transparent GPU marketplace for AI infrastructure. Built for builders.
ALL SYSTEMS OPERATIONAL
COMPUTE
EXCHANGE
The transparent GPU marketplace for AI infrastructure. Built for builders.
ALL SYSTEMS OPERATIONAL