TOKEN FORWARDS
HOW IT WORKS
CONTRACT SPECIFICATION
Compute Exchange standardizes inference commitments as Standardized Token Units (STUs). Buyers commit to a chosen STU volume; providers fulfill against the published methodology.
INPUT TOKEN
UNCACHED
1.00
STU
CACHED INPUT
REUSED CONTEXT
0.20
STU
OUTPUT TOKEN
GENERATED
4.21
STU
BATCH MODE
NON-REALTIME
0.50
× DRAW
Per-STU pricing varies by model family. Providers absorb the basis between the published STU methodology and their underlying per-token economics — buyers hold a single fungible commitment denominated in STUs.
PRINCIPLES
COVERAGE
CALIBRATION REFERENCE
The published STU methodology (1.0 · 0.2 · 4.21 · 0.5×) is one calibration — anchored on the Kimi K2 line. Other open models have different native input:output economics. Below: how 1 input, cached, and output token convert to STUs under each model's native ratio.
HOW STU PRICING WORKS · Conversion to STU is fixed per model; the per-STU price floats by provider based on hardware and underlying economics. Providers absorb the basis between native ratios and the published index at quote time.
IMPORTANT · Per-STU pricing varies materially by model family and provider. Quotes are indicative; final terms are confirmed bilaterally with the matched provider.
Native ratios derived from active open-model provider catalogs. Cached-input ratio shown only where the provider publishes one — most non-Kimi endpoints don't currently expose cached pricing publicly, so the cell reads "varies." Quotes price against the published 4.21× index; providers absorb the basis between native ratios and the index.
Frequently Asked Questions