Lilac uses simple per-token pricing. You pay only for what you use — no contracts, no minimums, no reserved capacity fees.Documentation Index
Fetch the complete documentation index at: https://docs.getlilac.com/llms.txt
Use this file to discover all available pages before exploring further.
How It Works
Lilac routes your inference requests to idle enterprise GPUs. Because these GPUs are already powered on and paid for by their owners, we can offer inference at significantly lower prices than dedicated cloud providers.Current Pricing
| Model | Input | Cache Read | Output |
|---|---|---|---|
| Kimi K2.6 | $0.70 / M tokens | $0.20 / M tokens | $3.50 / M tokens |
| GLM 5.1 | $0.90 / M tokens | $0.27 / M tokens | $3.00 / M tokens |
| Gemma 4 | $0.11 / M tokens | — | $0.35 / M tokens |
| MiniMax M2.7 | $0.30 / M tokens | $0.055 / M tokens | $1.20 / M tokens |
Cache read pricing
When repeated input tokens (such as system prompts or shared context) are served from cache, they’re billed at the cache read rate instead of the standard input rate. Cache read pricing only applies to models that support cached input tokens — models without cache support are marked with— and bill all input tokens at the standard input rate.
What’s Included
- No cold start fees
- No request fees
- No minimum spend
- No contract or commitment
- Streaming at no extra cost
Adding Credits
Lilac uses a credit-based billing system. Add credits through the Billing section of your dashboard. You can also configure auto top-up to automatically add credits when your balance drops below a threshold.Go to Billing
Open console.getlilac.com and navigate to Billing in the sidebar.

