Skip to main content

Simple, transparent pricing

Pay only for what you use. Per-million-token pricing across every model. No monthly plans, no hidden fees, no surprises.

Pay per million tokens

Access every model with one API key. Prices are per 1 million tokens. Cached input is billed at 10% of the input rate (cache hits); cache writes at 1.25x.

Chat & Reasoning

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Chat Pico$0.15$0.015$0.19$0.55
Assisters Chat Mini$0.15$0.015$0.19$0.55
Chat v6$0.15$0.015$0.19$0.55
Chat Mini 2$0.15$0.015$0.19$0.55
Chat Nano$0.15$0.015$0.19$0.55
Chat v15 (Compact)$0.15$0.015$0.19$0.55
Chat Mini 3 (Lightweight)$0.15$0.015$0.19$0.55
Chat v17 (Free Lite)$0.15$0.015$0.19$0.55
Chat v12 (Balanced)$0.40$0.040$0.50$3.25
Chat v14 (Efficient)$0.40$0.040$0.50$3.25
Chat IN (Indic)$0.40$0.040$0.50$3.25
Chat JA (Japanese)$0.40$0.040$0.50$3.25
Chat v10 (Multilingual)$0.60$0.060$0.75$3.25
Chat v11 (Long Context)$0.60$0.060$0.75$3.25
Chat v19 (Sparse MoE)$0.80$0.080$1.00$3.25
Chat Medium$0.85$0.085$1.06$3.25
Chat v13 (Flagship)$1.75$0.17$2.19$13.00
Chat v7 (Ultra)$1.75$0.17$2.19$13.00
Chat v5$2.00$0.20$2.50$6.00
Assisters Chat Turbo$2.00$0.20$2.50$6.00
Chat v1$2.00$0.20$2.50$6.00
Assisters Chat v2$2.00$0.20$2.50$6.00
Assisters Chat v4$2.00$0.20$2.50$6.00
Chat v18 (Free Pro)$2.00$0.20$2.50$13.00
Chat v16 (Free Flash)$2.00$0.20$2.50$6.00
Assisters MoE$2.00$0.20$2.50$6.00
Assisters Chat v3$4.00$0.40$5.00$6.00
Chat v8 (Flash)$4.00$0.40$5.00$6.00
Chat v9 (Pro)$4.00$0.40$5.00$13.00

Classification

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Detect Speaker$0.20$0.00
Detect AV$0.20$0.00

Code

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Assisters Code v2$0.95$0.095$1.19$3.25
Assisters Code Fast$0.95$0.095$1.19$3.25
Code v4$0.95$0.095$1.19$3.25
Assisters Code v1$1.00$0.10$1.25$5.00
Assisters Code v3$1.00$0.10$1.25$5.00
Coder$1.00$0.10$1.25$5.00

Embedding

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Embeddings v2$0.15$0.00
Assisters Embeddings v1$0.20$0.00
Embeddings v4$0.30$0.00
Assisters Code Embeddings$0.30$0.00

Safety

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Guard v4 (Content Safety)$0.20$0.00
Assisters Guard Topic$0.30$0.00
Assisters Guard v1$0.30$0.00
Assisters Moderation v1$0.30$0.00
Assisters Moderation v2$0.30$0.00
Assisters Guard PII$0.30$0.00
Assisters Guard Content$0.30$0.00

Reasoning

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Reason Nano$0.15$0.015$0.19$0.55
Reason v7 (Fast)$0.40$0.040$0.50$3.25
Reason v5 (Multimodal)$0.40$0.040$0.50$3.25
Reason Fast$0.95$0.095$1.19$3.25
Reason v3$0.95$0.095$1.19$3.25
Reason v6 (Deep)$1.75$0.17$2.19$13.00
Assisters Reason v2$2.00$0.20$2.50$6.00
Assisters Reason v1$2.00$0.20$2.50$6.00

Reranking

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Assisters Rerank v1$0.30$0.00
Assisters Rerank v2$0.30$0.00

Science

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Science v2 (Protein Folding)$0.20$0.00
Assisters Science v1$0.20$0.00

Speech & Audio

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Speech-to-Text v1$0.10$0.00

Translation

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Assisters Translate v1$2.00$0.20$2.50$6.00

Speech & Audio

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Assisters TTS v1$0.40$0.00
Assisters TTS v2$0.40$0.00

Video

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Video v2$1.00$0.10$1.25$3.00
Assisters Video v1$1.00$0.10$1.25$3.00

Vision

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Vision v5 (Efficient)$0.40$0.040$0.50$3.25
Vision v6 (Compact)$0.40$0.040$0.50$3.25
Assisters Vision v2$0.90$0.090$1.13$3.25
Assisters Vision v3$0.90$0.090$1.13$3.25
Assisters Vision v1$0.90$0.090$1.13$3.25
Vision v4$0.90$0.090$1.13$3.25

Voice

ModelInput / 1MCached input / 1MCache write / 1MOutput / 1M
Voice Chat$0.30$0.00
Assisters Voice Enhance$0.30$0.00

Frequently asked questions

Have more questions? Contact us

What counts as a token?

Tokens are pieces of words. On average, 1 token is about 4 characters or 0.75 words. Input and output tokens are priced separately, per million tokens.

How does billing work?

There are no monthly plans. You pay only for what you use — per million input tokens and per million output tokens, at the per-model rates above. Costs are deducted from your wallet balance as you make API calls.

How are cached tokens priced?

Prompt-cache tokens are billed separately from regular input. Cache reads (cache hits, where the model reuses a previously seen prompt prefix) are charged at 10% of the input rate — a 90% discount. Cache writes (creating a cache entry) are charged at 1.25x the input rate. The cached_tokens / cache_write_tokens counts are reported back in each response's usage.prompt_tokens_details so you can see exactly what was cached.

Where do I add funds?

Your wallet is shared across all Assisters products. Top up from your dashboard wallet, and the balance is usable on both assisters.io and assisters.dev.

Are there any minimums or commitments?

No. There are no monthly fees, no token caps, and no commitments — you're only charged for the tokens you actually use.

What payment methods do you accept?

We accept all major credit cards (Visa, Mastercard, American Express) and can arrange invoicing for Enterprise customers.

Ready to get started?

Pay only for the tokens you use — no monthly plans, no commitments.

Looking for consumer AI assistant pricing? View assisters.io plans