Paste actual content to get real token counts. Load a preset below to seed with production CSA / Lumi content.
Advanced: reasoning / output cap
Cached tokens cost ~10% of base input rate. Write cost (1.25× input) amortized across conversations.
Cache write amortization
GPT uses the exact o200k_base tokenizer in-browser. Non-GPT families apply a characters-per-token ratio calibrated for Vietnamese. Override any value below.
Vietnamese calibration ratios — editable
Monthly cost · CSA
Monthly projection by volume · USD
Sources & formulas. Prices verified April 2026 against vendor docs (platform.claude.com, openai.com/api/pricing, ai.google.dev/pricing, minimax.io). Hosted open-source prices via DeepInfra / SiliconFlow / OpenRouter. Claude 4.7 ratio (1.35× over 4.6) from Anthropic migration guide + claudecodecamp real-world measurement. Gemini SentencePiece Vietnamese extrapolated from CJK (~2.4 cpt) and English (~4.2 cpt) baselines. All ratios editable above — paste real usage.input_tokens values to calibrate precisely.