Models & AI

Bring your own key

Don’t want to run a model locally, or want frontier quality? Bring your own API key. Same IDE, same verified loop — the generation step calls your provider directly from your browser. Your key is stored in your browser only and is never sent to an OIOXO server.

Supported providers

Provider            Example models
OpenAI              gpt-4o, gpt-4o-mini, o3-mini, o1
Anthropic (Claude)  claude-3-5-sonnet-latest, claude-opus-4, claude-3-5-haiku-latest
Google (Gemini)     gemini-2.0-flash, gemini-2.5-pro, gemini-2.5-flash
OpenRouter          any model on OpenRouter (live list)
Groq                llama-3.3-70b-versatile, qwen-2.5-coder-32b
DeepSeek            deepseek-chat, deepseek-reasoner
Mistral             mistral-large-latest, codestral-latest
xAI (Grok)          grok-2-latest, grok-beta
Together AI         Qwen 2.5 Coder, Llama 3.3, DeepSeek V3
Fireworks AI        Qwen 2.5 Coder, Llama 3.3, DeepSeek V3
Cerebras            llama-3.3-70b, llama3.1-8b (fastest inference)
Custom              any OpenAI-compatible endpoint (your base URL + model)
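
For the Custom row, it may help to see what "OpenAI-compatible" means in practice: the endpoint accepts a chat completions request at your base URL, with your key as a bearer token. The sketch below is illustrative only. `buildChatRequest` and `ChatRequest` are hypothetical names, not part of OIOXO, and the request shape is assumed to follow the common OpenAI chat completions convention.

```typescript
// Hypothetical helper (not OIOXO's API): builds a request for any
// OpenAI-compatible chat completions endpoint.
interface ChatRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

function buildChatRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  prompt: string
): ChatRequest {
  // Strip trailing slashes so ".../v1/" and ".../v1" behave the same.
  const root = baseUrl.replace(/\/+$/, "");
  return {
    url: `${root}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}

// Usage in a browser (base URL and model are whatever your provider documents):
//   const { url, init } = buildChatRequest(myBaseUrl, myKey, myModel, "Hello");
//   const response = await fetch(url, init);
```

If a request built this way succeeds against your endpoint, the endpoint should be usable as a Custom provider.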

Setting it up

  1. Choose “Use your own API key — no download” when picking a model (or switch any time from the model settings).
  2. Paste your key — OIOXO recognizes the provider from the key format automatically.
  3. Pick a model from the curated list (or the provider’s live list where available).
  4. OIOXO makes one tiny test call to confirm the key works, then you’re building.
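
Step 2 can be pictured as a prefix check on the pasted key. This is a minimal sketch, not OIOXO's actual detection logic: `detectProvider` is a hypothetical function, and the prefix table is an assumption based on commonly seen key formats, which providers may change. Note that a bare `sk-` prefix is ambiguous (OpenAI and DeepSeek keys both use it), so a real implementation would need a follow-up step such as the test call in step 4.

```typescript
type Provider =
  | "anthropic"
  | "openrouter"
  | "groq"
  | "xai"
  | "google"
  | "openai-compatible"
  | "unknown";

// Assumed prefixes (illustrative, not an official list).
// More specific prefixes come first so "sk-ant-" is not swallowed by "sk-".
const PREFIXES: Array<[string, Provider]> = [
  ["sk-ant-", "anthropic"],
  ["sk-or-", "openrouter"],
  ["gsk_", "groq"],
  ["xai-", "xai"],
  ["AIza", "google"],
  ["sk-", "openai-compatible"], // ambiguous: shared by more than one provider
];

function detectProvider(key: string): Provider {
  const trimmed = key.trim();
  for (const [prefix, provider] of PREFIXES) {
    // Compare the leading characters of the key with the known prefix.
    if (trimmed.slice(0, prefix.length) === prefix) return provider;
  }
  return "unknown"; // e.g. keys with no distinctive prefix
}
```

Keys that come back as `"unknown"` or `"openai-compatible"` are the cases where choosing the provider by hand is the safer path.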

Token-saving (your bill, ~90% smaller)

OIOXO automatically structures BYOK requests for prompt caching. With Anthropic, it marks the stable system context as cacheable, so cache hits bill at a fraction of the normal input price. OpenAI and other providers that support automatic prefix caching reuse long shared prefixes without any extra markup. The meter shows tokens spent and tokens saved, including the dollar value. Free accounts get a monthly savings allowance; Pro removes the cap.
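
To make the Anthropic case concrete, here is a sketch of a Messages API request body whose system block carries a `cache_control` marker. The function name `buildCachedRequest`, the `max_tokens` value, and the split between "stable context" and "user prompt" are assumptions for illustration. How OIOXO actually assembles its requests is not specified here.

```typescript
// Shape of an Anthropic Messages API body, reduced to the fields used here.
interface AnthropicRequestBody {
  model: string;
  max_tokens: number;
  system: Array<{
    type: "text";
    text: string;
    cache_control?: { type: "ephemeral" };
  }>;
  messages: Array<{ role: "user" | "assistant"; content: string }>;
}

// Hypothetical helper: puts the unchanging context in a cacheable system
// block and the changing prompt in the message list.
function buildCachedRequest(
  model: string,
  stableContext: string,
  userPrompt: string
): AnthropicRequestBody {
  return {
    model,
    max_tokens: 1024, // arbitrary value chosen for the sketch
    system: [
      // The marker asks the provider to cache the prompt up to this block.
      { type: "text", text: stableContext, cache_control: { type: "ephemeral" } },
    ],
    messages: [{ role: "user", content: userPrompt }],
  };
}
```

Only the prefix that stays byte-for-byte identical between calls can be served from cache, which is why the stable context goes first and the per-request prompt goes last. The provider's response reports cached token counts in its usage data, which is the kind of figure a savings meter can draw on.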

Ollama

Running Ollama locally? OIOXO detects it at localhost:11434 and can use any model you’ve pulled — local frontier-class coding with zero key and zero cloud.
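
Detection of this kind can be done by asking the local Ollama server which models it has. The sketch below assumes Ollama's `/api/tags` endpoint, which returns a `models` array with a `name` per entry. `detectOllama`, `parseModelNames`, and `FetchLike` are hypothetical names, and this is not necessarily how OIOXO performs the check.

```typescript
// Minimal fetch shape so the sketch type-checks without DOM typings.
type FetchLike = (url: string) => Promise<{ ok: boolean; json(): Promise<unknown> }>;

const OLLAMA_TAGS_URL = "http://localhost:11434/api/tags";

// Extracts model names from a payload shaped like { models: [{ name: "..." }] }.
function parseModelNames(payload: unknown): string[] {
  const models = (payload as { models?: unknown } | null)?.models;
  if (!Array.isArray(models)) return [];
  const names: string[] = [];
  for (const entry of models) {
    const name = (entry as { name?: unknown } | null)?.name;
    if (typeof name === "string") names.push(name);
  }
  return names;
}

// Returns the pulled model names, or null when Ollama is not reachable.
async function detectOllama(fetchFn: FetchLike): Promise<string[] | null> {
  try {
    const res = await fetchFn(OLLAMA_TAGS_URL);
    return res.ok ? parseModelNames(await res.json()) : null;
  } catch {
    return null; // connection refused, blocked by CORS, and similar failures
  }
}

// Usage in a browser:
//   const models = await detectOllama((url) => fetch(url));
```

A `null` result means the server was not reachable from the page. An empty array means Ollama is running but no models have been pulled yet.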

Note

BYOK traffic goes from your browser straight to your provider. OIOXO never proxies it, never sees your key, and never sees your code in transit.