What is TokenVue?
TokenVue is an AI gateway and infrastructure control plane that helps teams securely manage, route, monitor, and optimize LLM traffic across providers and environments.
It sits between applications and AI providers such as OpenAI, Anthropic, Gemini, DeepSeek, and OpenAI-compatible self-hosted models.
Why TokenVue Exists
Modern AI applications often rely on multiple providers, model families, environments, and routing rules. Without a control layer, teams usually end up spreading provider keys, limits, logs, and failover logic across application code and operations runbooks.
That creates problems around:
- Reliability
- Security
- Observability
- Governance
- Cost optimization
- Vendor lock-in
TokenVue centralizes those controls so teams can operate LLM traffic consistently at production scale.
Core Capabilities
Virtual Keys
Virtual Keys are application-facing API keys that route requests through TokenVue instead of exposing provider credentials directly.
Use Virtual Keys to manage:
- Service ownership
- Provider and model access
- Daily, monthly, token, or request limits
- Guardrail behavior
- Usage tracking and request monitoring
LLM Config
LLM Config stores provider-facing setup inside TokenVue.
Use LLM Config to define:
- Provider name
- Provider API key
- Base URL for OpenAI-compatible providers
- Model name
- Budget mode
- Token or request limits
Auto Router
Auto Router applies routing policy when traffic should move away from a primary provider or model.
Routing decisions can use signals such as:
- Budget usage
- Latency
- Provider health
- Error rates
- Quota limits
- Region or capacity requirements
Guardrails
Guardrails run before requests are forwarded to providers.
TokenVue supports controls such as:
- Hard budget caps
- PII scrubbing
- Toxicity filtering
- Prompt injection detection
- Keyword blocking
Insights, Logs, and Breaches
TokenVue records gateway activity so teams can review:
- Token usage
- Estimated spend
- Provider latency
- Model routing behavior
- Request status
- Guardrail blocks
- Failed or suspicious traffic
OpenAI-Compatible API
TokenVue provides an OpenAI-compatible API layer, so existing OpenAI SDK workflows can use TokenVue with minimal code changes.
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.TOKENVUE_API_KEY,
baseURL: "https://api.tokenvue.ai/v1",
});
For local development, your gateway base URL may be:
http://localhost:3000/v1
Built for Production AI
TokenVue is designed for:
- AI startups
- SaaS applications
- Enterprise AI platforms
- AI agents and copilots
- Internal AI infrastructure teams
- Hybrid AI deployments
Whether you are building customer-facing AI products or internal AI systems, TokenVue provides the infrastructure layer needed to operate LLM access securely and reliably.