Intelligent chat completions with automatic model routing and cost optimization. DynaRoute analyzes your prompts and routes them to the most cost-effective model while maintaining quality.
To get started, install the DynaRoute client library:
Here's how to make a simple, non-streaming chat completion request:
DynaRoute automatically selects the most cost-effective model for your prompt, potentially saving up to 70% on API costs compared to always using premium models.
POST https://api.dynaroute.com/chat/completionsCreates a chat completion response for the provided messages. Compatible with OpenAI's Chat Completions API format.
DynaRoute uses API key authentication. Include your API key in the Authorization header:
Authorization: Bearer YOUR_API_KEY| Parameter | Type | Required | Description |
|---|---|---|---|
messages | array | Required | Array of message objects forming the conversation. Each message must have role and content fields. |
stream | boolean | Optional | If true, the response will be streamed as Server-Sent Events. Default: false |
request_timeout | integer | Optional | Request timeout in seconds. Default: 720 |
level_override | integer | Optional | Override the automatic complexity level for routing (1–5). |
Use level_override to control the complexity level for routing (1–5). 5 is the most difficult and 1 is the least.
When stream is false, the API returns a complete JSON response:
DynaRoute includes detailed cost information in the usage.cost object, showing both actual costs and GPT‑4.1 equivalent costs for comparison.
The _custom_routing_info object provides insights into how DynaRoute classified and routed your request.
For real‑time responses, enable streaming to receive chunks as they're generated:
Returns 401 for invalid or missing API keys.
{"error": {"message": "Invalid or unauthorized API key", "type": "authentication_error", "code": 401}}Returns 400 for malformed requests or missing required parameters.
{"error": {"message": "No messages array found in request", "type": "invalid_request_error", "code": 400}}Returns 500 for internal server errors or routing failures.
{"error": {"message": "Core routing logic is not loaded", "type": "server_error", "code": 503}}All requests are tracked per API key with detailed usage metrics including token counts and costs.
DynaRoute automatically routes to cost‑effective models, with potential savings of up to 70% compared to premium models.
DynaRoute MCP Server is a Model Context Protocol server that provides intelligent chat completions with automatic model routing and cost optimization. It's compatible with Claude Desktop, Cursor, and other MCP-compatible clients.
Install via pip:
Configure Claude Desktop by adding this to your claude_desktop_config.json:
Once configured, you can use DynaRoute in Claude Desktop by:
When you use the DynaRoute tool, you'll get:
🤖 DynaRoute Response: [Your AI response content here] 📊 PERFORMANCE METRICS: • Model Used: gcp-gemini-2.0-flash-thinking • Response Time: 2.5 seconds • Request ID: chatcmpl-xyz123 🔢 TOKEN USAGE: • Input Tokens: 25 • Output Tokens: 150 • Total Tokens: 175 💰 COST BREAKDOWN: • Input Cost: $0.000012 • Output Cost: $0.000045 • Total Cost: $0.000057 📈 COST SAVINGS vs GPT-4o: • GPT-4o Equivalent Cost: $0.000175 • Your Actual Cost: $0.000057 • Money Saved: $0.000118 • Savings Percentage: 67.4% ✅ EFFICIENCY SUMMARY: DynaRoute saved you 67.4% compared to GPT-4o while maintaining quality!
DYNAROUTE_API_KEY: Your DynaRoute API key (required)The dynaroute_chat tool accepts:
messages: Array of conversation messages (required)level_override: Override complexity level 1–5 (optional)Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS):
Add to your Cursor settings:
You can also import and use the server programmatically:
mcp: Model Context Protocol implementationdynaroute-client: Official DynaRoute Python clientDynaRoute's MCP server enables seamless integration with all MCP‑compatible clients, allowing you to use intelligent model routing while maintaining your preferred AI interface.