Available Models
Binom.Router integrates with leading AI model providers, giving you access to state-of-the-art language models through a unified API. Configure multiple providers and switch between them seamlessly based on your needs.
Supported Models
The following models are currently available through Binom.Router:
Claude (Anthropic)
| Model | Context Window | Best For |
|---|---|---|
claude-3-5-sonnet-20240620 |
200K tokens | Complex reasoning, coding, analysis |
claude-3-opus-20240229 |
200K tokens | Most capable, long-context tasks |
claude-3-sonnet-20240229 |
200K tokens | Balanced performance and speed |
claude-3-haiku-20240307 |
200K tokens | Fast responses, simple tasks |
Gemini (Google)
| Model | Context Window | Best For |
|---|---|---|
gemini-2.0-flash-exp |
1M tokens | Experimental, fast multimodal |
gemini-1.5-pro |
1M tokens | Complex tasks, long documents |
gemini-1.5-flash |
1M tokens | Balanced speed and capability |
gemini-1.0-pro |
32K tokens | General purpose tasks |
GPT (OpenAI / Codex)
| Model | Context Window | Best For |
|---|---|---|
gpt-4o |
128K tokens | General purpose, vision, coding |
gpt-4-turbo |
128K tokens | Advanced reasoning |
gpt-3.5-turbo |
16K tokens | Cost-effective, simple tasks |
OpenAI Compatible Providers
Binom.Router also supports any model exposed through an OpenAI-compatible API endpoint. Models from providers like z.ai, Together AI, and others can be configured dynamically by administrators.
Model Availability
Model availability depends on:
- Your subscription plan
- Provider service status
- Worker node connectivity
- API key permissions
Check your Dashboard for real-time availability status.
Streaming Support
All listed models support streaming responses for real-time interaction. Enable streaming in your API requests by setting "stream": true in your request parameters.
Rate Limits
Rate limits vary by provider and model. Refer to your Billing page for detailed rate limit information based on your plan.