Available Models
Binom.Router integrates with leading AI model providers, giving you access to state-of-the-art language models through a unified API. Configure multiple providers and switch between them seamlessly based on your needs.
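As an illustration of the unified-API idea, switching providers typically amounts to changing the model name in an OpenAI-compatible chat payload. The sketch below builds such a payload; the helper name and structure are illustrative, not a confirmed Binom.Router client API.

```python
import json

def build_chat_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-compatible chat-completion payload.

    Illustrative sketch: the same payload shape works across
    providers, so switching models is a one-line change.
    """
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

# Same request shape, different providers -- only the model name changes:
payload = build_chat_request("gpt-4o-mini", "Hello!")
body = json.dumps(payload)  # ready to send as the HTTP request body
print(payload["model"])  # → gpt-4o-mini
```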
OpenAI
OpenAI's GPT series represents some of the most capable language models available, excelling at complex reasoning, coding, and creative tasks.
Available Models
| Model | Context Window | Streaming | Best For |
|---|---|---|---|
| gpt-5 | 128K+ | ✅ | General-purpose tasks, complex reasoning |
| gpt-5-codex | 128K+ | ✅ | Code generation, debugging, technical documentation |
| gpt-4o-mini | 128K | ✅ | Cost-effective tasks, simple queries |
| gpt-4-turbo | 128K | ✅ | Balanced performance and speed |
| gpt-3.5-turbo | 16K | ✅ | Lightweight tasks, high-throughput scenarios |
Authentication Methods
- API Key: Direct authentication with OpenAI
- OAuth: Sign in with your OpenAI/ChatGPT account
- OpenAI-Compatible: Connect to any OpenAI-compatible endpoint (OpenRouter, Groq, custom servers)
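Connecting to an OpenAI-compatible endpoint usually means pointing the client at a different base URL while keeping the standard `/chat/completions` path. A minimal sketch, assuming the providers follow that convention (verify the base URLs against each provider's own documentation):

```python
# Base URLs for some OpenAI-compatible providers -- confirm against
# each provider's docs before relying on them.
KNOWN_BASES = {
    "openai": "https://api.openai.com/v1",
    "openrouter": "https://openrouter.ai/api/v1",
    "groq": "https://api.groq.com/openai/v1",
}

def chat_url(provider_or_base: str) -> str:
    """Resolve the chat-completions URL for a known provider name
    or any custom OpenAI-compatible base URL."""
    base = KNOWN_BASES.get(provider_or_base, provider_or_base).rstrip("/")
    return f"{base}/chat/completions"

print(chat_url("groq"))
# A custom or self-hosted server works the same way:
print(chat_url("http://localhost:11434/v1"))
```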
Pricing
Pricing varies by model and usage tier. Visit your Billing page for current rates based on your subscription plan.
Google Gemini
Google's Gemini models offer state-of-the-art performance with strong capabilities in multimodal understanding and long-context reasoning.
Available Models
| Model | Context Window | Streaming | Best For |
|---|---|---|---|
| gemini-2.0-flash-exp | 1M+ | ✅ | Experimental features, cutting-edge performance |
| gemini-1.5-pro | 1M | ✅ | Complex reasoning, large document analysis |
| gemini-1.5-flash | 1M | ✅ | Fast responses, real-time applications |
| gemini-1.0-pro | 32K | ✅ | General-purpose tasks, cost-effective |
Authentication Methods
- API Key: Google AI Studio API key
- OAuth: Sign in with your Google Account (supports Gemini 2.5/3 models)
- Vertex AI: Enterprise Google Cloud authentication
Special Features
- Massive Context: Up to 1 million token context window
- Multimodal: Native support for images, audio, and video
- Code Execution: Built-in code execution capabilities
Anthropic Claude
Claude models are designed with a focus on helpfulness, honesty, and safety, with strong performance in analysis, writing, and coding tasks.
Available Models
| Model | Context Window | Streaming | Best For |
|---|---|---|---|
| claude-opus-4-1-20250805 | 200K | ✅ | Most complex tasks, deepest reasoning |
| claude-sonnet-4-5-20250929 | 200K | ✅ | Balanced performance, code and writing |
| claude-3-5-sonnet-20240620 | 200K | ✅ | Advanced coding and technical tasks |
| claude-3-opus-20240229 | 200K | ✅ | Complex analysis and research |
| claude-3-sonnet-20240229 | 200K | ✅ | General-purpose, balanced tasks |
| claude-3-haiku-20240307 | 200K | ✅ | Fast responses, lightweight tasks |
| claude-3-5-haiku-20241022 | 200K | ✅ | Quick responses, simple queries |
Authentication Methods
- API Key: Direct Anthropic API key authentication
- OAuth: Sign in with your Anthropic account
Special Features
- Large Context: Up to 200K token context window
- Constitutional AI: Built-in safety and alignment
- Vision: Native multimodal capabilities
Other Supported Providers
Binom.Router supports additional model providers through flexible integration options.
Qwen (Alibaba)
| Model | Context Window | Streaming | Best For |
|---|---|---|---|
| qwen3-max | 32K | ✅ | General-purpose, high performance |
| qwen3-coder-plus | 32K | ✅ | Code generation and programming |
| qwen3-235b-a22b-instruct | 32K | ✅ | Complex instruction following |
GLM (Zhipu AI)
| Model | Context Window | Streaming | Best For |
|---|---|---|---|
| glm-4.7 | 128K | ✅ | Chinese language tasks, general use |
Additional Providers
- Cohere: Enterprise-focused language models
- Groq: Ultra-fast inference with open-source models
- Together AI: Access to open-source models at scale
- Ollama: Self-hosted models for privacy and control
- Custom: Connect any OpenAI-compatible endpoint
Model Comparison
Quick comparison of popular models across providers:
| Provider | Model | Context | Speed | Cost | Best Use Case |
|---|---|---|---|---|---|
| OpenAI | gpt-5 | 128K+ | High | High | General-purpose, complex tasks |
| OpenAI | gpt-5-codex | 128K+ | High | High | Code generation |
| OpenAI | gpt-4o-mini | 128K | Very High | Low | Cost-effective queries |
| Google | gemini-2.0-flash-exp | 1M+ | Very High | Medium | Experimental, large context |
| Google | gemini-1.5-pro | 1M | High | Medium | Document analysis, research |
| Anthropic | claude-opus-4-1-20250805 | 200K | High | High | Complex reasoning, research |
| Anthropic | claude-sonnet-4-5-20250929 | 200K | High | Medium | Balanced performance |
| Anthropic | claude-3-5-haiku-20241022 | 200K | Very High | Low | Fast responses |
| Qwen | qwen3-max | 32K | High | Low | General-purpose |
Choosing the Right Model
Consider these factors when selecting a model:
- Task Complexity: Use larger models (Opus, GPT-5) for complex reasoning
- Speed Requirements: Choose Flash or Haiku models for real-time applications
- Cost Constraints: Opt for gpt-4o-mini or Haiku for high-volume tasks
- Context Length: Select Gemini 1.5 for documents requiring massive context
- Specialization: Use Codex models for programming tasks
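The guidance above can be condensed into a small selection heuristic. This is an illustrative sketch only; the model names mirror the tables above, and you should substitute whatever models your plan actually exposes.

```python
def pick_model(complex_reasoning=False, realtime=False, budget=False,
               long_context=False, coding=False) -> str:
    """Illustrative heuristic mirroring the guidance above.

    Priorities are applied top-down: a massive-context need or a
    coding task outweighs the other factors.
    """
    if long_context:
        return "gemini-1.5-pro"           # 1M-token context window
    if coding:
        return "gpt-5-codex"              # code-specialized
    if complex_reasoning:
        return "claude-opus-4-1-20250805" # deepest reasoning
    if realtime:
        return "gemini-1.5-flash"         # fast responses
    if budget:
        return "gpt-4o-mini"              # cost-effective, high volume
    return "claude-sonnet-4-5-20250929"   # balanced default

print(pick_model(long_context=True))  # → gemini-1.5-pro
```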
Model Availability
Model availability depends on:
- Your subscription plan
- Provider service status
- Geographic restrictions
- API key permissions
Check your Dashboard for real-time availability status.
Streaming Support
All listed models support streaming responses for real-time interaction. Enable streaming by setting `"stream": true` in your API request parameters.
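With streaming enabled, responses typically arrive as server-sent events in the common OpenAI `data: {json}` format, terminated by `data: [DONE]`. The sketch below extracts text deltas from such a stream; it assumes that wire format, so verify it against the provider you connect through.

```python
import json

def extract_stream_text(sse_lines):
    """Concatenate content deltas from OpenAI-style SSE lines.

    Assumes the common `data: {json}` convention terminated by
    `data: [DONE]` -- verify against your provider's stream format.
    """
    out = []
    for line in sse_lines:
        if not line.startswith("data: "):
            continue  # skip comments, blank keep-alive lines, etc.
        data = line[len("data: "):]
        if data == "[DONE]":
            break
        chunk = json.loads(data)
        delta = chunk["choices"][0]["delta"].get("content", "")
        out.append(delta)
    return "".join(out)

sample = [
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "data: [DONE]",
]
print(extract_stream_text(sample))  # → Hello
```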
Rate Limits
Rate limits vary by provider and model. Refer to your Billing page for detailed rate limit information based on your plan.