Models

Runtime registry with capability flags, context limits, pricing, and route state.

133 models indexed

Family

All anthropic cohere deepseek elevenlabs gemma general google llama mistral openai qwen runway stability xai

Capability

All tools vision json_mode streaming completion

Runtime state

All warm loading cold

Anthropic API Platform

anthropic-anthropic-api-platform • anthropic • Anthropic

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 2.1/5.7 per 1M

Class: closed

Model detail + compatibility

Anthropic API Platform listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-anthropic-api-platform"}'

Anthropic Claude 3.7 Sonnet

anthropic-anthropic-claude-37-sonnet • anthropic • Anthropic

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.37/0.84 per 1M

Class: closed

Model detail + compatibility

Anthropic Claude 3.7 Sonnet listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-anthropic-claude-37-sonnet"}'

Arize Phoenix

arize-arize-phoenix • general • Arize

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.21/0.52 per 1M

Class: closed

Model detail + compatibility

Arize Phoenix listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"arize-arize-phoenix"}'

AutoGen Studio

autogen-studio • general • LiveBench

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: -/- per 1M

Class: open

Model detail + compatibility

Developer-focused multi-agent experimentation and evaluation environment.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"autogen-studio"}'

AWS Bedrock

bedrock-aws-bedrock • general • AWS Bedrock

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.45/1 per 1M

Class: closed

Model detail + compatibility

AWS Bedrock listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"bedrock-aws-bedrock"}'

AWS Nova Pro

bedrock-aws-nova-pro • general • AWS Bedrock

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 1.3/1.7 per 1M

Class: closed

Model detail + compatibility

AWS Nova Pro listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"bedrock-aws-nova-pro"}'

Azure AI Foundry

azure-azure-ai-foundry • general • Azure

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.61/1.32 per 1M

Class: closed

Model detail + compatibility

Azure AI Foundry listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"azure-azure-ai-foundry"}'

Bardeen

bardeen-bardeen • general • Bardeen

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.37/0.84 per 1M

Class: closed

Model detail + compatibility

Bardeen listed under workflow automation agents for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"bardeen-bardeen"}'

Bolt.new

bolt-boltnew • general • Bolt

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.5/1.7 per 1M

Class: closed

Model detail + compatibility

Bolt.new listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"bolt-boltnew"}'

Braintrust Eval

braintrust-braintrust-eval • general • Braintrust

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.9/2.7 per 1M

Class: closed

Model detail + compatibility

Braintrust Eval listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"braintrust-braintrust-eval"}'

Bubble AI

bubble-bubble-ai • general • Bubble

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 1.5/4.2 per 1M

Class: closed

Model detail + compatibility

Bubble AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"bubble-bubble-ai"}'

Builder.io AI

builderio-builderio-ai • general • Builder.io

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.69/1.48 per 1M

Class: closed

Model detail + compatibility

Builder.io AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"builderio-builderio-ai"}'

CAMEL-AI

google-camel-ai • google • Google

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.21/0.52 per 1M

Class: closed

Model detail + compatibility

CAMEL-AI listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-camel-ai"}'

Cartesia Sonic

cartesia-sonic • general • OpenRouter

warm

toolsstreaming

Context: 128,000

Max output: 8,192

Pricing: 1.9/7.8 per 1M

Class: closed

Model detail + compatibility

Ultra-low latency TTS model designed for realtime conversational experiences.

temperature: not supported

top_p: not supported

top_k: not supported

min_p: not supported

max_tokens: supported

frequency_penalty: not supported

presence_penalty: not supported

stop: not supported

seed: not supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"cartesia-sonic"}'

Claude Code 4

anthropic-claude-code-4 • anthropic • Anthropic

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.5/1.7 per 1M

Class: closed

Model detail + compatibility

Claude Code 4 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-code-4"}'

Claude Opus 4.1

anthropic-claude-opus-41 • anthropic • Anthropic

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.7/2.2 per 1M

Class: closed

Model detail + compatibility

Claude Opus 4.1 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-opus-41"}'

Claude Opus 4.1 (OpenRouter)

anthropic-claude-opus-4-1-openrouter • anthropic • Anthropic

toolsvisionstreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 15/75 per 1M

Class: open

Model detail + compatibility

Premium Anthropic model in OpenRouter routing for deep analysis and high-stakes reasoning quality.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-opus-4-1-openrouter"}'

Claude Opus 4.6

claude-opus-4-6 • anthropic • Anthropic

warm

toolsjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 3/15 per 1M

Class: closed

Model detail + compatibility

Strong long-form reasoning and coding model with robust instruction quality.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"claude-opus-4-6"}'

Claude Sonnet 4

anthropic-claude-sonnet-4 • anthropic • Anthropic

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.9/2.7 per 1M

Class: closed

Model detail + compatibility

Claude Sonnet 4 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-sonnet-4"}'

Claude Sonnet 4 (OpenRouter)

anthropic-claude-sonnet-4-openrouter • anthropic • Anthropic

cold

toolsvisionstreamingcompletion

Context: 1,000,000

Max output: 65,536

Pricing: 3/15 per 1M

Class: open

Model detail + compatibility

Large-context Anthropic model via OpenRouter for coding, analysis, and multimodal enterprise tasks.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-sonnet-4-openrouter"}'

Cloudflare Workers AI

cloudflare-cloudflare-workers-ai • general • Cloudflare

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.21/0.52 per 1M

Class: closed

Model detail + compatibility

Cloudflare Workers AI listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"cloudflare-cloudflare-workers-ai"}'

CodeLlama 70B

meta-codellama-70b • llama • Meta

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.61/1.32 per 1M

Class: open

Model detail + compatibility

CodeLlama 70B listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"meta-codellama-70b"}'

CodeQwen 1.5

qwen-codeqwen-15 • qwen • Qwen

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 1.1/2.2 per 1M

Class: open

Model detail + compatibility

CodeQwen 1.5 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"qwen-codeqwen-15"}'

Codestral 25.08

mistral-codestral-2508 • mistral • Mistral

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.45/1 per 1M

Class: open

Model detail + compatibility

Codestral 25.08 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"mistral-codestral-2508"}'

Cohere Command A

cohere-cohere-command-a • cohere • Cohere

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.61/1.32 per 1M

Class: closed

Model detail + compatibility

Cohere Command A listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"cohere-cohere-command-a"}'

Cohere Command R Code

cohere-cohere-command-r-code • cohere • Cohere

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.13/0.36 per 1M

Class: closed

Model detail + compatibility

Cohere Command R Code listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"cohere-cohere-command-r-code"}'

Cohere Command R+

cohere-cohere-command-r • cohere • Cohere

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 1.1/2.2 per 1M

Class: closed

Model detail + compatibility

Cohere Command R+ listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"cohere-cohere-command-r"}'

CrewAI

crewai-crewai • general • Crewai

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.9/2.7 per 1M

Class: closed

Model detail + compatibility

CrewAI listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"crewai-crewai"}'

CrewAI Enterprise

crewai-enterprise • general • Arena

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: -/- per 1M

Class: closed

Model detail + compatibility

Role-based multi-agent runtime for business workflows and ops automations.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"crewai-enterprise"}'

DeepSeek Coder V3

deepseek-deepseek-coder-v3 • deepseek • DeepSeek

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.7/2.2 per 1M

Class: open

Model detail + compatibility

DeepSeek Coder V3 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-deepseek-coder-v3"}'

DeepSeek R1

deepseek-deepseek-r1 • deepseek • DeepSeek

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 1.5/4.2 per 1M

Class: open

Model detail + compatibility

DeepSeek R1 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-deepseek-r1"}'

DeepSeek V3.1

deepseek-deepseek-v31 • deepseek • DeepSeek

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 1.7/4.7 per 1M

Class: open

Model detail + compatibility

DeepSeek V3.1 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-deepseek-v31"}'

DeepSeek V3.1 (OpenRouter)

deepseek-v3-1-openrouter • deepseek • DeepSeek

cold

toolsstreamingcompletion

Context: 32,768

Max output: 4,096

Pricing: 0.15/0.75 per 1M

Class: open

Model detail + compatibility

Cost-efficient text model in OpenRouter for everyday assistant, coding, and automation prompts.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-v3-1-openrouter"}'

Devstral

mistral-devstral • mistral • Mistral

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 1.5/4.2 per 1M

Class: open

Model detail + compatibility

Devstral listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"mistral-devstral"}'

Dora AI

google-dora-ai • google • Google

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 2.1/5.7 per 1M

Class: closed

Model detail + compatibility

Dora AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-dora-ai"}'

DSPy

openai-dspy • openai • OpenAI

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.69/1.48 per 1M

Class: open

Model detail + compatibility

DSPy listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"openai-dspy"}'

Dynamiq

google-dynamiq • google • Google

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.69/1.48 per 1M

Class: closed

Model detail + compatibility

Dynamiq listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-dynamiq"}'

ElevenLabs Flash v2.5

elevenlabs-flash-v2-5 • elevenlabs • ElevenLabs

warm

toolsstreamingcompletion

Context: 128,000

Max output: 8,192

Pricing: -/- per 1M

Class: closed

Model detail + compatibility

ElevenLabs low-latency TTS model optimized for real-time conversational applications.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"elevenlabs-flash-v2-5"}'

Exa Search API

exa-search-api • general • LiveBench

warm

toolsstreamingcompletion

Context: 128,000

Max output: 8,192

Pricing: -/- per 1M

Class: closed

Model detail + compatibility

Web-scale retrieval API designed for AI-first search and research workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"exa-search-api"}'

Fireworks AI

fireworks-fireworks-ai • general • Fireworks

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.37/0.84 per 1M

Class: closed

Model detail + compatibility

Fireworks AI listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"fireworks-fireworks-ai"}'

Framer AI

framer-framer-ai • general • Framer

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.37/0.84 per 1M

Class: closed

Model detail + compatibility

Framer AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"framer-framer-ai"}'

Galileo Eval

google-galileo-eval • google • Google

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.61/1.32 per 1M

Class: closed

Model detail + compatibility

Galileo Eval listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-galileo-eval"}'

Gemini 2.5 Flash

google-gemini-25-flash • google • Google

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.45/1 per 1M

Class: closed

Model detail + compatibility

Gemini 2.5 Flash listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-25-flash"}'

Gemini 2.5 Flash (OpenRouter)

google-gemini-2-5-flash-openrouter • google • Google

warm

toolsvisionstreamingcompletion

Context: 1,048,576

Max output: 65,536

Pricing: 0.3/2.5 per 1M

Class: open

Model detail + compatibility

Multimodal fast-tier model supporting text/image/audio/video inputs through OpenRouter.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-2-5-flash-openrouter"}'

Gemini 2.5 Pro

google-gemini-25-pro • google • Google

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 1.1/3.2 per 1M

Class: closed

Model detail + compatibility

Gemini 2.5 Pro listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-25-pro"}'

Gemini 2.5 Pro (OpenRouter)

google-gemini-2-5-pro-openrouter • google • Google

warm

toolsvisionstreamingcompletion

Context: 1,048,576

Max output: 65,536

Pricing: 1.25/10 per 1M

Class: open

Model detail + compatibility

Large-context multimodal model for advanced analysis and tool-augmented workflows via OpenRouter.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-2-5-pro-openrouter"}'

Gemini Code Pro

google-gemini-code-pro • google • Google

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.37/0.84 per 1M

Class: closed

Model detail + compatibility

Gemini Code Pro listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-code-pro"}'

Gemma 3 27B

google-gemma-3-27b • gemma • Google

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.7/6.7 per 1M

Class: closed

Model detail + compatibility

Gemma 3 27B listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemma-3-27b"}'

Gemma 3 27B Instruct

google-gemma-3-27b-it • gemma • Google

warm

toolsjson_modestreamingcompletion

Context: 131,072

Max output: 16,384

Pricing: 0.16/0.5 per 1M

Class: open

Model detail + compatibility

Open-weight instruction model optimized for cost-efficient coding and multilingual tasks.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemma-3-27b-it"}'

Gemma Code 2

google-gemma-code-2 • gemma • Google

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.45/1 per 1M

Class: closed

Model detail + compatibility

Gemma Code 2 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-gemma-code-2"}'

GLM Code 4

zhipu-glm-code-4 • general • Zhipu AI

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.37/0.84 per 1M

Class: closed

Model detail + compatibility

GLM Code 4 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"zhipu-glm-code-4"}'

GLM-5

zhipu-glm-5 • general • Zhipu AI

warm

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 1.7/3.7 per 1M

Class: closed

Model detail + compatibility

GLM-5 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"zhipu-glm-5"}'

Google Agent Development Kit

google-google-agent-development-kit • google • Google

cold

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.7/2.2 per 1M

Class: closed

Model detail + compatibility

Google Agent Development Kit listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-google-agent-development-kit"}'

Google Agent Development Kit (ADK)

google-agent-development-kit • google • Google

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: -/- per 1M

Class: open

Model detail + compatibility

Open-source toolkit for building and evaluating Gemini-powered agents and workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"google-agent-development-kit"}'

Google Vertex AI

vertex-google-vertex-ai • google • Google Vertex

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 1.5/4.2 per 1M

Class: closed

Model detail + compatibility

Google Vertex AI listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"vertex-google-vertex-ai"}'

GPT-5 Codex

openai-gpt-5-codex • openai • OpenAI

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.3/1.2 per 1M

Class: open

Model detail + compatibility

GPT-5 Codex listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"openai-gpt-5-codex"}'

Grok 4

xai-grok-4 • xai • xAI

cold

toolsvisionjson_modestreamingcompletion

Context: 200,000

Max output: 25,000

Pricing: 0.9/1.7 per 1M

Class: closed

Model detail + compatibility

Grok 4 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: supported

curl -X POST /api/v1/chat/completions -d '{"model":"xai-grok-4"}'

GroqCloud

groq-groqcloud • general • Groq

warm

toolsvisionstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.9/2.7 per 1M

Class: closed

Model detail + compatibility

GroqCloud listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"groq-groqcloud"}'

Guardrails AI

guardrails-guardrails-ai • general • Guardrails

warm

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.45/1 per 1M

Class: closed

Model detail + compatibility

Guardrails AI listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"guardrails-guardrails-ai"}'

Gumloop

gumloop-gumloop • general • Gumloop

toolsstreamingcompletion

Context: 200,000

Max output: 8,192

Pricing: 0.61/1.32 per 1M

Class: closed

Model detail + compatibility

Gumloop listed under workflow automation agents for AI Bazaar discovery and comparison workflows.

temperature: supported

top_p: supported

top_k: supported

min_p: supported

max_tokens: supported

frequency_penalty: supported

presence_penalty: supported

stop: supported

seed: supported

tools: supported

vision: not supported

stream: supported

response_format_json: not supported

curl -X POST /api/v1/chat/completions -d '{"model":"gumloop-gumloop"}'