Models

Runtime registry with capability flags, context limits, pricing, and route state.

133 models indexed

Anthropic API Platform

anthropic-anthropic-api-platformanthropicAnthropic

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 2.1/5.7 per 1M
Class: closed
Model detail + compatibility

Anthropic API Platform listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-anthropic-api-platform"}'

Anthropic Claude 3.7 Sonnet

anthropic-anthropic-claude-37-sonnetanthropicAnthropic

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.37/0.84 per 1M
Class: closed
Model detail + compatibility

Anthropic Claude 3.7 Sonnet listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-anthropic-claude-37-sonnet"}'

Arize Phoenix

arize-arize-phoenixgeneralArize

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.21/0.52 per 1M
Class: closed
Model detail + compatibility

Arize Phoenix listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"arize-arize-phoenix"}'

AutoGen Studio

autogen-studiogeneralLiveBench

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: -/- per 1M
Class: open
Model detail + compatibility

Developer-focused multi-agent experimentation and evaluation environment.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"autogen-studio"}'

AWS Bedrock

bedrock-aws-bedrockgeneralAWS Bedrock

loading
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.45/1 per 1M
Class: closed
Model detail + compatibility

AWS Bedrock listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"bedrock-aws-bedrock"}'

AWS Nova Pro

bedrock-aws-nova-progeneralAWS Bedrock

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 1.3/1.7 per 1M
Class: closed
Model detail + compatibility

AWS Nova Pro listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"bedrock-aws-nova-pro"}'

Azure AI Foundry

azure-azure-ai-foundrygeneralAzure

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.61/1.32 per 1M
Class: closed
Model detail + compatibility

Azure AI Foundry listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"azure-azure-ai-foundry"}'

Bardeen

bardeen-bardeengeneralBardeen

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.37/0.84 per 1M
Class: closed
Model detail + compatibility

Bardeen listed under workflow automation agents for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"bardeen-bardeen"}'

Bolt.new

bolt-boltnewgeneralBolt

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.5/1.7 per 1M
Class: closed
Model detail + compatibility

Bolt.new listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"bolt-boltnew"}'

Braintrust Eval

braintrust-braintrust-evalgeneralBraintrust

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.9/2.7 per 1M
Class: closed
Model detail + compatibility

Braintrust Eval listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"braintrust-braintrust-eval"}'

Bubble AI

bubble-bubble-aigeneralBubble

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 1.5/4.2 per 1M
Class: closed
Model detail + compatibility

Bubble AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"bubble-bubble-ai"}'

Builder.io AI

builderio-builderio-aigeneralBuilder.io

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.69/1.48 per 1M
Class: closed
Model detail + compatibility

Builder.io AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"builderio-builderio-ai"}'

CAMEL-AI

google-camel-aigoogleGoogle

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.21/0.52 per 1M
Class: closed
Model detail + compatibility

CAMEL-AI listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-camel-ai"}'

Cartesia Sonic

cartesia-sonicgeneralOpenRouter

warm
toolsstreaming
Context: 128,000
Max output: 8,192
Pricing: 1.9/7.8 per 1M
Class: closed
Model detail + compatibility

Ultra-low latency TTS model designed for realtime conversational experiences.

temperature: not supported
top_p: not supported
top_k: not supported
min_p: not supported
max_tokens: supported
frequency_penalty: not supported
presence_penalty: not supported
stop: not supported
seed: not supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"cartesia-sonic"}'

Claude Code 4

anthropic-claude-code-4anthropicAnthropic

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.5/1.7 per 1M
Class: closed
Model detail + compatibility

Claude Code 4 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-code-4"}'

Claude Opus 4.1

anthropic-claude-opus-41anthropicAnthropic

loading
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.7/2.2 per 1M
Class: closed
Model detail + compatibility

Claude Opus 4.1 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-opus-41"}'

Claude Opus 4.1 (OpenRouter)

anthropic-claude-opus-4-1-openrouteranthropicAnthropic

loading
toolsvisionstreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 15/75 per 1M
Class: open
Model detail + compatibility

Premium Anthropic model in OpenRouter routing for deep analysis and high-stakes reasoning quality.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-opus-4-1-openrouter"}'

Claude Opus 4.6

claude-opus-4-6anthropicAnthropic

warm
toolsjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 3/15 per 1M
Class: closed
Model detail + compatibility

Strong long-form reasoning and coding model with robust instruction quality.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"claude-opus-4-6"}'

Claude Sonnet 4

anthropic-claude-sonnet-4anthropicAnthropic

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.9/2.7 per 1M
Class: closed
Model detail + compatibility

Claude Sonnet 4 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-sonnet-4"}'

Claude Sonnet 4 (OpenRouter)

anthropic-claude-sonnet-4-openrouteranthropicAnthropic

cold
toolsvisionstreamingcompletion
Context: 1,000,000
Max output: 65,536
Pricing: 3/15 per 1M
Class: open
Model detail + compatibility

Large-context Anthropic model via OpenRouter for coding, analysis, and multimodal enterprise tasks.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"anthropic-claude-sonnet-4-openrouter"}'

Cloudflare Workers AI

cloudflare-cloudflare-workers-aigeneralCloudflare

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.21/0.52 per 1M
Class: closed
Model detail + compatibility

Cloudflare Workers AI listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"cloudflare-cloudflare-workers-ai"}'

CodeLlama 70B

meta-codellama-70bllamaMeta

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.61/1.32 per 1M
Class: open
Model detail + compatibility

CodeLlama 70B listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"meta-codellama-70b"}'

CodeQwen 1.5

qwen-codeqwen-15qwenQwen

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 1.1/2.2 per 1M
Class: open
Model detail + compatibility

CodeQwen 1.5 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"qwen-codeqwen-15"}'

Codestral 25.08

mistral-codestral-2508mistralMistral

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.45/1 per 1M
Class: open
Model detail + compatibility

Codestral 25.08 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"mistral-codestral-2508"}'

Cohere Command A

cohere-cohere-command-acohereCohere

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.61/1.32 per 1M
Class: closed
Model detail + compatibility

Cohere Command A listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"cohere-cohere-command-a"}'

Cohere Command R Code

cohere-cohere-command-r-codecohereCohere

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.13/0.36 per 1M
Class: closed
Model detail + compatibility

Cohere Command R Code listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"cohere-cohere-command-r-code"}'

Cohere Command R+

cohere-cohere-command-rcohereCohere

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 1.1/2.2 per 1M
Class: closed
Model detail + compatibility

Cohere Command R+ listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"cohere-cohere-command-r"}'

CrewAI

crewai-crewaigeneralCrewai

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.9/2.7 per 1M
Class: closed
Model detail + compatibility

CrewAI listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"crewai-crewai"}'

CrewAI Enterprise

crewai-enterprisegeneralArena

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: -/- per 1M
Class: closed
Model detail + compatibility

Role-based multi-agent runtime for business workflows and ops automations.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"crewai-enterprise"}'

DeepSeek Coder V3

deepseek-deepseek-coder-v3deepseekDeepSeek

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.7/2.2 per 1M
Class: open
Model detail + compatibility

DeepSeek Coder V3 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-deepseek-coder-v3"}'

DeepSeek R1

deepseek-deepseek-r1deepseekDeepSeek

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 1.5/4.2 per 1M
Class: open
Model detail + compatibility

DeepSeek R1 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-deepseek-r1"}'

DeepSeek V3.1

deepseek-deepseek-v31deepseekDeepSeek

loading
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 1.7/4.7 per 1M
Class: open
Model detail + compatibility

DeepSeek V3.1 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-deepseek-v31"}'

DeepSeek V3.1 (OpenRouter)

deepseek-v3-1-openrouterdeepseekDeepSeek

cold
toolsstreamingcompletion
Context: 32,768
Max output: 4,096
Pricing: 0.15/0.75 per 1M
Class: open
Model detail + compatibility

Cost-efficient text model in OpenRouter for everyday assistant, coding, and automation prompts.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"deepseek-v3-1-openrouter"}'

Devstral

mistral-devstralmistralMistral

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 1.5/4.2 per 1M
Class: open
Model detail + compatibility

Devstral listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"mistral-devstral"}'

Dora AI

google-dora-aigoogleGoogle

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 2.1/5.7 per 1M
Class: closed
Model detail + compatibility

Dora AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-dora-ai"}'

DSPy

openai-dspyopenaiOpenAI

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.69/1.48 per 1M
Class: open
Model detail + compatibility

DSPy listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"openai-dspy"}'

Dynamiq

google-dynamiqgoogleGoogle

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.69/1.48 per 1M
Class: closed
Model detail + compatibility

Dynamiq listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-dynamiq"}'

ElevenLabs Flash v2.5

elevenlabs-flash-v2-5elevenlabsElevenLabs

warm
toolsstreamingcompletion
Context: 128,000
Max output: 8,192
Pricing: -/- per 1M
Class: closed
Model detail + compatibility

ElevenLabs low-latency TTS model optimized for real-time conversational applications.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"elevenlabs-flash-v2-5"}'

Exa Search API

exa-search-apigeneralLiveBench

warm
toolsstreamingcompletion
Context: 128,000
Max output: 8,192
Pricing: -/- per 1M
Class: closed
Model detail + compatibility

Web-scale retrieval API designed for AI-first search and research workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"exa-search-api"}'

Fireworks AI

fireworks-fireworks-aigeneralFireworks

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.37/0.84 per 1M
Class: closed
Model detail + compatibility

Fireworks AI listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"fireworks-fireworks-ai"}'

Framer AI

framer-framer-aigeneralFramer

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.37/0.84 per 1M
Class: closed
Model detail + compatibility

Framer AI listed under website app builders for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"framer-framer-ai"}'

Galileo Eval

google-galileo-evalgoogleGoogle

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.61/1.32 per 1M
Class: closed
Model detail + compatibility

Galileo Eval listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-galileo-eval"}'

Gemini 2.5 Flash

google-gemini-25-flashgoogleGoogle

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.45/1 per 1M
Class: closed
Model detail + compatibility

Gemini 2.5 Flash listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-25-flash"}'

Gemini 2.5 Flash (OpenRouter)

google-gemini-2-5-flash-openroutergoogleGoogle

warm
toolsvisionstreamingcompletion
Context: 1,048,576
Max output: 65,536
Pricing: 0.3/2.5 per 1M
Class: open
Model detail + compatibility

Multimodal fast-tier model supporting text/image/audio/video inputs through OpenRouter.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-2-5-flash-openrouter"}'

Gemini 2.5 Pro

google-gemini-25-progoogleGoogle

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 1.1/3.2 per 1M
Class: closed
Model detail + compatibility

Gemini 2.5 Pro listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-25-pro"}'

Gemini 2.5 Pro (OpenRouter)

google-gemini-2-5-pro-openroutergoogleGoogle

warm
toolsvisionstreamingcompletion
Context: 1,048,576
Max output: 65,536
Pricing: 1.25/10 per 1M
Class: open
Model detail + compatibility

Large-context multimodal model for advanced analysis and tool-augmented workflows via OpenRouter.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-2-5-pro-openrouter"}'

Gemini Code Pro

google-gemini-code-progoogleGoogle

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.37/0.84 per 1M
Class: closed
Model detail + compatibility

Gemini Code Pro listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemini-code-pro"}'

Gemma 3 27B

google-gemma-3-27bgemmaGoogle

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.7/6.7 per 1M
Class: closed
Model detail + compatibility

Gemma 3 27B listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemma-3-27b"}'

Gemma 3 27B Instruct

google-gemma-3-27b-itgemmaGoogle

warm
toolsjson_modestreamingcompletion
Context: 131,072
Max output: 16,384
Pricing: 0.16/0.5 per 1M
Class: open
Model detail + compatibility

Open-weight instruction model optimized for cost-efficient coding and multilingual tasks.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemma-3-27b-it"}'

Gemma Code 2

google-gemma-code-2gemmaGoogle

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.45/1 per 1M
Class: closed
Model detail + compatibility

Gemma Code 2 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-gemma-code-2"}'

GLM Code 4

zhipu-glm-code-4generalZhipu AI

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.37/0.84 per 1M
Class: closed
Model detail + compatibility

GLM Code 4 listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"zhipu-glm-code-4"}'

GLM-5

zhipu-glm-5generalZhipu AI

warm
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 1.7/3.7 per 1M
Class: closed
Model detail + compatibility

GLM-5 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"zhipu-glm-5"}'

Google Agent Development Kit

google-google-agent-development-kitgoogleGoogle

cold
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.7/2.2 per 1M
Class: closed
Model detail + compatibility

Google Agent Development Kit listed under agent frameworks for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-google-agent-development-kit"}'

Google Agent Development Kit (ADK)

google-agent-development-kitgoogleGoogle

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: -/- per 1M
Class: open
Model detail + compatibility

Open-source toolkit for building and evaluating Gemini-powered agents and workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"google-agent-development-kit"}'

Google Vertex AI

vertex-google-vertex-aigoogleGoogle Vertex

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 1.5/4.2 per 1M
Class: closed
Model detail + compatibility

Google Vertex AI listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"vertex-google-vertex-ai"}'

GPT-5 Codex

openai-gpt-5-codexopenaiOpenAI

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.3/1.2 per 1M
Class: open
Model detail + compatibility

GPT-5 Codex listed under coding models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"openai-gpt-5-codex"}'

Grok 4

xai-grok-4xaixAI

cold
toolsvisionjson_modestreamingcompletion
Context: 200,000
Max output: 25,000
Pricing: 0.9/1.7 per 1M
Class: closed
Model detail + compatibility

Grok 4 listed under llm foundation models for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: supported
curl -X POST /api/v1/chat/completions -d '{"model":"xai-grok-4"}'

GroqCloud

groq-groqcloudgeneralGroq

warm
toolsvisionstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.9/2.7 per 1M
Class: closed
Model detail + compatibility

GroqCloud listed under inference hosting platforms for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"groq-groqcloud"}'

Guardrails AI

guardrails-guardrails-aigeneralGuardrails

warm
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.45/1 per 1M
Class: closed
Model detail + compatibility

Guardrails AI listed under eval observability safety for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"guardrails-guardrails-ai"}'

Gumloop

gumloop-gumloopgeneralGumloop

loading
toolsstreamingcompletion
Context: 200,000
Max output: 8,192
Pricing: 0.61/1.32 per 1M
Class: closed
Model detail + compatibility

Gumloop listed under workflow automation agents for AI Bazaar discovery and comparison workflows.

temperature: supported
top_p: supported
top_k: supported
min_p: supported
max_tokens: supported
frequency_penalty: supported
presence_penalty: supported
stop: supported
seed: supported
tools: supported
vision: not supported
stream: supported
response_format_json: not supported
curl -X POST /api/v1/chat/completions -d '{"model":"gumloop-gumloop"}'
⌘K
AI Bazaar