Google

Gemini 3 Flash Preview

Coming soon

google/gemini-3-flash-preview

Context

1.0M

Max out

66K

Input

$0.50/M

Output

$3.00/M

Web search

$0.0140/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Google

Gemini 3.1 Flash-Lite

Coming soon

google/gemini-3-1-flash-lite

Context

1.0M

Max out

66K

Input

$0.25/M

Output

$1.50/M

Web search

$0.0140/R

Text chat, Vision, Function calling, Structured output, Web search, Audio input
Google

Gemini 3.1 Pro Preview

Coming soon

google/gemini-3-1-pro-preview

Context

1.0M

Max out

66K

Input

$2.00/M

Output

$12.00/M

Cache read

$0.20/M

Web search

$0.0140/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Anthropic

Claude Opus 4.8

Contact sales

anthropic/claude-opus-4-8

Context

1M

Max out

128K

Input

$5.00/M

Output

$25.00/M

Cache read

$0.50/M

Text chat, Vision, Reasoning, Function calling, Web search, Computer use
DeepSeek

DeepSeek V4 Flash

Coming soon

deepseek/deepseek-v4-flash

Context

1M

Max out

384K

Input

$0.14/M

Output

$0.28/M

Cache read

$0.0028/M

Text chat, Reasoning, Function calling, Structured output, Prompt cache
DeepSeek

DeepSeek V4 Pro

Coming soon

deepseek/deepseek-v4-pro

Context

1M

Max out

384K

Input

$0.43/M

Output

$0.87/M

Cache read

$0.0036/M

Text chat, Reasoning, Function calling, Structured output, Prompt cache
xAI

Grok 4.3

Coming soon

xai/grok-4-3

Context

1M

Max out

-

Input

$1.25/M

Output

$2.50/M

Text chat, Vision, Reasoning, Function calling, Structured output
xAI

Grok Build 0.1

Coming soon

xai/grok-build-0-1

Context

256K

Max out

-

Input

$1.00/M

Output

$2.00/M

Text chat, Vision, Reasoning, Function calling, Structured output
OpenAI

GPT-5.5

openai/gpt-5-5

Context

1.1M

Max out

128K

Input

$5.00/M

Output

$30.00/M

Cache read

$0.50/M

Web search

$0.0100/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
OpenAI

GPT-5.5 Pro

Coming soon, Contact sales

openai/gpt-5-5-pro

Context

1.1M

Max out

128K

Input

$30.00/M

Output

$180.00/M

Web search

$0.0100/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Anthropic

Claude Opus 4.7

Max only

anthropic/claude-opus-4-7

Context

1M

Max out

128K

Input

$5.00/M

Output

$25.00/M

Cache read

$0.50/M

Text chat, Vision, Reasoning, Function calling, Web search, Computer use
OpenAI

GPT-5.4 Mini

openai/gpt-5-4-mini

Context

400K

Max out

128K

Input

$0.75/M

Output

$4.50/M

Cache read

$0.07/M

Web search

$0.0100/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
OpenAI

GPT-5.4

openai/gpt-5-4

Context

1.1M

Max out

128K

Input

$2.50/M

Output

$15.00/M

Cache read

$0.25/M

Web search

$0.0100/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Anthropic

Claude Sonnet 4.6

anthropic/claude-sonnet-4-6

Context

1M

Max out

128K

Input

$3.00/M

Output

$15.00/M

Cache read

$0.30/M

Text chat, Vision, Reasoning, Function calling, Web search, Computer use
Anthropic

Claude Haiku 4.5

anthropic/claude-haiku-4-5-20251001

Context

200K

Max out

64K

Input

$1.00/M

Output

$5.00/M

Cache read

$0.10/M

Text chat, Vision, Function calling
OpenAI

o3

Coming soon

openai/o3

Context

200K

Max out

100K

Input

$2.00/M

Output

$8.00/M

Text chat, Vision, Reasoning, Function calling, Structured output
OpenAI

o4-mini

Coming soon

openai/o4-mini

Context

200K

Max out

100K

Input

$1.10/M

Output

$4.40/M

Text chat, Vision, Reasoning, Function calling, Structured output
OpenAI

GPT-4.1

Coming soon

openai/gpt-4-1

Context

1.0M

Max out

33K

Input

$2.00/M

Output

$8.00/M

Text chat, Vision, Function calling, Structured output
OpenAI

GPT-4.1 Mini

Coming soon

openai/gpt-4-1-mini

Context

1.0M

Max out

33K

Input

$0.40/M

Output

$1.60/M

Text chat, Vision, Function calling, Structured output
OpenAI

GPT-4.1 Nano

Coming soon

openai/gpt-4-1-nano

Context

1.0M

Max out

33K

Input

$0.10/M

Output

$0.40/M

Text chat, Vision, Function calling, Structured output
OpenAI

o3-mini

Coming soon

openai/o3-mini

Context

200K

Max out

100K

Input

$1.10/M

Output

$4.40/M

Text chat, Reasoning, Function calling, Structured output
OpenAI

GPT-4o Mini

Coming soon

openai/gpt-4o-mini

Context

128K

Max out

16K

Input

$0.15/M

Output

$0.60/M

Text chat, Vision
OpenAI

GPT-4o

Coming soon

openai/gpt-4o

Context

128K

Max out

16K

Input

$2.50/M

Output

$10.00/M

Text chat, Vision
OpenAI

GPT-4 Turbo

Coming soon

openai/gpt-4-turbo

Context

128K

Max out

4K

Input

$10.00/M

Output

$30.00/M

Text chat, Vision
OpenAI

GPT-4

Coming soon

openai/gpt-4

Context

8K

Max out

8K

Input

$30.00/M

Output

$60.00/M

Text chat
Google

Gemini 2.5 Flash

Coming soon

google/gemini-2-5-flash

Context

1.0M

Max out

66K

Input

$0.30/M

Output

$2.50/M

Cache read

$0.03/M

Web search

$0.0350/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Google

Gemini 2.5 Flash-Lite

Coming soon

google/gemini-2-5-flash-lite

Context

1.0M

Max out

66K

Input

$0.10/M

Output

$0.40/M

Cache read

$0.01/M

Web search

$0.0350/R

Text chat, Vision, Function calling, Structured output, Web search, Prompt cache
Google

Gemini 2.5 Pro

Coming soon

google/gemini-2-5-pro

Context

1.0M

Max out

66K

Input

$1.25/M

Output

$10.00/M

Cache read

$0.13/M

Web search

$0.0350/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Moonshot

Kimi K2.6

moonshot/kimi-k2-6

Context

262K

Max out

33K

Input

$0.95/M

Output

$4.00/M

Cache read

$0.16/M

Web search

$0.0050/R

Text chat, Vision, Reasoning, Function calling, Structured output, Web search
Moonshot

Kimi K2.7 Code

Coming soon

moonshot/kimi-k2-7-code

Context

262K

Max out

33K

Input

$0.95/M

Output

$4.00/M

Cache read

$0.19/M

Text chat, Reasoning, Function calling, Structured output, Prompt cache
Moonshot

Kimi K2.7 Code Highspeed

Coming soon

moonshot/kimi-k2-7-code-highspeed

Context

262K

Max out

33K

Input

$1.90/M

Output

$8.00/M

Cache read

$0.38/M

Text chat, Reasoning, Function calling, Structured output, Prompt cache
Qwen

Qwen3.6 35B-A3B

Coming soon

qwen/qwen3-6-35b-a3b

Context

256K

Max out

-

Input

$0.25/M

Output

$1.49/M

Text chat, Reasoning, Function calling, Structured output

Catalog updated Jun 25, 2026, 6:05 PM
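
The listed prices can be combined into a rough per-request estimate. The sketch below is illustrative only and rests on several assumptions that the catalog does not state: that "/M" means per one million tokens, that "/R" means per web search request, and that cached input tokens are billed at the cache-read rate instead of the input rate. The `ModelPrice` record, the `PRICES` table, and the `estimate_cost` helper are hypothetical names, and actual billing may differ.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelPrice:
    """USD prices copied from a catalog card (hypothetical record type)."""
    input_per_m: float                               # "Input", assumed per 1M tokens
    output_per_m: float                              # "Output", assumed per 1M tokens
    cache_read_per_m: Optional[float] = None         # "Cache read", if listed
    web_search_per_request: Optional[float] = None   # "Web search", if listed


# A few entries transcribed from the catalog; extend as needed.
PRICES = {
    "openai/gpt-5-4": ModelPrice(2.50, 15.00, 0.25, 0.0100),
    "anthropic/claude-sonnet-4-6": ModelPrice(3.00, 15.00, 0.30),
    "moonshot/kimi-k2-6": ModelPrice(0.95, 4.00, 0.16, 0.0050),
}


def estimate_cost(model_id, input_tokens, output_tokens,
                  cached_tokens=0, web_searches=0):
    """Estimate the USD cost of one request under the assumptions above."""
    price = PRICES[model_id]
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    if cached_tokens and price.cache_read_per_m is None:
        raise ValueError(f"no cache-read price listed for {model_id}")
    if web_searches and price.web_search_per_request is None:
        raise ValueError(f"no web search price listed for {model_id}")

    # Assumption: cached tokens replace, rather than add to, billed input.
    uncached_tokens = input_tokens - cached_tokens
    cost = uncached_tokens * price.input_per_m / 1_000_000
    cost += output_tokens * price.output_per_m / 1_000_000
    if cached_tokens:
        cost += cached_tokens * price.cache_read_per_m / 1_000_000
    if web_searches:
        cost += web_searches * price.web_search_per_request
    return cost


if __name__ == "__main__":
    # 100K input tokens (half served from cache), 10K output, 2 searches.
    total = estimate_cost("openai/gpt-5-4", 100_000, 10_000,
                          cached_tokens=50_000, web_searches=2)
    print(f"${total:.4f}")  # prints $0.3075
```

Models whose cards list no cache-read or web search price raise an error instead of silently charging zero, since a missing price on a card does not necessarily mean the feature is free.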