Model Filter

Model Type

Features

Context Window

Maximum Output

Provider

Recommended

We have launched the Basic Series of economy models, which offer higher discounts.

1MT: one million tokens. Prices are based on a conversion rate of ¥2 = $1. If your purchase rate is ¥3.5 = $1, multiply the listed price by 1.75 (3.5 / 2).
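The rate adjustment described above can be sketched as a small calculation. This is a minimal illustration, not part of the service: the function names and the example listed price of 10.0 per 1MT are hypothetical, and only the ¥2 = $1 base rate and the ¥3.5 = $1 example come from the note above.

```python
BASE_RATE = 2.0  # yuan per $1 of credit assumed by the listed prices (from the note above)


def adjusted_price(listed_price: float, purchase_rate: float,
                   base_rate: float = BASE_RATE) -> float:
    """Scale a listed per-1MT price to the rate actually paid for credit."""
    if purchase_rate <= 0 or base_rate <= 0:
        raise ValueError("rates must be positive")
    return listed_price * purchase_rate / base_rate


def request_cost(tokens: int, price_per_1mt: float) -> float:
    """Cost of a request that uses `tokens` tokens at a per-1MT price."""
    return tokens / 1_000_000 * price_per_1mt


if __name__ == "__main__":
    listed = 10.0  # hypothetical listed price per 1MT, not taken from this page
    effective = adjusted_price(listed, purchase_rate=3.5)
    print(effective)                         # 17.5, i.e. 10.0 * 1.75
    print(request_cost(250_000, effective))  # 4.375
```

A purchase rate equal to the base rate leaves the listed price unchanged, which is a quick sanity check on the formula.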

DeepGPT-4o-1120

DeepSeek-R1 + gpt-4o-2024-11-20. The Deep series combines the chain-of-thought reasoning of the DeepSeek-R1 (671B) model with other models. It makes full use of DeepSeek's chain-of-thought capabilities and uses other, more powerful models to supplement them, improving the overall capability of the combined model.
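The page does not say how the two models are wired together. One plausible reading is a two-stage pipeline in which a reasoning model drafts a chain of thought and a second model writes the final answer from it. The sketch below illustrates that reading only: `deep_answer`, the prompt layout, and the stub models are all hypothetical, and no real API is called.

```python
from typing import Callable

# A model is represented as a plain function from prompt text to output text.
# In practice each would wrap a call to a hosted model; stubs are used here.
Model = Callable[[str], str]


def deep_answer(question: str, reasoner: Model, answerer: Model) -> str:
    """Assumed two-stage flow: draft reasoning first, then answer using it."""
    reasoning = reasoner(question)
    prompt = (
        f"Question:\n{question}\n\n"
        f"Reasoning notes:\n{reasoning}\n\n"
        "Final answer:"
    )
    return answerer(prompt)


def stub_reasoner(question: str) -> str:
    """Stand-in for a chain-of-thought model such as DeepSeek-R1."""
    return f"thoughts about: {question}"


def stub_answerer(prompt: str) -> str:
    """Stand-in for the answering model; reports how much context it received."""
    return f"answer written from {len(prompt)} characters of context"


if __name__ == "__main__":
    print(deep_answer("What is 2 + 2?", stub_reasoner, stub_answerer))
```

Passing the models in as functions keeps the sketch independent of any particular client library and makes the flow easy to test with stubs.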

basic/gpt-4o-2024-11-20

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

claude-3-5-sonnet-20241022

An upgraded AI model with leading coding performance, enhanced tool use, and groundbreaking computer interaction capabilities, all at the same speed and cost as its predecessor.

basic/claude-3-5-sonnet-20241022

An upgraded AI model with leading coding performance, enhanced tool use, and groundbreaking computer interaction capabilities, all at the same speed and cost as its predecessor.

o1-mini

The o1 reasoning model is designed to solve hard problems across domains. o1-mini is a faster and more affordable reasoning model, but we recommend using the newer o3-mini model that features higher intelligence at the same latency and price as o1-mini.

o1-mini-2024-09-12

The o1 reasoning model is designed to solve hard problems across domains. o1-mini is a faster and more affordable reasoning model, but we recommend using the newer o3-mini model that features higher intelligence at the same latency and price as o1-mini.

gpt-4o

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

gpt-4o-2024-08-06

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

basic/gpt-4o

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

gpt-4o-mini

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

gpt-4o-mini-2024-07-18

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

basic/gpt-4o-mini

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

claude-3-5-sonnet-20240620

Anthropic has launched Claude 3.5 Sonnet, a smarter, faster, and more cost-efficient AI model that outperforms competitors and previous Claude versions in reasoning, coding, and vision tasks, now available for free on Claude.ai with new collaborative features like Artifacts.

basic/claude-3-5-sonnet-20240620

Anthropic has launched Claude 3.5 Sonnet, a smarter, faster, and more cost-efficient AI model that outperforms competitors and previous Claude versions in reasoning, coding, and vision tasks, now available for free on Claude.ai with new collaborative features like Artifacts.

whisper-1

Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio. You can also use it as a multitask model to perform multilingual speech recognition as well as speech translation and language identification.

gpt-4o-2024-05-13

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

bge-reranker-v2-m3

A lightweight reranker model with strong multilingual capabilities that is easy to deploy and offers fast inference.

bge-m3

BGE-M3 is a versatile multilingual embedding model supporting dense, sparse, and multi-vector retrieval across 100+ languages and handling inputs from sentences to long documents (8,192 tokens).

gpt-3.5-turbo

GPT-3.5 Turbo models can understand and generate natural language or code and have been optimized for chat using the Chat Completions API but work well for non-chat tasks as well. As of July 2024, use gpt-4o-mini in place of GPT-3.5 Turbo, as it is cheaper, more capable, multimodal, and just as fast. GPT-3.5 Turbo is still available for use in the API.

gpt-3.5-turbo-0125

GPT-3.5 Turbo models can understand and generate natural language or code and have been optimized for chat using the Chat Completions API but work well for non-chat tasks as well. As of July 2024, use gpt-4o-mini in place of GPT-3.5 Turbo, as it is cheaper, more capable, multimodal, and just as fast. GPT-3.5 Turbo is still available for use in the API.