Model Filter

Model Type

Features

Context Windown

Maxmium Output

Provider

Recommend

We have launched the Basic Series economy models, offering higher discounts. Click to view the model comparison >>

1MT: One million tokens. This pricing is based on the conversion rate of ¥2 = $1. If your purchase rate is ¥3.5 = $1, the price should be multiplied by 1.75 accordingly.

o1-mini-2024-09-12

The o1 reasoning model is designed to solve hard problems across domains. o1-mini is a faster and more affordable reasoning model, but we recommend using the newer o3-mini model that features higher intelligence at the same latency and price as o1-mini.

gpt-4o

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

gpt-4o-2024-08-06

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

basic/gpt-4o

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

gpt-4o-mini

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

gpt-4o-mini-2024-07-18

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

basic/gpt-4o-mini

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.

claude-3-5-sonnet-20240620

Anthropic has launched Claude 3.5 Sonnet, a smarter, faster, and more cost-efficient AI model that outperforms competitors and previous Claude versions in reasoning, coding, and vision tasks, now available for free on Claude.ai with new collaborative features like Artifacts.

basic/claude-3-5-sonnet-20240620

Anthropic has launched Claude 3.5 Sonnet, a smarter, faster, and more cost-efficient AI model that outperforms competitors and previous Claude versions in reasoning, coding, and vision tasks, now available for free on Claude.ai with new collaborative features like Artifacts.

whisper-1

Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio. You can also use it as a multitask model to perform multilingual speech recognition as well as speech translation and language identification.

gpt-4o-2024-05-13

GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.

bge-reranker-v2-m3

Lightweight reranker model, possesses strong multilingual capabilities, easy to deploy, with fast inference.

bge-m3

BGE-M3 is a versatile multilingual embedding model supporting dense, sparse, and multi-vector retrieval across 100+ languages and handling inputs from sentences to long documents (8,192 tokens).

gpt-3.5-turbo

GPT-3.5 Turbo models can understand and generate natural language or code and have been optimized for chat using the Chat Completions API but work well for non-chat tasks as well. As of July 2024, use gpt-4o-mini in place of GPT-3.5 Turbo, as it is cheaper, more capable, multimodal, and just as fast. GPT-3.5 Turbo is still available for use in the API.

gpt-3.5-turbo-0125

GPT-3.5 Turbo models can understand and generate natural language or code and have been optimized for chat using the Chat Completions API but work well for non-chat tasks as well. As of July 2024, use gpt-4o-mini in place of GPT-3.5 Turbo, as it is cheaper, more capable, multimodal, and just as fast. GPT-3.5 Turbo is still available for use in the API.

text-embedding-3-large

text-embedding-3-large is our most capable embedding model for both english and non-english tasks. Embeddings are a numerical representation of text that can be used to measure the relatedness between two pieces of text. Embeddings are useful for search, clustering, recommendations, anomaly detection, and classification tasks.

text-embedding-3-small

text-embedding-3-small is our improved, more performant version of our ada embedding model. Embeddings are a numerical representation of text that can be used to measure the relatedness between two pieces of text. Embeddings are useful for search, clustering, recommendations, anomaly detection, and classification tasks.

gpt-4-all

Using reverse engineering to call the model within the official application and convert it into an API.

gpt-4-gizmo-*

Using reverse engineering to call the model within the official application and convert it into an API.

tts-1

TTS is a model that converts text to natural sounding spoken text. The tts-1 model is optimized for realtime text-to-speech use cases. Use it with the Speech endpoint in the Audio API.