whisper-1

Model Description

Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio. You can also use it as a multitask model to perform multilingual speech recognition as well as speech translation and language identification.

Description Ends

Recommend Models

gemini-2.0-flash

Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, native tool use, multimodal generation, and a 1M token context window.

DeepSeek-V3-0324

DeepSeek-V3-0324 is an upgraded AI model with enhanced reasoning, coding, Chinese writing, and web search capabilities, outperforming GPT-4.5 in certain tasks while maintaining 128K context support and open-source MIT licensing.

QwQ-32B

QwQ-32B is a 32.5B-parameter reasoning model in the Qwen series, featuring advanced architecture and 131K-token context length, designed to outperform state-of-the-art models like DeepSeek-R1 in complex tasks.