qwen3-235b-a22b

Model Description

Qwen3-235B-A22B, part of the cutting-edge Qwen3 series, introduces the next generation of large language models with both dense and mixture-of-experts (MoE) architectures. This advanced model enables seamless switching between thinking mode (for complex reasoning, math, and coding) and non-thinking mode (for fast, general-purpose dialogue) within a single model—delivering optimal performance for a wide range of applications.

Key highlights:

Outperforms previous generation QwQ models (in thinking mode) and Qwen2.5 instruct models (in non-thinking mode) in mathematics, code generation, and commonsense logical reasoning.
Offers superior alignment with human preferences, excelling in creative writing, role-play, multi-turn dialogue, and following instructions for highly natural and engaging conversations.
Exceptional agent capabilities allow precise external tool integration in both thinking and non-thinking modes, achieving SOTA (state-of-the-art) performance among open-source models for complex agent-based tasks.
Supports over 100 languages and dialects, with robust multilingual instruction following and translation abilities.

Model overview:

Feature Description
Type Causal Language Model
Training Stage Pretraining & Post-training
Number of Parameters (Total) 235B
Activated Parameters 22B
Non-Embedding Parameters 234B
Layers 94
Attention Heads (GQA) Q: 64, KV: 4
Number of Experts 128
Activated Experts 8
Context Length 32,768 tokens natively, up to 131,072 tokens with YaRN

Qwen3-235B-A22B sets a new standard in the industry for reasoning, agent capabilities, human-like dialogue, and multilingual support, making it an ideal choice for sophisticated AI applications.

🔔How to Use

graph LR A("Purchase Now") --> B["Start Chat on Homepage"] A --> D["Read API Documentation"] B --> C["Register / Login"] C --> E["Enter Key"] D --> F["Enter Endpoint & Key"] E --> G("Start Using") F --> G style A fill:#f9f9f9,stroke:#333,stroke-width:1px style B fill:#f9f9f9,stroke:#333,stroke-width:1px style C fill:#f9f9f9,stroke:#333,stroke-width:1px style D fill:#f9f9f9,stroke:#333,stroke-width:1px style E fill:#f9f9f9,stroke:#333,stroke-width:1px style F fill:#f9f9f9,stroke:#333,stroke-width:1px style G fill:#f9f9f9,stroke:#333,stroke-width:1px

Purchase Now

Start Chat on Homepage

Register / Login

Enter Key

Read API Documentation

Enter Endpoint & Key

Start Using

Description Ends

Recommend Models

gemini-2.5-pro

Gemini 2.5 Pro is Google's most advanced AI model designed for coding and complex tasks, featuring enhanced reasoning capabilities, native multimodal support, and a 1-million token context window.

gemini-2.5-flash-preview-04-17

Gemini-2.5-Flash-Preview-04-17 is a large language model supporting text, image, video, and audio inputs, with advanced output and code execution capabilities and high token limits.

o3-pro

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently better answers. o3-pro is available in the Responses API only to enable support for multi-turn model interactions before responding to API requests, and other advanced API features in the future. Since o3-pro is designed to tackle tough problems, some requests may take several minutes to finish. To avoid timeouts, try using background mode.