QwQ-32B

Model Description

QwQ-32B is a medium-sized reasoning model in the Qwen series. Unlike conventional instruction-tuned models, it thinks through a problem before answering, which substantially improves performance on downstream tasks, especially hard problems that require deep reasoning. Architecturally it is a transformer with RoPE, SwiGLU, RMSNorm, and attention QKV bias, 64 layers, and 40 query heads with 8 key-value heads (GQA). It supports a full 131,072-token context length, though YaRN must be enabled for prompts exceeding 8,192 tokens. Post-trained via supervised finetuning and reinforcement learning on top of pretraining, it achieves competitive results against leading reasoning models such as DeepSeek-R1 and o1-mini. You can explore its capabilities in QwenChat or consult the official resources for deployment guidelines.
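Below is a minimal sketch of loading the model locally with Hugging Face transformers and generating a response. The checkpoint name "Qwen/QwQ-32B" and the commented YaRN rope-scaling values (a factor of 4.0 over 32,768 original positions, giving 131,072 tokens) are assumptions based on the figures above; verify them against the official model card before relying on them.

```python
# Minimal sketch: load QwQ-32B with transformers and generate a reply.
# Checkpoint name and YaRN values are assumptions, not confirmed settings.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/QwQ-32B"  # assumed checkpoint id

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto",
    # For prompts longer than 8,192 tokens, YaRN rope scaling must be enabled.
    # Illustrative values (4.0 x 32,768 = 131,072); check the official card:
    # rope_scaling={"type": "yarn", "factor": 4.0,
    #               "original_max_position_embeddings": 32768},
)

messages = [{"role": "user", "content": "How many r's are in 'strawberry'?"}]
text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Reasoning models emit a long thinking trace before the final answer,
# so allow a generous generation budget.
output_ids = model.generate(**inputs, max_new_tokens=4096)
print(tokenizer.decode(
    output_ids[0][inputs.input_ids.shape[1]:], skip_special_tokens=True
))
```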

🔔 How to Use

```mermaid
graph LR
    A("Purchase Now") --> B["Start Chat on Homepage"]
    A --> D["Read API Documentation"]
    B --> C["Register / Login"]
    C --> E["Enter Key"]
    D --> F["Enter Endpoint & Key"]
    E --> G("Start Using")
    F --> G
    style A fill:#f9f9f9,stroke:#333,stroke-width:1px
    style B fill:#f9f9f9,stroke:#333,stroke-width:1px
    style C fill:#f9f9f9,stroke:#333,stroke-width:1px
    style D fill:#f9f9f9,stroke:#333,stroke-width:1px
    style E fill:#f9f9f9,stroke:#333,stroke-width:1px
    style F fill:#f9f9f9,stroke:#333,stroke-width:1px
    style G fill:#f9f9f9,stroke:#333,stroke-width:1px
```
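The "Enter Endpoint & Key" path in the flow above corresponds to calling the service programmatically. A minimal sketch follows, assuming the service exposes an OpenAI-compatible API; the base URL, API key, and model name are placeholders to be replaced with the endpoint and key issued after purchase.

```python
# Minimal sketch of the API path from the flow above, assuming an
# OpenAI-compatible endpoint. base_url, api_key, and model are placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="https://your-endpoint.example.com/v1",  # endpoint from the docs
    api_key="YOUR_API_KEY",                           # key from your account
)

response = client.chat.completions.create(
    model="QwQ-32B",
    messages=[{"role": "user", "content": "Explain why the sky is blue."}],
)
print(response.choices[0].message.content)
```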

Recommended Models

claude-sonnet-4-20250514

A comprehensive overview of Anthropic's newly released Claude 4 models, Opus 4 and Sonnet 4, highlighting their capabilities, performance benchmarks, use cases, pricing, and availability. The report summarizes the key differences between the models and discusses their integration with mainstream platforms such as GitHub Copilot, emphasizing their strengths in coding, advanced reasoning, and ethical AI responses.

gemini-2.5-flash-preview-04-17

Gemini-2.5-Flash-Preview-04-17 is a large language model that accepts text, image, video, and audio inputs, with advanced output and code-execution capabilities and high token limits.

QwQ-32B

QwQ-32B is a 32.5B-parameter reasoning model in the Qwen series with an advanced architecture and a 131K-token context length, aiming to outperform state-of-the-art models such as DeepSeek-R1 on complex tasks.