Model Filter: Model Type, Features, Context Window, Maximum Output, Provider, Recommended

We have launched the Basic Series economy models, which offer deeper discounts. See the model comparison for details.

1MT = one million tokens. Listed prices assume an exchange rate of ¥2 = $1; if your purchase rate is ¥3.5 = $1, multiply the listed prices by 1.75.
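The conversion is a simple ratio of exchange rates. A minimal sketch, using a hypothetical $10-per-1MT list price purely for illustration:

```python
# Sketch of the price conversion described above.
# The example list price is hypothetical, not an actual catalog price.
LIST_RATE = 2.0  # listed prices assume ¥2 = $1


def effective_price(listed_price_per_1mt: float, purchase_rate: float) -> float:
    """Scale a listed per-1MT price to your own purchase exchange rate."""
    return listed_price_per_1mt * (purchase_rate / LIST_RATE)


# A model listed at $10 per 1MT, purchased at ¥3.5 = $1,
# effectively costs 10 * (3.5 / 2) = $17.50 per 1MT.
print(effective_price(10.0, 3.5))  # 17.5
```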

grok-4-0709

Our latest and greatest flagship model, offering unparalleled performance in natural language, math and reasoning - the perfect jack of all trades.

claudecode/claude-sonnet-4-20250514

Claude models served through the Claude Code channel offer moderate stability at a very low price, making them better suited to batch data-processing tasks where stability requirements are not strict.

az/claude-sonnet-4-20250514

Claude models served through the Microsoft Azure platform offer moderate stability at a very low price, making them better suited to batch data-processing tasks where stability requirements are not strict.

sora_image

Reverse-engineered version of the official GPT-Image-1, featuring stable performance, high cost-effectiveness, compatibility with the standard OpenAI chat format, and support for generating images directly through conversation.
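Because the model accepts the OpenAI-compatible chat format, an image can be requested as an ordinary chat message. The sketch below is an assumption about how such a call might look: the base URL and API key are placeholders, and exactly how the image link comes back in the reply depends on the provider.

```python
from openai import OpenAI

# Placeholder endpoint and key; substitute your provider's values.
client = OpenAI(base_url="https://api.example.com/v1", api_key="YOUR_API_KEY")

# sora_image is addressed like any chat model; the prompt describes the image.
response = client.chat.completions.create(
    model="sora_image",
    messages=[{"role": "user", "content": "Draw a watercolor fox in a snowy forest"}],
)

# Services of this kind typically return the generated image as a URL or
# markdown link inside the assistant's text reply.
print(response.choices[0].message.content)
```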

gemini-2.5-pro

Gemini 2.5 Pro is Google's most advanced AI model designed for coding and complex tasks, featuring enhanced reasoning capabilities, native multimodal support, and a 1-million token context window.

gemini-2.5-flash

Gemini 2.5 Flash is Google's most efficient multimodal AI model designed for fast, cost-effective performance on everyday tasks with native audio capabilities and a 1-million token context window.

gemini-2.5-flash-lite-preview-06-17

A Gemini 2.5 Flash model optimized for cost efficiency and low latency.

o3-pro

The o-series models are trained with reinforcement learning to think before they answer and to perform complex reasoning. o3-pro uses more compute to think harder and provide consistently better answers. o3-pro is available only in the Responses API, which enables support for multi-turn model interactions before responding to API requests and for other advanced API features in the future. Because o3-pro is designed to tackle tough problems, some requests may take several minutes to finish. To avoid timeouts, try using background mode.
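A minimal sketch of background mode with the OpenAI Python SDK's Responses API: submit the request with background enabled, then poll until it completes. Parameter and status names follow the OpenAI SDK; behavior behind a proxy or aggregator endpoint may differ.

```python
import time

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY (or your provider's key/base URL) is configured

# Submit a long-running o3-pro request in background mode so the HTTP call
# returns immediately instead of holding the connection open for minutes.
resp = client.responses.create(
    model="o3-pro",
    input="Prove or disprove: every planar graph is 4-colorable.",
    background=True,
)

# Poll until the background task finishes.
while resp.status in ("queued", "in_progress"):
    time.sleep(5)
    resp = client.responses.retrieve(resp.id)

print(resp.status)
print(resp.output_text)
```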

gemini-2.5-pro-preview-06-05

Google has released an upgraded preview of Gemini 2.5 Pro (06-05) that significantly improves coding performance, mathematical reasoning, and response formatting while addressing previous performance concerns.

DeepSeek-R1-0528

DeepSeek-R1-0528 is an upgraded model with enhanced programming, design, and inference efficiency, delivering high-quality outputs for complex tasks.

gemini-2.5-flash-preview-05-20

Google Gemini 2.5 Flash (gemini-2.5-flash-preview-05-20) pairs a hybrid reasoning architecture with multimodal capabilities and optimized performance, suited to a wide range of application scenarios at competitive API pricing.

basic/gemini-2.5-flash-preview-05-20

Google Gemini 2.5 Flash (gemini-2.5-flash-preview-05-20) pairs a hybrid reasoning architecture with multimodal capabilities and optimized performance, suited to a wide range of application scenarios at competitive API pricing.

claude-opus-4-20250514

Claude Opus 4, part of Anthropic's newly released Claude 4 family alongside Sonnet 4, offers strong benchmark performance and advantages in coding, advanced reasoning, and ethical AI responses, and integrates with major platforms such as GitHub Copilot.

claude-opus-4-20250514-thinking

The thinking variant of Claude Opus 4, part of Anthropic's newly released Claude 4 family alongside Sonnet 4, offers strong benchmark performance and advantages in coding, advanced reasoning, and ethical AI responses, and integrates with major platforms such as GitHub Copilot.

claude-sonnet-4-20250514

Claude Sonnet 4, part of Anthropic's newly released Claude 4 family alongside Opus 4, offers strong benchmark performance and advantages in coding, advanced reasoning, and ethical AI responses, and integrates with major platforms such as GitHub Copilot.

claude-sonnet-4-20250514-thinking

The thinking variant of Claude Sonnet 4, part of Anthropic's newly released Claude 4 family alongside Opus 4, offers strong benchmark performance and advantages in coding, advanced reasoning, and ethical AI responses, and integrates with major platforms such as GitHub Copilot.

basic2/claude-opus-4-20250514

Claude Opus 4, part of Anthropic's newly released Claude 4 family alongside Sonnet 4, offers strong benchmark performance and advantages in coding, advanced reasoning, and ethical AI responses, and integrates with major platforms such as GitHub Copilot.

basic2/claude-sonnet-4-20250514

Claude Sonnet 4, part of Anthropic's newly released Claude 4 family alongside Opus 4, offers strong benchmark performance and advantages in coding, advanced reasoning, and ethical AI responses, and integrates with major platforms such as GitHub Copilot.

gemini-2.5-pro-preview-05-06

Gemini 2.5 Pro is Google's state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.

qwen3-235b-a22b

Qwen3-235B-A22B is part of the cutting-edge Qwen3 series, the next generation of large language models spanning both dense and mixture-of-experts (MoE) architectures. Within a single model it can switch seamlessly between thinking mode (for complex reasoning, math, and coding) and non-thinking mode (for fast, general-purpose dialogue), delivering optimal performance across a wide range of applications. A mode-switching sketch follows below.
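Mode switching is typically exposed either through a provider-specific flag or through Qwen3's documented soft-switch tags in the prompt. The sketch below assumes an OpenAI-compatible endpoint (placeholder base URL and key) and uses the /no_think soft switch; whether the tag or a separate enable_thinking flag is honored depends on how the provider serves the model.

```python
from openai import OpenAI

# Placeholder endpoint and key; substitute your provider's values.
client = OpenAI(base_url="https://api.example.com/v1", api_key="YOUR_API_KEY")

# Thinking mode (default): suited to complex reasoning, math, and coding.
reasoned = client.chat.completions.create(
    model="qwen3-235b-a22b",
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
)

# Non-thinking mode via Qwen3's "/no_think" soft switch appended to the user
# message, for fast general-purpose dialogue. Some providers instead expose an
# enable_thinking flag; check your provider's documentation.
quick = client.chat.completions.create(
    model="qwen3-235b-a22b",
    messages=[{"role": "user", "content": "Say hi in three languages. /no_think"}],
)

print(reasoned.choices[0].message.content)
print(quick.choices[0].message.content)
```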