首页/Qwen3 235B A22B
qwen/qwen3-235b-a22b-fp8

Qwen3 235B A22B

qwen/qwen3-235b-a22b-fp8
Achieves effective integration of inference and non-inference modes, enabling seamless switching between modes during conversations. The model's inference capability significantly surpasses that of QwQ, and its general capabilities exceed those of Qwen2.5-72B-Instruct, reaching the state-of-the-art (SOTA) level among models of the same scale.

特性

按使用量付费

$0.2/$0.8
每100万Token(输入/输出)

使用以下代码示例来集成我们的API:

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.jiekou.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="qwen/qwen3-235b-a22b-fp8",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=20000,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

信息

提供商
Qwen
量化
fp8

支持的功能

上下文长度
40960
最大输出
20000
推理
支持
Input Capabilities
text
Output Capabilities
text