Qwen3 235B A22B

qwen/qwen3-235b-a22b-fp8

Integrates thinking (reasoning) and non-thinking modes in a single model, with seamless switching between the two during a conversation. The model's reasoning capability significantly surpasses that of QwQ, and its general capabilities exceed those of Qwen2.5-72B-Instruct, reaching a state-of-the-art (SOTA) level among models of the same scale.

Pricing

Input	$0.2 / million tokens
Output	$0.8 / million tokens
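As a rough illustration of how these per-token prices translate into a per-request cost, here is a minimal sketch. The function name and the example token counts are hypothetical; only the two prices come from the table above.

```python
# Prices from the listing above, in USD per million tokens.
INPUT_PRICE_PER_MILLION = 0.2
OUTPUT_PRICE_PER_MILLION = 0.8


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request (hypothetical helper)."""
    total = (input_tokens * INPUT_PRICE_PER_MILLION
             + output_tokens * OUTPUT_PRICE_PER_MILLION)
    return total / 1_000_000


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

In practice the token counts would come from the `usage` field of the API response rather than from fixed numbers.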

Use the following code example to integrate our API:

from openai import OpenAI

# Point the OpenAI client at the provider's OpenAI-compatible endpoint.
client = OpenAI(
    api_key="<Your API Key>",
    base_url="https://api.highwayapi.ai/openai"
)

response = client.chat.completions.create(
    model="qwen/qwen3-235b-a22b-fp8",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how are you?"}
    ],
    max_tokens=20000,  # the listed maximum output for this model
    temperature=0.7
)

# Print the text of the first (and only) completion choice.
print(response.choices[0].message.content)
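The description mentions switching between thinking and non-thinking modes during a conversation. The upstream Qwen3 documentation describes a soft switch for this: appending `/think` or `/no_think` to a user or system message. The sketch below only builds the message list. The helper name `with_thinking` is hypothetical, and whether this particular endpoint honors the soft switch is an assumption to verify.

```python
def with_thinking(content: str, enabled: bool) -> dict:
    """Build a user message carrying the Qwen3 soft-switch tag (hypothetical helper)."""
    # Assumption: the endpoint passes the tag through to the model unchanged.
    tag = "/think" if enabled else "/no_think"
    return {"role": "user", "content": f"{content} {tag}"}


messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    # Ask for step-by-step reasoning on a harder question.
    with_thinking("Prove that the square root of 2 is irrational.", enabled=True),
]

print(messages[-1]["content"])
```

The resulting list can be passed as the `messages` argument of `client.chat.completions.create`. A later turn could use `enabled=False` for a quick reply without extended reasoning.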

Information

Provider

Alibaba

Quantization

fp8

Supported features

Context length

40960

Max output

20000

Reasoning

Supported

Serverless

Supported

Input Capabilities

text

Output Capabilities

text
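The context length (40960) and max output (20000) listed above together limit how large `max_tokens` can be for a given prompt. A minimal sketch of that arithmetic follows, assuming the context length covers prompt and completion tokens combined (typical for such listings, but not stated here). The helper name is hypothetical.

```python
CONTEXT_LENGTH = 40960  # from the listing above
MAX_OUTPUT = 20000      # from the listing above


def clamp_max_tokens(prompt_tokens: int, requested: int = MAX_OUTPUT) -> int:
    """Return a max_tokens value that fits both listed limits (hypothetical helper)."""
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills or exceeds the context length")
    # Respect the caller's request, the output cap, and the remaining context.
    return min(requested, MAX_OUTPUT, remaining)


# Hypothetical prompt sizes: a short prompt and a very long one.
print(clamp_max_tokens(1_000))
print(clamp_max_tokens(30_000))
```

With a short prompt the output cap is the binding limit; with a long prompt the remaining context is.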