Qwen3 8B

qwen/qwen3-8b-fp8

Achieves effective integration of reasoning and non-reasoning modes, allowing seamless mode switching during conversations. Its reasoning capability reaches state-of-the-art (SOTA) performance among models of the same scale, and its general capabilities significantly outperform those of Qwen2.5-7B.

价格

企业客户联系客户经理享专属折扣

输入	$0.035 /百万 tokens
输出	$0.138 /百万 tokens

使用以下代码示例来集成我们的API：

1from openai import OpenAI
2
3client = OpenAI(
4    api_key="<Your API Key>",
5    base_url="https://api.jiekou.ai/openai"
6)
7
8response = client.chat.completions.create(
9    model="qwen/qwen3-8b-fp8",
10    messages=[
11        {"role": "system", "content": "You are a helpful assistant."},
12        {"role": "user", "content": "Hello, how are you?"}
13    ],
14    max_tokens=20000,
15    temperature=0.7
16)
17
18print(response.choices[0].message.content)

信息

提供商

Qwen

量化

fp8

支持的功能

上下文长度

128000

最大输出

20000

推理

支持

Input Capabilities

text

Output Capabilities

text