大模型 API 定价

探索我们模型 API 的定价。通过透明的费率和灵活的选项,找到适合您需求的正确方案。

Anthropic logo

Anthropic

Anthropic的Claude模型提供先进的安全AI能力,专注于有用、无害、诚实的AI助手体验,并具备强大的推理和对话能力。

模型名称Input Token Range上下文输入(/Mt)缓存写入(/Mt)缓存读取(/Mt)输出(/Mt)Actions
claude-opus-4-8-1,000,000$5$6.25(5m)·$10(1h)$0.5$25去体验
claude-opus-4-7-1,000,000
$4.75$5
$5.9375(5m)·$9.5(1h)$6.25(5m)·$10(1h)
$0.475$0.5
$23.75$25
去体验
claude-opus-4-61-200,0001,000,000$5$6.25(5m)·$10(1h)$0.5$25去体验
200,000-1,000,0001,000,000$5$6.25(5m)·$10(1h)$0.5$25去体验
claude-opus-4-6-dd-1,000,000
$2.75$5
$3.4375(5m)·$5.5(1h)$6.25(5m)·$10(1h)
$0.275$0.5
$13.75$25
去体验
claude-sonnet-4-61-200,0001,000,000$3$3.75(5m)·$6(1h)$0.3$15去体验
200,000-1,000,0001,000,000$3$3.75(5m)·$6(1h)$0.3$15去体验
claude-sonnet-4-6-dd-1,000,000
$1.65$3
$2.0625(5m)·$3.3(1h)$3.75(5m)·$6(1h)
$0.165$0.3
$8.25$15
去体验
claude-opus-4-5-20251101-200,000
$4.75$5
$5.9375(5m)·$9.5(1h)$6.25(5m)·$10(1h)
$0.475$0.5
$23.75$25
去体验
claude-opus-4-5-20251101-dd-200,000
$2.75$5
$3.4375(5m)$6.25(5m)
$0.275$0.5
$13.75$25
去体验
claude-sonnet-4-5-202509291-200,000200,000$3$3.75(5m)·$6(1h)$0.3$15去体验
200,000-1,000,000200,000$6$7.5(5m)·$12(1h)$0.6$22.5去体验
claude-sonnet-4-5-20250929-dd-200,000
$1.65$3
$2.0625(5m)$3.75(5m)
$0.165$0.3
$8.25$15
去体验
claude-haiku-4-5-20251001-20,000$1$1.25(5m)·$2(1h)$0.1$5去体验
claude-haiku-4-5-20251001-dd-200,000
$0.55$1
$0.6875(5m)·$1.1(1h)$1.25(5m)·$2(1h)
$0.055$0.1
$2.75$5
去体验
claude-sonnet-4-20250514-200,000
$2.85$3
$3.5625(5m)$3.75(5m)
$0.285$0.3
$14.25$15
去体验
OpenAI

OpenAI

OpenAI的GPT系列模型提供最先进的语言理解和生成能力,在多种任务中表现出色,是业界领先的AI模型。

模型名称Input Token Range上下文输入(/Mt)缓存读取(/Mt)输出(/Mt)Actions
gpt-5.51-272,0001,050,000$5$0.5$30去体验
272,000-1,050,0001,050,000$10$1$45去体验
gpt-5.5-pro1-272,0001,050,000$30-$180去体验
272,000-1,050,0001,050,000$60-$270去体验
gpt-5.5-r-1,050,000
$0.25$5
$0.025$0.5
$1.5$30
去体验
gpt-5.5-light1-272,0001,050,000
$0.25$5
$0.025$0.5
$1.5$30
去体验
272,000-1,050,0001,050,000
$0.5$10
$0.05$1
$2.25$45
去体验
gpt-5.4-nano-400,000
$0.19$0.2
$0.019$0.02
$1.1875$1.25
去体验
gpt-5.4-mini-400,000
$0.7125$0.75
$0.0712$0.075
$4.275$4.5
去体验
gpt-5.4-pro1-272,0001,050,000$30-$180去体验
272,000-1,050,0001,050,000$60-$270去体验
gpt-5.41-272,0001,050,000$2.5$0.25$15去体验
272,000-1,050,0001,050,000$5$0.5$22.5去体验
gpt-5.3-chat-latest-128,000
$1.6625$1.75
$0.1662$0.175
$13.3$14
去体验
gpt-5.3-codex-400,000
$1.6625$1.75
$0.1662$0.175
$13.3$14
去体验
gpt-5.2-codex-400,000$1.75$0.175$14去体验
gpt-5.2-400,000
$1.6625$1.75
$0.1662$0.175
$13.3$14
去体验
gpt-5.2-pro-400,000
$19.95$21
-
$159.6$168
去体验
gpt-5.2-chat-latest-128,000
$1.6625$1.75
$0.1662$0.175
$13.3$14
去体验
gpt-5.1-codex-max-400,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
gpt-5.1-codex-mini-400,000
$0.2375$0.25
$0.0237$0.025
$1.9$2
去体验
gpt-5.1-codex-400,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
gpt-5.1-chat-latest-128,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
gpt-5.1-400,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
gpt-5-pro-400,000
$14.25$15
-
$114$120
去体验
gpt-5-codex-400,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
gpt-5-chat-latest-400,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
gpt-5-nano-400,000
$0.0475$0.05
$0.0047$0.005
$0.38$0.4
去体验
gpt-5-mini-400,000
$0.2375$0.25
$0.0237$0.025
$1.9$2
去体验
gpt-5-400,000
$1.1875$1.25
$0.1187$0.125
$9.5$10
去体验
OpenAI: GPT OSS 20B-131,072$0.05-$0.2去体验
OpenAI GPT OSS 120B-131,072$0.1-$0.5去体验
gpt-4.1-mini-1,047,576$0.4$0.1$1.6去体验
gpt-4.1-nano-1,047,576$0.1$0.025$0.4去体验
gpt-4.1-1,047,576$2$0.5$8去体验
gpt-4o-mini-128,000
$0.1425$0.15
$0.0712$0.075
$0.57$0.6
去体验
gpt-4o-131,072
$2.375$2.5
$1.1875$1.25
$9.5$10
去体验
Gemini logo

Gemini

Google的Gemini模型提供高质量的语言处理能力,在各种NLP任务中表现出色,并具备强大的多模态能力。

模型名称Input Token Range上下文输入(/Mt)缓存写入(/Mt)缓存读取(/Mt)输出(/Mt)Actions
gemini-3.1-pro-preview1-204,8001,048,576$2$0.375(5m)·$4.5(1h)$0.2$12去体验
204,800-1,048,5761,048,576$4$0.375(5m)·$4.5(1h)$0.4$18去体验
gemini-3.1-flash-lite-preview-1,048,576
$0.2375$0.25
$0.0791(5m)·$0.95(1h)$0.0833(5m)·$1(1h)
$0.0237$0.025
$1.425$1.5
去体验
gemini-3-flash-preview-1,048,576
$0.475$0.5
$0.0788(5m)·$0.95(1h)$0.083(5m)·$1(1h)
$0.0475$0.05
$2.85$3
去体验
gemini-2.5-flash-lite-preview-09-2025-1,048,576
$0.095$0.1
$0.0788(5m)·$0.95(1h)$0.083(5m)·$1(1h)
$0.0095$0.01
$0.38$0.4
去体验
gemini-2.5-flash-lite-1,048,576
$0.095$0.1
$0.0788(5m)·$0.95(1h)$0.083(5m)·$1(1h)
$0.0095$0.01
$0.38$0.4
去体验
gemini-2.5-pro-1,048,576
$1.1875$1.25
$0.3562(5m)·$4.275(1h)$0.375(5m)·$4.5(1h)
$0.1187$0.125
$9.5$10
去体验
gemini-2.5-flash-1,048,576
$0.285$0.3
$0.0788(5m)·$0.95(1h)$0.083(5m)·$1(1h)
$0.0285$0.03
$2.375$2.5
去体验
gemini-2.5-flash-lite-preview-06-17-1,048,576
$0.095$0.1
$0.0788(5m)·$0.95(1h)$0.083(5m)·$1(1h)
$0.0095$0.01
$0.38$0.4
去体验
gemini-2.5-flash-preview-05-20-1,048,576
$0.1425$0.15
$0.0788(5m)·$0.95(1h)$0.083(5m)·$1(1h)
$0.0285$0.03
$3.325$3.5
去体验
gemini-2.5-pro-preview-06-05-1,048,576
$1.1875$1.25
$0.3562(5m)·$4.275(1h)$0.375(5m)·$4.5(1h)
$0.1187$0.125
$9.5$10
去体验
gemini-2.0-flash-lite-1,048,576
$0.0712$0.075
--
$0.285$0.3
去体验
gemini-2.0-flash-20250609-1,048,576
$0.1425$0.15
--
$0.57$0.6
去体验
Gemma3 12B-131,072$0.05--$0.1去体验
Gemma 3 27B-32,768$0.119--$0.2去体验
gemini-3.5-flash-1,048,576$1.5$0.083(5m)·$1(1h)$0.15$9去体验
Llama logo

Llama

Meta的Llama模型提供最先进的语言理解能力,采用开放架构设计,适用于多样化应用场景。

模型名称上下文输入(/Mt)输出(/Mt)操作
Llama 4 Maverick Instruct1,048,576$0.17$0.85去体验
Llama 4 Scout Instruct131,072$0.1$0.5去体验
Llama 3.3 70B Instruct131,072$0.13$0.39去体验
Llama 3.2 3B Instruct32,768$0.03$0.05去体验
Llama 3.1 8B Instruct16,384$0.02$0.05去体验
Qwen logo

Qwen

Qwen系列模型提供高效的语言处理能力,具有多种参数规模,涵盖从轻量级到企业级的解决方案。

模型名称Input Token Range上下文输入(/Mt)输出(/Mt)Actions
Qwen3.5-Plus1-256,0001,000,000$0.4$2.4去体验
256,000-1,000,0001,000,000$1.2$7.2去体验
Qwen3.5-27B-262,144$0.3$2.4去体验
Qwen3.5-122B-A10B-262,144$0.4$3.2去体验
Qwen3.5-35B-A3B-262,144$0.25$2去体验
Qwen3.5-397B-A17B-262,144$0.6$3.6去体验
Qwen3 Coder Next FP8-262,144$0.2$1.5去体验
Qwen3 Next 80B A3B Instruct-65,536$0.15$1.5去体验
Qwen3 Next 80B A3B Thinking-65,536$0.15$1.5去体验
Qwen MT Plus-4,096$0.25$0.75去体验
Qwen3 235B A22b Thinking 2507-131,072$0.3$3去体验
Qwen3 Coder 480B A35B Instruct-262,144$0.29$1.2去体验
Qwen3 235B A22B Instruct 2507-131,072$0.15$0.8去体验
Qwen3 30B A3B-40,960$0.09$0.45去体验
Qwen3 32B-40,960$0.1$0.45去体验
Qwen3 235B A22B-40,960$0.2$0.8去体验
Qwen2.5 7B Instruct-32,000$0.07$0.07去体验
Qwen2.5 VL 72B Instruct-32,768$0.8$0.8去体验
Qwen 2.5 72B Instruct-32,000$0.38$0.4去体验
Wenxin

Baidu

百度的ERNIE模型提供先进的中文语言理解和多模态能力,针对中文应用进行了优化,并具备具有竞争力的价格。

模型名称上下文输入(/Mt)输出(/Mt)操作
ERNIE 4.5 VL 424B A47B123,000$0.42$1.25去体验
ERNIE 4.5 300B A47B123,000$0.28$1.1去体验
ChatGLM

THUDM

来自清华大学的GLM系列模型,具备先进的中文语言理解和生成能力。

模型名称上下文输入(/Mt)缓存读取(/Mt)输出(/Mt)操作
GLM-5.1204,800$1.38$0.26$4.4去体验
GLM-5V-Turbo204,800$1.2$0.24$4去体验
GLM-5-Turbo202,800$1.2$0.24$4去体验
GLM-5204,800$1$0.2$3.2去体验
GLM-OCR32,000$0.03-$0.03去体验
GLM-4.7-Flash200,000$0.07$0.01$0.4去体验
GLM-4.7204,800$0.6-$2.2去体验
GLM 4.5V65,536$0.6-$1.8去体验
GLM-4.5131,072$0.6-$2.2去体验
Sao10K logo

Sao10K

专门针对创意和角色扮演应用优化的微调模型,具有增强的故事叙述能力。

模型名称上下文输入(/Mt)输出(/Mt)操作
L3 8B Stheno V3.28,192$0.05$0.05去体验
Sao10k L3 8B Lunaris 8,192$0.05$0.05去体验
L31 70B Euryale V2.28,192$1.48$1.48去体验
L3 70B Euryale V2.1 8,192$1.48$1.48去体验
Mistralai logo

Mistralai

来自Mistral AI的高效强大语言模型,专为商业和开源应用而设计。

模型名称上下文输入(/Mt)输出(/Mt)操作
Mistral Nemo60,288$0.04$0.17去体验
Mistral 7B Instruct32,768$0.029$0.059去体验
Deepseek logo

Deepseek

来自DeepSeek的先进AI模型,为企业级和研究应用提供前沿的推理能力和具有竞争力的价格。

模型名称上下文输入(/Mt)缓存写入(/Mt)缓存读取(/Mt)输出(/Mt)操作
Deepseek V4 Flash1,048,576$0.14-$0.028$0.28去体验
Deepseek V4 Pro1,048,576$1.74-$0.145$3.48去体验
DeepSeek-OCR 28,192$0.03--$0.03去体验
DeepSeek V3.1163,840$0.27--$1去体验
DeepSeek R1 0528163,840$0.7-$0.35$2.5去体验
DeepSeek V3 0324163,840$0.28$0.14(5m)$0.14$1.14去体验
MiniMax logo

MiniMax

MiniMax AI的先进语言模型提供强大的对话AI能力,在客户服务、内容生成和创意应用中表现优异,并具备强大的多语言支持和企业级可扩展性。

模型名称上下文输入(/Mt)输出(/Mt)操作
MiniMax M11,000,000$0.55$2.2去体验
Gryphe logo

Gryphe

来自Gryphe的创新AI模型,提供专业的语言理解能力,专注于效率和适应性,适用于利基应用。

模型名称上下文输入(/Mt)输出(/Mt)操作
Mythomax L2 13B4,096$0.09$0.09去体验

Mixture of Expert

最先进AI模型的高级集合,具备高级推理、数学证明能力以及跨多个领域的前沿语言理解能力。

模型名称Input Token Range上下文输入(/Mt)缓存写入(/Mt)缓存读取(/Mt)输出(/Mt)Actions
Qwen3.5-Plus1-256,0001,000,000$0.4--$2.4去体验
256,000-1,000,0001,000,000$1.2--$7.2去体验
GLM-5.1-204,800$1.38-$0.26$4.4去体验
XiaomiMiMo/MiMo-V2.5-Pro1-262,1441,048,576$1-$0.2$3去体验
262,144-1,048,5761,048,576$2-$0.4$6去体验
OpenAI: GPT OSS 20B-131,072$0.05--$0.2去体验
OpenAI GPT OSS 120B-131,072$0.1--$0.5去体验
Deepseek V4 Flash-1,048,576$0.14-$0.028$0.28去体验
Deepseek V4 Pro-1,048,576$1.74-$0.145$3.48去体验
DeepSeek V3.1-163,840$0.27--$1去体验
DeepSeek R1 0528-163,840$0.7-$0.35$2.5去体验
DeepSeek V3 0324-163,840$0.28$0.14(5m)$0.14$1.14去体验
MiniMax M2.7-highspeed-204,800$0.6-$0.06$2.4去体验
MiniMax M2.7-204,800$0.3-$0.03$1.2去体验
MiniMax M2.5-highspeed-204,800$0.6-$0.03$2.4去体验
MiniMax M2.5-204,800$0.3-$0.03$1.2去体验
Minimax M2.1-204,800$0.3$0.375(5m)$0.03$1.2去体验
MiniMax M1-1,000,000$0.55--$2.2去体验
GLM-5V-Turbo-204,800$1.2-$0.24$4去体验
GLM-5-Turbo-202,800$1.2-$0.24$4去体验
GLM-5-204,800$1-$0.2$3.2去体验
GLM-4.7-Flash-200,000$0.07-$0.01$0.4去体验
GLM-4.7-204,800$0.6--$2.2去体验
GLM 4.5V-65,536$0.6--$1.8去体验
GLM-4.5-131,072$0.6--$2.2去体验
Kimi K2.5-262,144$0.6-$0.1$3去体验
Kimi K2 Instruct-131,072$0.57--$2.3去体验
Qwen3.5-122B-A10B-262,144$0.4--$3.2去体验
Qwen3.5-35B-A3B-262,144$0.25--$2去体验
Qwen3.5-397B-A17B-262,144$0.6--$3.6去体验
Qwen3 235B A22b Thinking 2507-131,072$0.3--$3去体验
Qwen3 30B A3B-40,960$0.09--$0.45去体验
Qwen3 32B-40,960$0.1--$0.45去体验
Qwen3 235B A22B-40,960$0.2--$0.8去体验
ERNIE 4.5 VL 424B A47B-123,000$0.42--$1.25去体验
ERNIE 4.5 300B A47B-123,000$0.28--$1.1去体验
Llama 4 Maverick Instruct-1,048,576$0.17--$0.85去体验
Llama 4 Scout Instruct-131,072$0.1--$0.5去体验
联系我们