Loading...
Loading...
Compare original and translation side by side
Agent setup: If your agent doesn't auto-load skills (e.g. Claude Code), see agent-compatibility.md once per session.
Agent设置: 如果您的Agent不会自动加载技能(例如Claude Code), 请每个会话查看一次agent-compatibility.md。
| Location | Purpose |
|---|---|
| Pricing overview — model categories, billing units, and link to official pricing page |
| Model catalog (point-in-time snapshot) |
| Official documentation URLs (manual lookup only) |
| Agent self-check: register skills in project config for agents that don't auto-load |
| 位置 | 用途 |
|---|---|
| 定价概览 — 模型分类、计费单位,以及官方定价页面的链接 |
| 模型目录(时间点快照) |
| 官方文档URL(仅手动查询使用) |
| Agent自检:为不会自动加载的Agent在项目配置中注册技能 |
$DASHSCOPE_API_KEYos.environ["QWEN_API_KEY"].env$DASHSCOPE_API_KEYos.environ["QWEN_API_KEY"].env| Model | Context | Thinking |
|---|---|---|
| qwen3.5-plus | 1M | Yes (budget: 81,920) |
| kimi-k2.5 | 256K | Yes (budget: 81,920) |
| glm-5 | 198K | Yes (budget: 32,768) |
| MiniMax-M2.5 | 192K | Yes (budget: 32,768) |
| qwen3-max-2026-01-23 | 256K | Yes (budget: 81,920) |
| qwen3-coder-next | 256K | No |
| qwen3-coder-plus | 1M | No |
| glm-4.7 | 198K | Yes (budget: 32,768) |
sk-sp-...references/codingplan.md| Model | Context | Thinking |
|---|---|---|
| qwen3.5-plus | 1M | Yes (budget: 81,920) |
| kimi-k2.5 | 256K | Yes (budget: 81,920) |
| glm-5 | 198K | Yes (budget: 32,768) |
| MiniMax-M2.5 | 192K | Yes (budget: 32,768) |
| qwen3-max-2026-01-23 | 256K | Yes (budget: 81,920) |
| qwen3-coder-next | 256K | No |
| qwen3-coder-plus | 1M | No |
| glm-4.7 | 198K | Yes (budget: 32,768) |
sk-sp-...references/codingplan.md| Signal | Keywords | Model |
|---|---|---|
| Reasoning | "think step by step", "reason", "analyze" | qwq-plus (text) · qvq-max (vision) |
| Coding | "write code", "implement", "debug" | qwen3-coder-plus |
| OCR / document | "extract text", "OCR", "scan" | qwen-vl-ocr |
| Long context | "long document", "large file" | qwen3.5-plus (1M context) |
| Multimodal (text+image+video) | "analyze image", "understand video" + text | qwen3.5-plus (unified multimodal) |
| Voice interaction / omni | "voice chat", "speak", "listen" | qwen3-omni-flash |
| Built-in tools | "search the web", "run code", "use tools" | qwen3-max (web search, code interpreter) |
| Image editing / style transfer | "edit image", "style transfer", "reference image" | wan2.6-image (preferred) · wan2.5-i2i-preview |
| Image-to-image fusion | "place object", "combine images", "fuse images" | wan2.6-image · wan2.5-i2i-preview |
| Style TTS | "emotion", "tone", "pace" | qwen3-tts-instruct-flash |
| Ambiguous | task doesn't clearly map to one model | compare Recommendation Matrix; ask user to clarify if needed |
| 信号 | 关键词 | 模型 |
|---|---|---|
| 推理 | "think step by step", "reason", "analyze" | qwq-plus (文本) · qvq-max (视觉) |
| 编码 | "write code", "implement", "debug" | qwen3-coder-plus |
| OCR / 文档 | "extract text", "OCR", "scan" | qwen-vl-ocr |
| 长上下文 | "long document", "large file" | qwen3.5-plus (1M上下文) |
| 多模态(文本+图像+视频) | "analyze image", "understand video" + 文本 | qwen3.5-plus (统一多模态) |
| 语音交互 / 全模态 | "voice chat", "speak", "listen" | qwen3-omni-flash |
| 内置工具 | "search the web", "run code", "use tools" | qwen3-max (网页搜索、代码解释器) |
| 图像编辑 / 风格迁移 | "edit image", "style transfer", "reference image" | wan2.6-image (优先) · wan2.5-i2i-preview |
| 图生图融合 | "place object", "combine images", "fuse images" | wan2.6-image · wan2.5-i2i-preview |
| 风格TTS | "emotion", "tone", "pace" | qwen3-tts-instruct-flash |
| 不明确 | 任务没有明确对应到某一个模型 | 对比推荐矩阵;必要时请用户澄清 |
| Pattern | Signals | Guidance |
|---|---|---|
| Interactive / real-time | "chat", "real-time", "interactive" | Prefer flash/turbo variants; enable streaming |
| Batch / offline | "batch", "offline", "background" | Quality model + Batch API (50% off) |
| One-off trial | "try", "test", "experiment" | Quality model; check if free quota is still available in user's console |
| High-volume production | "production", "at scale", "high volume" | Cost-optimize: flash/turbo + context cache |
| Repeated context | "template", "same prompt", "repeated" | Enable context caching for input token discount |
| 模式 | 信号 | 指引 |
|---|---|---|
| 交互式 / 实时 | "chat", "real-time", "interactive" | 优先选择flash/turbo变体;开启流式输出 |
| 批量 / 离线 | "batch", "offline", "background" | 高质量模型 + 批量API(优惠50%) |
| 一次性试用 | "try", "test", "experiment" | 高质量模型;检查用户控制台是否仍有免费额度可用 |
| 高流量生产 | "production", "at scale", "high volume" | 成本优化:flash/turbo + 上下文缓存 |
| 重复上下文 | "template", "same prompt", "repeated" | 开启上下文缓存以获得输入Token优惠 |
| Domain | Default | Quality | Speed | Cost |
|---|---|---|---|---|
| text.chat | qwen3.5-plus | qwen3-max | qwen3.5-flash | qwen-turbo |
| vision.analyze | qwen3-vl-plus | qwen3-vl-plus | qwen3-vl-flash | qwen3-vl-flash |
| omni (voice+vision) | qwen3-omni-flash | qwen3-omni-flash | qwen3-omni-flash | — |
| image.generate | wan2.6-t2i | wan2.6-t2i | wan2.2-t2i-flash | wan2.2-t2i-flash |
| image.edit | wan2.6-image | wan2.6-image | wan2.5-i2i-preview | wan2.5-i2i-preview |
| video.t2v | wan2.6-t2v | wan2.6-t2v | — | — |
| video.i2v | wan2.6-i2v-flash | wan2.6-i2v | wan2.6-i2v-flash | — |
| audio.tts | qwen3-tts-flash | — | qwen3-tts-flash | — |
Degradation: If this skill is not loaded or not available, each execution skill falls back to its own built-in default. This protocol is purely additive — it enhances model selection but never blocks execution.
| 领域 | 默认 | 高质量 | 高速度 | 低成本 |
|---|---|---|---|---|
| text.chat | qwen3.5-plus | qwen3-max | qwen3.5-flash | qwen-turbo |
| vision.analyze | qwen3-vl-plus | qwen3-vl-plus | qwen3-vl-flash | qwen3-vl-flash |
| omni (voice+vision) | qwen3-omni-flash | qwen3-omni-flash | qwen3-omni-flash | — |
| image.generate | wan2.6-t2i | wan2.6-t2i | wan2.2-t2i-flash | wan2.2-t2i-flash |
| image.edit | wan2.6-image | wan2.6-image | wan2.5-i2i-preview | wan2.5-i2i-preview |
| video.t2v | wan2.6-t2v | wan2.6-t2v | — | — |
| video.i2v | wan2.6-i2v-flash | wan2.6-i2v | wan2.6-i2v-flash | — |
| audio.tts | qwen3-tts-flash | — | qwen3-tts-flash | — |
降级策略:如果本技能未加载或不可用,每个执行技能将回退到自身内置的默认配置。本协议仅做补充 — 它会优化模型选择,但绝不会阻塞执行。
| Use Case | Recommended | Why |
|---|---|---|
| General chat/assistant | qwen3.5-plus | Best balance of quality, speed, cost. Also accepts image/video input (multimodal). Thinking enabled by default. |
| Fast responses, low cost | qwen3.5-flash | 3x faster, 70% cheaper than Plus. Thinking enabled by default. |
| Highest quality | qwen3-max | Strongest reasoning. Built-in tools (web search, code interpreter). Supports thinking mode. |
| Code generation | qwen3-coder-next | Best balance of code quality, speed, cost. Agentic coding. |
| Complex reasoning | qwq-plus | Chain-of-thought reasoning specialist |
| Long documents | qwen3.5-plus | Up to 1M context. For >1M needs, see model-list.md. |
| Budget/high volume | qwen-turbo | Cheapest per-token cost |
| 使用场景 | 推荐模型 | 说明 |
|---|---|---|
| 通用聊天/助手 | qwen3.5-plus | 质量、速度、成本的最佳平衡。同时支持图像/视频输入(多模态)。默认开启思考模式。 |
| 快速响应、低成本 | qwen3.5-flash | 比Plus快3倍,便宜70%。默认开启思考模式。 |
| 最高质量 | qwen3-max | 最强推理能力。内置工具(网页搜索、代码解释器)。支持思考模式。 |
| 代码生成 | qwen3-coder-next | 代码质量、速度、成本的最佳平衡。支持智能体编码。如需最高质量请选择 |
| 复杂推理 | qwq-plus | 思维链推理专用模型 |
| 长文档 | qwen3.5-plus | 最高支持1M上下文。如需超过1M的支持,请查看model-list.md。 |
| 预算/高流量 | qwen-turbo | 单位Token成本最低 |
| Use Case | Recommended | Why |
|---|---|---|
| Best quality text-to-image | wan2.6-t2i | Latest model, sync support |
| Image editing / style transfer (1–4 refs) | wan2.6-image | Multi-image composition, subject consistency, 2K output, interleaved text-image |
| Image editing / multi-image fusion (1–3 refs) | wan2.5-i2i-preview | Simpler prompt-based editing, subject consistency, multi-image fusion |
| Interleaved text-image output (tutorials) | wan2.6-image | Mixed text+image generation |
| Fast iteration | wan2.2-t2i-flash | 50% faster generation |
| Flexible resolution | wan2.5-t2i-preview | Custom aspect ratios |
| 使用场景 | 推荐模型 | 说明 |
|---|---|---|
| 最高质量文生图 | wan2.6-t2i | 最新模型,支持同步生成 |
| 图像编辑 / 风格迁移(1-4个参考图) | wan2.6-image | 多图合成、主体一致性、2K输出、图文混排生成 |
| 图像编辑 / 多图融合(1-3个参考图) | wan2.5-i2i-preview | 更简单的基于提示词的编辑、主体一致性、多图融合 |
| 图文混排输出(教程) | wan2.6-image | 混合文本+图像生成 |
| 快速迭代 | wan2.2-t2i-flash | 生成速度快50% |
| 灵活分辨率 | wan2.5-t2i-preview | 支持自定义宽高比 |
| Use Case | Recommended | Why |
|---|---|---|
| Quick video creation | wan2.6-i2v-flash | Fast, multi-shot narrative |
| High quality | wan2.6-i2v | Best visual quality |
| With audio | wan2.5-i2v-preview | Auto-dubbing support |
| 使用场景 | 推荐模型 | 说明 |
|---|---|---|
| 快速视频创作 | wan2.6-i2v-flash | 速度快、支持多镜头叙事 |
| 高质量 | wan2.6-i2v | 最佳视觉质量 |
| 带音频 | wan2.5-i2v-preview | 支持自动配音 |
| Use Case | Recommended | Why |
|---|---|---|
| Highest quality | | Best naturalness, emotional expression, professional scenarios |
| High quality + speed | | Good balance of quality and performance |
| Standard TTS | | Fast, reliable, multi-language, cost-effective |
| Controlled style | | Instruction-guided voice style (tone/emotion) |
| 使用场景 | 推荐模型 | 说明 |
|---|---|---|
| 最高质量 | | 最佳自然度、情感表达、适合专业场景 |
| 高质量+速度 | | 质量和性能的良好平衡 |
| 标准TTS | | 快速、可靠、多语言、高性价比 |
| 可控风格 | | 指令引导的语音风格(语气/情感) |
| Use Case | Recommended | Why |
|---|---|---|
| Best accuracy | qwen3-vl-plus | Highest vision understanding. Thinking mode supported. 256K context. |
| Fast analysis | qwen3-vl-flash | Quick image understanding. Thinking mode supported. |
| Unified text+vision | qwen3.5-plus | Multimodal (text + image + video). Surpasses qwen3-vl series on many benchmarks. Use when both text quality and vision matter. |
| 使用场景 | 推荐模型 | 说明 |
|---|---|---|
| 最高精度 | qwen3-vl-plus | 最高视觉理解能力。支持思考模式。256K上下文。 |
| 快速分析 | qwen3-vl-flash | 快速图像理解。支持思考模式。 |
| 统一图文 | qwen3.5-plus | 多模态(文本 + 图像 + 视频)。在多项基准测试中超过qwen3-vl系列。同时需要文本质量和视觉能力时使用。 |
| Use Case | Recommended | Why |
|---|---|---|
| Voice + vision chat | qwen3-omni-flash | Text/image/audio/video → text or speech. 49 voices, 10 languages. Thinking supported. |
| Real-time voice | qwen3-omni-flash-realtime | Streaming audio input + built-in VAD. 49 voices. |
| 使用场景 | 推荐模型 | 说明 |
|---|---|---|
| 语音+视觉聊天 | qwen3-omni-flash | 文本/图像/音频/视频 → 文本或语音。49种音色、10种语言。支持思考模式。 |
| 实时语音 | qwen3-omni-flash-realtime | 流式音频输入 + 内置VAD。49种音色。 |
Cost = Tokens ÷ 1,000,000 × Unit price成本 = Token数 ÷ 1,000,000 × 单价🚨 CRITICAL — NO EXCEPTIONS: NEVER fabricate, invent, or guess any price figure. If you do not have a confirmed price fromor the official pricing page, you MUST NOT output any number. Instead, direct the user to the official pricing page. Outputting a made-up price is a critical failure — worse than saying "I don't know."references/pricing.md
⚠️ 费用说明:以上费用为基于官方公示单价的预估价格,仅供参考。实际费用受 Token 消耗量、上下文长度阶梯定价、Batch/缓存折扣及计费策略调整等因素影响,请以QwenCloud控制台的实际账单为准。部分模型可能提供限时免费额度,但免费额度的可用性、额度量及有效期随时可能调整,请在控制台确认您的账户是否仍有剩余额度,切勿假设本次调用免费。最新定价详见 模型定价页。
⚠️ Pricing Notice: The cost figures above are estimates calculated from officially published unit prices and are provided for reference only. Actual charges depend on token consumption, tiered context-length pricing, Batch/cache discounts, and billing policy updates. Some models may offer a time-limited free quota, but quota availability, amounts, and validity periods are subject to change — do not assume this call is free. Please verify your remaining quota in the QwenCloud console and refer to the actual bill for definitive costs. See Model Pricing for the latest rates.
🚨 重要 — 无例外:永远不要编造、虚构或猜测任何价格数据。 如果您没有从或官方定价页面获得确认的价格,绝对不要输出任何数字。而是引导用户访问官方定价页面。输出虚构价格是严重失误 — 比说“我不知道”更糟糕。references/pricing.md
⚠️ 费用说明:以上费用为基于官方公示单价的预估价格,仅供参考。实际费用受 Token 消耗量、上下文长度阶梯定价、Batch/缓存折扣及计费策略调整等因素影响,请以QwenCloud控制台的实际账单为准。部分模型可能提供限时免费额度,但免费额度的可用性、额度量及有效期随时可能调整,请在控制台确认您的账户是否仍有剩余额度,切勿假设本次调用免费。最新定价详见 模型定价页。
⚠️ Pricing Notice: The cost figures above are estimates calculated from officially published unit prices and are provided for reference only. Actual charges depend on token consumption, tiered context-length pricing, Batch/cache discounts, and billing policy updates. Some models may offer a time-limited free quota, but quota availability, amounts, and validity periods are subject to change — do not assume this call is free. Please verify your remaining quota in the QwenCloud console and refer to the actual bill for definitive costs. See Model Pricing for the latest rates.
⚠️ Important: The model list above is a point-in-time snapshot and may be outdated. Model availability changes frequently. Always check the official model list for the authoritative, up-to-date catalog before making model decisions. See model-list.md for a more detailed local reference.
⚠️ 重要提示:上述模型列表是时间点快照,可能已过时。模型可用性会频繁变更。在做出模型决策前,请始终查看官方模型列表获取权威的最新目录。 查看model-list.md获取更详细的本地参考。
| Model | Thinking Default | Notes |
|---|---|---|
| qwen3.5-plus | On | Thinking enabled by default. Use |
| qwen3.5-flash | On | Thinking enabled by default. |
| qwen3-max | Off | Use |
| qwen-plus / qwen-flash / qwen-turbo | Off | Hybrid; enable for deeper reasoning at higher output cost. |
| qwen3-vl-plus / qwen3-vl-flash | Off | Vision + thinking for complex visual analysis. |
| qwen3-omni-flash | Off | Thinking supported; audio output not available in thinking mode. |
| qwq-plus / qvq-max | Always on | Pure reasoning models; CoT always active. |
| 模型 | 默认开启思考 | 说明 |
|---|---|---|
| qwen3.5-plus | 是 | 默认开启思考模式。使用 |
| qwen3.5-flash | 是 | 默认开启思考模式。 |
| qwen3-max | 否 | 复杂推理场景使用 |
| qwen-plus / qwen-flash / qwen-turbo | 否 | 混合模式;开启可获得更深的推理能力,但输出Token成本更高。 |
| qwen3-vl-plus / qwen3-vl-flash | 否 | 视觉+思考模式可用于复杂视觉分析。 |
| qwen3-omni-flash | 否 | 支持思考模式;思考模式下无法输出音频。 |
| qwq-plus / qvq-max | 始终开启 | 纯推理模型;思维链始终激活。 |
qwen3.5-plusreferences/pricing.mdqwen3.5-plusreferences/pricing.mdqwencloud-update-check/SKILL.mdpython3 <qwencloud-update-check-dir>/scripts/check_update.py --print-response--forceThe update-check skill is not installed. It keeps you informed about new qwencloud/qwencloud-ai features and bug fixes.
- Install now — I'll run the install command for you
- Skip for now — do nothing this time
- Never remind me — permanently suppress this prompt
npx skills add QwenCloud/qwencloud-ai --skill qwencloud-update-check -ypython3 <this-skill-scripts-dir>/gossamer.py --dismisspython3 <this-skill-scripts-dir>/gossamer.py --dismisspython3 <this-skill-scripts-dir>/gossamer.py --never-installqwencloud-update-check/SKILL.mdpython3 <qwencloud-update-check-dir>/scripts/check_update.py --print-response--force更新检查技能未安装。它会让您及时了解qwencloud/qwencloud-ai的新功能和bug修复。
- 立即安装 — 我将为您运行安装命令
- 暂时跳过 — 本次不做操作
- 永不提醒 — 永久屏蔽此提示
npx skills add QwenCloud/qwencloud-ai --skill qwencloud-update-check -ypython3 <this-skill-scripts-dir>/gossamer.py --dismisspython3 <this-skill-scripts-dir>/gossamer.py --dismisspython3 <this-skill-scripts-dir>/gossamer.py --never-install