Tongyi Qianfan Document Knowledge Base
The complete technical document knowledge base for Alibaba Cloud Tongyi Qianfan Platform, covering model usage, application development, API references, and other content.
When to Use
Activate this Skill when users are involved in the following scenarios:
- Querying Qianfan platform's model lists, model parameters, calling methods (including structured fields like contextWindow, QPM, pricing, sample code for specific models → check )
- Looking up Qianfan API parameters, request/response formats, error codes
- Learning about Qianfan application development (Agent, RAG, Knowledge Base, Memory, Plugins, etc.)
- Selecting models, comparing model capabilities, understanding model pricing and rate limits
- Using Qianfan SDK / OpenAI compatible interfaces
- Multimodal capabilities such as speech recognition, speech synthesis, image generation, video generation
- Business issues like Token Plan, billing, free quota, etc.
When Not to Use
Do not activate this Skill for the following scenarios that are unrelated to Qianfan documents:
- Inquiring about APIs, models or pricing of other vendors like OpenAI / Anthropic / Google
- General programming issues, framework issues (React, Vue, Spring, etc.)
- Usage of the CLI command itself (that's the responsibility of the skill)
- Alibaba Cloud products unrelated to Qianfan (OSS, ECS, RDS, etc.)
Document Layers
1. Wiki Layer (Synthesis Layer) —
Structured Wiki automatically synthesized by LLM, including three types of pages:
- Topic Pages: Comprehensive documents aggregated by functional domains (, )
- Concept Pages: Cross-cutting concepts across topics (, such as RAG, streaming output, Function Calling, etc.)
- Comparison Pages: Structured comparative analysis of similar solutions (, including comparison tables)
Prioritize checking , which is the index entry for all wiki pages.
Each topic page ends with a
section, listing the corresponding raw original document paths for traceability and in-depth reading.
2. Models Layer (Structured Data of Model Market) —
Structured model metadata directly crawled from the Qianfan console gateway
, corresponding to the "Model Plaza" page. For issues related to specific model capabilities, context length, QPM rate limits, pricing, official sample code, etc., check here first as it is more up-to-date and accurate than the wiki.
Three-layer products, single data source (
/
are machine query entries,
is a human-readable view of
):
- — Family index grouped by primary capabilities (Chinese labels + model codes + links to detailed JSON), suitable for "quick overview"
- — One family per line, including / / / / (union of all items under the family) / / / / lightweight summary (each item only contains / / / / ) / . Suitable for filtering by family, for example:
- List reasoning families:
jq -c 'select(.primaryCapability=="Reasoning") | {slug,name,itemCount}' families.jsonl
- Find families containing a specific provider:
grep '"moonshot-ai"' families.jsonl | jq -c .slug
- — One backbone model per line (flat across families), including / / / / / / / (simplified) / (simplified) / / . Suitable for cross-family batch queries, for example:
- List all models with contextWindow ≥ 1M:
jq -c 'select(.contextWindow>=1000000)' models.jsonl
- Find text models that support function-calling:
grep '"function-calling"' models.jsonl | jq -c '{model,family,contextWindow}'
Join field for the two JSONL files:
models.jsonl[].family == families.jsonl[].slug
. After matching, open
via
to get complete fields (
,
).
models/groups/<slug>.json
— Complete details of a single model family (such as , , ):
- Family layer: ,
- : All backbone model versions under the family (excluding snapshots with date suffixes), which have been trimmed (discarded account/UI fields irrelevant to model capabilities such as / / / / / / / ), retaining: (API call name), , , , , , , , , , , (call examples, flattened structure:
samples.<sdk>.<api>.{curl,python,nodejs,java,docUrl}
, e.g., samples.openai.completionsAPI.python
is directly a code string), (model call parameter definition: obtained from the gateway, including //// for parameters like ////, aligned with Qianfan Playground)
- — Fingerprint cache for incremental crawling, do not reference this for users
Data is refreshed via
pnpm --filter bailian-docs-llm-wiki run crawl:models
.
Capability Code Mapping
The section titles in
and
use English short codes, corresponding to Chinese meanings as follows:
| Code | Chinese Label |
|---|
| Text Generation |
| Reasoning |
| Visual Understanding |
| Image Generation |
| Video Generation |
| Speech Synthesis |
| Speech Recognition |
| Realtime Speech Recognition |
| Realtime Text-to-Speech |
| Realtime Audio Translation |
| Realtime Multimodal |
| Multimodal |
| Multimodal Embedding |
| Translation |
| 3D Generation |
A model often has multiple capabilities. In
, models are categorized by
(primary capability). You can locate the section by Chinese label during search.
3. Raw Layer (Original Layer) —
Original documents crawled from help.aliyun.com, stored by category:
- — Model User Guide
raw/application-user-guide/
— Application User Guide
- — Model API Reference
raw/application-api-reference/
— Application API Reference
When information in the wiki / models layers is insufficient, obtain complete original text from the raw layer.
Strong Constraints
- Answers must be based on actual file content under or , do not fabricate APIs, parameter names, error codes or pricing figures from memory.
- Attach relative paths when referencing (e.g., ) to facilitate user verification.
- If a page has found in , directly bypass this wiki page and fall back to the original text in for answering.
Lookup Process
- Check models first for model specification issues:
- Capabilities, context, QPM, pricing, official sample code of a specific model → Locate the family via → Open
- Cross-family model filtering (e.g., "models with context ≥ 1M", "reasoning models that support function-calling") → grep
- Family-level filtering (e.g., "which pure reasoning families are there", "what families are under moonshot") → grep
- After matching, open the corresponding via to get complete fields like /
- Check wiki index for concepts/usage methods: Read to find the corresponding topic pages, concept pages or comparison pages
- Prioritize concepts/comparisons: For cross-domain issues or solution selection issues, prioritize checking and
- Trace back to raw original text: Access from the list at the end of topic pages to view complete details
- Full-text index: contains the complete directory tree; contains full-text concatenation, which can be searched with grep
- Specific parameters/error codes: Directly use grep under
Quick Location Mapping
| Keywords | Recommended Entry Path |
|---|
| contextWindow / pricing / QPM / sample code / parameter definition of a model | models/groups/<slug>.json
(find slug from ) |
| Cross-family model filtering: Batch search by contextWindow / capability / feature / price | (one model per line, use / ) |
| Family-level filtering: Find families by primaryCapability / providers / itemCount / maxContextWindow | (one family per line, includes items[] summary) |
| Overview of model families / browsing by capability buckets | |
| Model list / Qwen / DeepSeek | wiki/guides/more-about-models.md
|
| Text chat / Chat Completion | wiki/api/text-generation-api-reference.md
|
| Speech synthesis / TTS | wiki/api/speech-synthesis-api-reference.md
|
| Speech recognition / ASR | wiki/api/speech-recognition-api-reference.md
|
| Image generation / image editing | wiki/api/image-generation-api-reference.md
|
| Video generation | wiki/api/video-generation-api-reference.md
|
| Multimodal / Omni | wiki/api/multimodal-api-reference.md
|
| Function Calling / Tool Calling | wiki/concepts/function-calling.md
|
| RAG / Knowledge Base | + wiki/guides/knowledge-base.md
|
| Agent / Application Calling | wiki/guides/application-call.md
|
| OpenAI Compatibility | wiki/concepts/openai-compatibility.md
|
| Token / Billing / Rate Limit | wiki/guides/billing-and-rate-limit.md
|
Actual file names are subject to
; if there is any discrepancy in the above table, return to the index page to search.
Notes
- Documents are continuously updated. In case of conflicts, follow the priority order: > >
( is directly pulled from the console gateway, the most up-to-date; is crawled from the source site; is synthesized by LLM and may be lagging)
- records the last modification time of each raw document
- records the synthesis hash, timestamp, and optional of each wiki page
- is the latest evaluation report (structural issues, broken links, length anomalies, etc.)
- The directory may not have been generated yet (need to run
pnpm --filter bailian-docs-llm-wiki run synthesize
)