Tongyi Qianfan Document Knowledge Base

The complete technical document knowledge base for Alibaba Cloud Tongyi Qianfan Platform, covering model usage, application development, API references, and other content.

When to Use

Activate this Skill when users are involved in the following scenarios:

Querying Qianfan platform's model lists, model parameters, calling methods (including structured fields like contextWindow, QPM, pricing, sample code for specific models → check
```
models/
```
)
Looking up Qianfan API parameters, request/response formats, error codes
Learning about Qianfan application development (Agent, RAG, Knowledge Base, Memory, Plugins, etc.)
Selecting models, comparing model capabilities, understanding model pricing and rate limits
Using Qianfan SDK / OpenAI compatible interfaces
Multimodal capabilities such as speech recognition, speech synthesis, image generation, video generation
Business issues like Token Plan, billing, free quota, etc.

When Not to Use

Do not activate this Skill for the following scenarios that are unrelated to Qianfan documents:

Inquiring about APIs, models or pricing of other vendors like OpenAI / Anthropic / Google
General programming issues, framework issues (React, Vue, Spring, etc.)
Usage of the
```
bl
```
CLI command itself (that's the responsibility of the
```
bailian-cli
```
skill)
Alibaba Cloud products unrelated to Qianfan (OSS, ECS, RDS, etc.)

Document Layers

1. Wiki Layer (Synthesis Layer) —

wiki/

Structured Wiki automatically synthesized by LLM, including three types of pages:

Topic Pages: Comprehensive documents aggregated by functional domains (
```
wiki/guides/*.md
```
,
```
wiki/api/*.md
```
)
Concept Pages: Cross-cutting concepts across topics (
```
wiki/concepts/*.md
```
, such as RAG, streaming output, Function Calling, etc.)
Comparison Pages: Structured comparative analysis of similar solutions (
```
wiki/comparisons/*.md
```
, including comparison tables)

Prioritize checking
wiki/index.md
, which is the index entry for all wiki pages.

Each topic page ends with a

## Source Documents

section, listing the corresponding raw original document paths for traceability and in-depth reading.

2. Models Layer (Structured Data of Model Market) —

models/

Structured model metadata directly crawled from the Qianfan console gateway

listFoundationModels

, corresponding to the "Model Plaza" page. For issues related to specific model capabilities, context length, QPM rate limits, pricing, official sample code, etc., check here first as it is more up-to-date and accurate than the wiki.

Three-layer products, single data source (

families.jsonl

models.jsonl

are machine query entries,

index.md

is a human-readable view of

families.jsonl

```
models/index.md
```
— Family index grouped by primary capabilities (Chinese labels + model codes + links to detailed JSON), suitable for "quick overview"

models/families.jsonl

— One family per line, including

slug

name

description

primaryCapability

capabilities

(union of all items under the family) /

providers

itemCount

maxContextWindow

/ lightweight

items[]

summary (each item only contains

model

name

contextWindow

capabilities

openSource

) /

detailPath

. Suitable for filtering by family, for example:

List reasoning families:

jq -c 'select(.primaryCapability=="Reasoning") | {slug,name,itemCount}' families.jsonl

Find families containing a specific provider:

grep '"moonshot-ai"' families.jsonl | jq -c .slug

models/models.jsonl

— One backbone model per line (flat across families), including

model

family

capabilities

features

contextWindow

maxInputTokens

maxOutputTokens

prices

(simplified) /

qpmInfo

(simplified) /

docUrl

detailPath

. Suitable for cross-family batch queries, for example:

List all models with contextWindow ≥ 1M:

jq -c 'select(.contextWindow>=1000000)' models.jsonl

Find text models that support function-calling:

grep '"function-calling"' models.jsonl | jq -c '{model,family,contextWindow}'

Join field for the two JSONL files:

models.jsonl[].family == families.jsonl[].slug

. After matching, open

groups/<slug>.json

via

detailPath

to get complete fields (

samples

predictConfig

```
models/groups/<slug>.json
```
— Complete details of a single model family (such as
```
qwen3-max
```
,
```
deepseek
```
,
```
wan-image-to-video
```
):
- Family layer:
```
name
```
  ,
```
description
```
- ```
items[]
```
  : All backbone model versions under the family (excluding snapshots with date suffixes), which have been trimmed (discarded account/UI fields irrelevant to model capabilities such as
```
permissions
```
  /
```
activationStatus
```
  /
```
quota
```
  /
```
scope
```
  /
```
tags
```
  /
```
license
```
  /
```
serviceSites
```
  /
```
modelInfo
```
  ), retaining:
```
model
```
  (API call name),
```
modelAlias
```
  ,
```
contextWindow
```
  ,
```
maxInputTokens
```
  ,
```
maxOutputTokens
```
  ,
```
capabilities
```
  ,
```
features
```
  ,
```
provider
```
  ,
```
docUrl
```
  ,
```
prices
```
  ,
```
qpmInfo
```
  ,
```
samples
```
  (call examples, flattened structure:
```
samples.<sdk>.<api>.{curl,python,nodejs,java,docUrl}
```
  , e.g.,
```
samples.openai.completionsAPI.python
```
  is directly a code string),
```
predictConfig
```
  (model call parameter definition: obtained from the
```
getPredictParamConfig
```
  gateway, including
```
name
```
  /
```
key
```
  /
```
default
```
  /
```
tip
```
  /
```
range
```
  for parameters like
```
system
```
  /
```
temperature
```
  /
```
top_p
```
  /
```
enable_search
```
  /
```
enable_thinking
```
  , aligned with Qianfan Playground)
```
models/meta.json
```
— Fingerprint cache for incremental crawling, do not reference this for users

Data is refreshed via

pnpm --filter bailian-docs-llm-wiki run crawl:models

Capability Code Mapping

The section titles in

index.md

and

items[].capabilities[]

use English short codes, corresponding to Chinese meanings as follows:

Code	Chinese Label
`TG`	Text Generation
`Reasoning`	Reasoning
`VU`	Visual Understanding
`IG`	Image Generation
`VG`	Video Generation
`TTS`	Speech Synthesis
`ASR`	Speech Recognition
`Realtime-ASR`	Realtime Speech Recognition
`Realtime-Text-to-Speech`	Realtime Text-to-Speech
`Realtime-Audio-Translate`	Realtime Audio Translation
`Realtime-Omni`	Realtime Multimodal
`Multimodal-Omni`	Multimodal
`ME`	Multimodal Embedding
`TR`	Translation
`3D-generation`	3D Generation

A model often has multiple capabilities. In

index.md

, models are categorized by

capabilities[0]

(primary capability). You can locate the section by Chinese label during search.

3. Raw Layer (Original Layer) —

raw/

Original documents crawled from help.aliyun.com, stored by category:

```
raw/model-user-guide/
```
— Model User Guide
```
raw/application-user-guide/
```
— Application User Guide
```
raw/model-api-reference/
```
— Model API Reference
```
raw/application-api-reference/
```
— Application API Reference

When information in the wiki / models layers is insufficient, obtain complete original text from the raw layer.

Strong Constraints

Answers must be based on actual file content under
wiki/
or
raw/
, do not fabricate APIs, parameter names, error codes or pricing figures from memory.
Attach relative paths when referencing (e.g.,
```
wiki/api/qwen-api.md
```
) to facilitate user verification.
If a page has
```
qualityScore <= 2
```
found in
```
wiki-metadata.json
```
, directly bypass this wiki page and fall back to the original text in
```
raw/
```
for answering.

Lookup Process

Check models first for model specification issues:
- Capabilities, context, QPM, pricing, official sample code of a specific model → Locate the family via
```
index.md
```
  → Open
```
groups/<slug>.json
```
- Cross-family model filtering (e.g., "models with context ≥ 1M", "reasoning models that support function-calling") → grep
```
models.jsonl
```
- Family-level filtering (e.g., "which pure reasoning families are there", "what families are under moonshot") → grep
```
families.jsonl
```
- After matching, open the corresponding
```
groups/<slug>.json
```
  via
```
detailPath
```
  to get complete fields like
```
samples
```
  /
```
predictConfig
```
Check wiki index for concepts/usage methods: Read
```
wiki/index.md
```
to find the corresponding topic pages, concept pages or comparison pages
Prioritize concepts/comparisons: For cross-domain issues or solution selection issues, prioritize checking
```
wiki/concepts/*
```
and
```
wiki/comparisons/*
```
Trace back to raw original text: Access
```
raw/.../*.md
```
from the
```
## Source Documents
```
list at the end of topic pages to view complete details
Full-text index:
```
llms.txt
```
contains the complete directory tree;
```
llms-full.txt
```
contains full-text concatenation, which can be searched with grep
Specific parameters/error codes: Directly use grep under
```
raw/
```

Quick Location Mapping

Keywords	Recommended Entry Path
contextWindow / pricing / QPM / sample code / parameter definition of a model	`models/groups/<slug>.json` (find slug from `models/index.md` )
Cross-family model filtering: Batch search by contextWindow / capability / feature / price	`models/models.jsonl` (one model per line, use `grep` / `jq` )
Family-level filtering: Find families by primaryCapability / providers / itemCount / maxContextWindow	`models/families.jsonl` (one family per line, includes items[] summary)
Overview of model families / browsing by capability buckets	`models/index.md`
Model list / Qwen / DeepSeek	`wiki/guides/more-about-models.md`
Text chat / Chat Completion	`wiki/api/text-generation-api-reference.md`
Speech synthesis / TTS	`wiki/api/speech-synthesis-api-reference.md`
Speech recognition / ASR	`wiki/api/speech-recognition-api-reference.md`
Image generation / image editing	`wiki/api/image-generation-api-reference.md`
Video generation	`wiki/api/video-generation-api-reference.md`
Multimodal / Omni	`wiki/api/multimodal-api-reference.md`
Function Calling / Tool Calling	`wiki/concepts/function-calling.md`
RAG / Knowledge Base	`wiki/concepts/rag.md` + `wiki/guides/knowledge-base.md`
Agent / Application Calling	`wiki/guides/application-call.md`
OpenAI Compatibility	`wiki/concepts/openai-compatibility.md`
Token / Billing / Rate Limit	`wiki/guides/billing-and-rate-limit.md`

Actual file names are subject to
wiki/index.md
; if there is any discrepancy in the above table, return to the index page to search.

Notes

Documents are continuously updated. In case of conflicts, follow the priority order:
```
models/
```
>
```
raw/
```
>
```
wiki/
```
(
```
models/
```
is directly pulled from the console gateway, the most up-to-date;
```
raw/
```
is crawled from the source site;
```
wiki/
```
is synthesized by LLM and may be lagging)
```
metadata.json
```
records the last modification time of each raw document
```
wiki-metadata.json
```
records the synthesis hash, timestamp, and optional
```
qualityScore
```
of each wiki page
```
wiki/eval-report.md
```
is the latest evaluation report (structural issues, broken links, length anomalies, etc.)
The
```
wiki/
```
directory may not have been generated yet (need to run
```
pnpm --filter bailian-docs-llm-wiki run synthesize
```
)

bailian-docs-llm-wiki

NPX Install

Tags

SKILL.md Content (Chinese)