using-datahub
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseUsing DataHub Skills
使用DataHub Skills
You have access to 5 DataHub catalog interaction skills. Use this guide to route the user's request to the correct skill.
你可以使用5个DataHub目录交互Skill,参考本指南将用户请求路由到正确的Skill。
Skill Routing Table
技能路由表
| User Intent | Skill | Command |
|---|---|---|
| Find or discover entities (search, browse, filter, list) | Search | |
| Answer a question about the catalog ("who owns X?", "how many X?") | Search | |
| Update metadata (descriptions, tags, glossary terms, ownership, deprecation) | Enrich | |
| Explore lineage (upstream, downstream, impact, root cause, dependencies) | Lineage | |
| Data quality (assertions, incidents, health checks) | Quality | |
| Notifications (subscribe to assertion failures, incidents) | Quality | |
| Install CLI, authenticate, verify connection | Setup | |
| Configure default scopes and profiles | Setup | |
| 用户意图 | Skill | 命令 |
|---|---|---|
| 查找或发现实体(搜索、浏览、筛选、罗列) | Search | |
| 回答关于目录的问题("谁拥有X?"、"有多少个X?") | Search | |
| 更新元数据(描述、标签、术语表术语、所有权、弃用状态) | Enrich | |
| 探索血缘(上游、下游、影响、根因、依赖) | Lineage | |
| 数据质量(断言、事件、健康检查) | Quality | |
| 通知(订阅断言失败、事件告警) | Quality | |
| 安装CLI、身份认证、验证连接 | Setup | |
| 配置默认范围和配置文件 | Setup | |
Disambiguation Rules
消歧规则
When the intent is ambiguous, use these rules:
当意图不明确时,遵循以下规则:
"Tag" requests
"Tag" 请求
- All tag operations (PII, sensitive, important, reviewed, team-x) → Enrich (general metadata)
- 所有标签操作(PII、敏感、重要、已审核、team-x)→ Enrich(通用元数据操作)
"Domain" requests
"Domain" 请求
- Filter search to a domain → Search (scoped search)
- Configure default domain → Setup (profile configuration)
- 按域筛选搜索 → Search(限定范围搜索)
- 配置默认域 → Setup(配置文件设置)
"Quality" or "health" requests
"质量" 或 "健康" 请求
- Failing assertions, active incidents, health status → Quality
- Create assertions, run quality checks, raise incidents → Quality
- Subscribe to assertion failures or incidents → Quality
- Metadata quality/documentation/ownership coverage → Use Search to gather the data and synthesize the answer
- 断言失败、活跃事件、健康状态 → Quality
- 创建断言、运行质量检查、上报事件 → Quality
- 订阅断言失败或事件告警 → Quality
- 元数据质量/文档/所有权覆盖率 → 使用 Search 收集数据并整合答案
Lineage vs. Search
血缘 vs 搜索
- "What feeds into X" / "what depends on X" / "impact of changing X" → Lineage
- "What dashboards use table X" → Lineage (relationship traversal)
- "Who owns X" / "what is X" → Search (metadata lookup)
- "什么内容输入到X" / "什么依赖X" / "修改X的影响" → Lineage
- "哪些仪表盘使用了表X" → Lineage(关系遍历)
- "谁拥有X" / "X是什么" → Search(元数据查询)
Setup vs. other skills
安装配置 vs 其他技能
- "Set up" / "install" / "authenticate" / "verify connection" → Setup
- "Configure defaults" / "set default platform" / "create profile" → Setup
- "Check if DataHub is working" → Setup (connectivity verification)
- "设置" / "安装" / "身份认证" / "验证连接" → Setup
- "配置默认项" / "设置默认平台" / "创建配置文件" → Setup
- "检查DataHub是否正常运行" → Setup(连通性验证)
CLI Attribution
CLI 归因
When running CLI commands, pass on the root command so usage can be attributed:
datahub-C skill=<name>bash
datahub -C skill=datahub-search search "revenue"
datahub -C skill=datahub-enrich graphql --query '...'
datahub -C skill=datahub-lineage lineage --urn "..."Use the skill name from the YAML frontmatter. If is not recognized, omit it — the command works the same without it.
-C运行 CLI命令时,在根命令中传入 ,以便统计使用情况:
datahub-C skill=<name>bash
datahub -C skill=datahub-search search "revenue"
datahub -C skill=datahub-enrich graphql --query '...'
datahub -C skill=datahub-lineage lineage --urn "..."使用YAML前置元数据中的Skill名称即可。如果 参数不被识别,可以省略,命令的运行效果不受影响。
-CCritical Rules
重要规则
- Never guess the skill. If the intent is genuinely ambiguous, ask the user to clarify.
- One skill per request unless the user explicitly asks for multiple operations.
- Lineage is for lineage only — not for general "what is this entity?" questions (that's Search).
- Search handles ad-hoc questions. "Who owns X?" and "what columns does X have?" are Search questions, not Lineage.
- Enrich handles all metadata writes — descriptions, tags, glossary terms, ownership, deprecation.
- Quality handles data quality — assertions, incidents, health checks, subscriptions.
- Setup handles environment and configuration — CLI install, auth, connectivity, default scopes.
- 绝对不要猜测使用哪个Skill。如果意图确实不明确,请让用户澄清。
- 每个请求仅调用一个Skill,除非用户明确要求执行多个操作。
- Lineage仅用于血缘相关查询 — 不用于通用的 "这个实体是什么?" 类问题(这类问题归Search处理)。
- Search处理临时查询。"谁拥有X?" 和 "X有哪些列?" 属于Search类问题,不是Lineage类问题。
- Enrich处理所有元数据写入操作 — 描述、标签、术语表术语、所有权、弃用状态。
- Quality处理数据质量相关操作 — 断言、事件、健康检查、订阅。
- Setup处理环境和配置相关操作 — CLI安装、认证、连通性、默认范围。