aws-observability
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseAWS Observability
AWS可观测性
Overview
概述
Domain expertise for AWS observability across metrics, logs, and traces. Covers CloudWatch platform capabilities (alarms, dashboards, Log Insights, custom metrics, EMF), X-Ray trace analysis, CloudTrail operational auditing, and ADOT collector configuration.
Works best with the AWS MCP server — enables running CLI commands, querying CloudWatch, and validating configurations directly. All guidance also works with standard AWS CLI access.
Note: Reference files contain specific runtime versions, quota values, and feature matrices that may change. When precision matters (e.g., deploying to production, choosing a runtime, or checking a quota), confirm values against current AWS documentation rather than relying solely on the values in these files.
涵盖指标、日志和追踪的AWS可观测性领域专业知识。包含CloudWatch平台功能(告警、仪表板、Log Insights、自定义指标、EMF)、X-Ray追踪分析、CloudTrail操作审计以及ADOT采集器配置。
最佳搭配AWS MCP服务器 —— 支持直接运行CLI命令、查询CloudWatch并验证配置。所有指导也适用于标准AWS CLI访问。
**注意:**参考文件包含特定的运行时版本、配额值和功能矩阵,这些内容可能会发生变化。当需要精准信息时(例如部署到生产环境、选择运行时或检查配额),请对照当前AWS文档确认数值,不要仅依赖这些文件中的值。
Routing
路由
| User need | Action |
|---|---|
| Writing Log Insights queries | Read log-insights.md |
| Configuring alarms (metric, composite, anomaly) | Read alarms.md |
| Publishing custom metrics or using EMF | Read metrics.md |
| Setting up X-Ray tracing or ADOT | Read tracing.md |
| Building dashboards | Read dashboards.md |
| Debugging observability issues | Read troubleshooting.md — starts with the 5 most common fixes |
| Debugging canary failures | Read synthetics.md — see Common failures table |
| CloudTrail operational auditing | Read cloudtrail.md |
| Setting up Lambda monitoring with CDK | Use alarm-template.ts as a starting point |
| Creating synthetic canaries | Read synthetics.md |
| Configuring ADOT collector | Use otel-config.yaml as a starting point |
| Spans multiple areas | Read the most specific reference first, then consult others as needed |
| 用户需求 | 操作 |
|---|---|
| 编写Log Insights查询 | 阅读log-insights.md |
| 配置告警(指标、复合、异常) | 阅读alarms.md |
| 发布自定义指标或使用EMF | 阅读metrics.md |
| 设置X-Ray追踪或ADOT | 阅读tracing.md |
| 构建仪表板 | 阅读dashboards.md |
| 排查可观测性问题 | 阅读troubleshooting.md —— 从5个最常见的修复方案开始 |
| 排查金丝雀失败问题 | 阅读synthetics.md —— 查看常见失败表格 |
| CloudTrail操作审计 | 阅读cloudtrail.md |
| 使用CDK设置Lambda监控 | 以alarm-template.ts为起点 |
| 创建合成监控金丝雀 | 阅读synthetics.md |
| 配置ADOT采集器 | 以otel-config.yaml为起点 |
| 涉及多个领域 | 先阅读最相关的参考文档,再根据需要查阅其他文档 |
Files
文件
| File | Content |
|---|---|
| alarms.md | Metric, composite, anomaly detection alarms — configuration, constraints, recommended defaults |
| log-insights.md | Complete query syntax, commands, functions, known issues, reusable query library |
| metrics.md | Custom metrics, EMF spec, metric filters, high-resolution, retention |
| tracing.md | X-Ray → ADOT migration, sampling rules, annotations vs metadata, collector config |
| dashboards.md | Widget types, cross-account/region, dynamic labels, sharing |
| troubleshooting.md | Error → cause → fix for all observability services |
| cloudtrail.md | Operational auditing, event types, S3+Athena queries |
| synthetics.md | Canary runtime/blueprint constraints, VPC networking, common failures |
| alarm-template.ts | Best-practice CDK Lambda monitoring (alarms + dashboard) |
| otel-config.yaml | ADOT collector config for X-Ray traces + CloudWatch EMF metrics |
| 文件 | 内容 |
|---|---|
| alarms.md | 指标告警、复合告警、异常检测告警 —— 配置、约束、推荐默认值 |
| log-insights.md | 完整查询语法、命令、函数、已知问题、可复用查询库 |
| metrics.md | 自定义指标、EMF规范、指标过滤器、高分辨率、保留策略 |
| tracing.md | X-Ray → ADOT迁移、采样规则、注解与元数据、采集器配置 |
| dashboards.md | 组件类型、跨账号/区域、动态标签、共享 |
| troubleshooting.md | 所有可观测性服务的错误→原因→修复方案 |
| cloudtrail.md | 操作审计、事件类型、S3+Athena查询 |
| synthetics.md | 金丝雀运行时/蓝图约束、VPC网络、常见失败 |
| alarm-template.ts | 最佳实践CDK Lambda监控(告警+仪表板) |
| otel-config.yaml | 用于X-Ray追踪+CloudWatch EMF指标的ADOT采集器配置 |