wa-review

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Well-Architected Review

Well-Architected 评审

Step 1: Define the workload scope

步骤1：定义工作负载范围

Ask the user to describe the workload:

What workload would you like me to review? Please share:

Workload name and brief description

Architecture diagram or description (services, data flows, integrations)

Business criticality (revenue-generating, internal tool, compliance-sensitive, etc.)

Current pain points (optional — anything you already know is problematic)

If the user has already provided architecture details, skip the prompt and proceed.

请用户描述工作负载：

您希望我评审哪个工作负载？请分享：

工作负载名称及简要描述

架构图或架构说明（服务、数据流、集成方式）

业务关键性（创收型、内部工具、合规敏感型等）

当前痛点（可选——您已知的任何问题）

如果用户已提供架构细节，跳过此提示直接进行下一步。

Step 2: Identify applicable lens

步骤2：确定适用的视角（Lens）

Determine if a specialized WA Lens applies:

SaaS, Serverless, Data Analytics, Machine Learning, IoT, SAP, Games, Financial Services, Healthcare, etc.

If a lens applies, note it and incorporate lens-specific questions in the review.

判断是否适用专门的WA视角：

SaaS、Serverless、数据分析、机器学习、IoT、SAP、游戏、金融服务、医疗保健等

如果适用某一视角，记录下来并在评审中加入该视角的特定问题。

Step 3: Evaluate each pillar

步骤3：评估每个支柱

For each of the six pillars, assess the workload against key best practices:

针对六大支柱中的每一个，对照关键最佳实践评估工作负载：

Operational Excellence

运营卓越

How are changes deployed? (CI/CD, IaC, rollback strategy)
How is the workload monitored? (metrics, alarms, dashboards)
How are operational events handled? (runbooks, on-call, post-incident reviews)
Is there a continuous improvement process?

如何部署变更？（CI/CD、IaC、回滚策略）
如何监控工作负载？（指标、告警、仪表盘）
如何处理运营事件？（运行手册、值班安排、事后复盘）
是否有持续改进流程？

Security

安全性

How are identities and permissions managed? (IAM, least privilege, federation)
How is data protected? (encryption at rest/transit, key management)
How are security events detected and responded to? (GuardDuty, Security Hub, incident response)
Is network segmentation in place? (VPC, WAF, private subnets)

如何管理身份与权限？（IAM、最小权限原则、联邦身份）
如何保护数据？（静态/传输中加密、密钥管理）
如何检测与响应安全事件？（GuardDuty、Security Hub、事件响应）
是否已实施网络分段？（VPC、WAF、私有子网）

Reliability

可靠性

How does the workload handle component failures? (multi-AZ, retries, circuit breakers)
How is capacity managed? (auto-scaling, load testing, quotas)
How are changes managed to avoid outages? (deployment strategies, health checks)
Are backups and DR tested?

工作负载如何处理组件故障？（多可用区、重试机制、断路器）
如何管理容量？（自动扩缩容、负载测试、配额）
如何管理变更以避免中断？（部署策略、健康检查）
备份与灾难恢复（DR）是否经过测试？

Performance Efficiency

性能效率

Are the right resource types and sizes selected? (instance types, storage tiers, Graviton)
How is performance monitored? (latency percentiles, bottleneck identification)
Are there opportunities for caching, CDN, or async processing?
Are database access patterns optimized?

是否选择了合适的资源类型与规格？（实例类型、存储层级、Graviton）
如何监控性能？（延迟百分位数、瓶颈识别）
是否有缓存、CDN或异步处理的优化机会？
数据库访问模式是否已优化？

Cost Optimization

成本优化

Are resources right-sized? (utilization metrics, Savings Plans, Reserved Instances)
Are there idle or orphaned resources?
Is the pricing model appropriate? (on-demand vs spot vs reserved, serverless vs provisioned)
Is cost visibility and allocation in place?

资源是否已合理调整规格？（利用率指标、Savings Plans、预留实例）
是否存在闲置或孤立资源？
定价模型是否合适？（按需 vs 竞价 vs 预留，Serverless vs 预置）
是否具备成本可见性与分配机制？

Sustainability

可持续性

Is resource utilization maximized? (right-sizing, scaling to zero)
Are managed services used where possible?
Is data lifecycle managed? (tiering, expiration, compression)
Are efficient compute options used? (Graviton, serverless)

是否最大化了资源利用率？（合理调整规格、缩容至零）
是否尽可能使用托管服务？
是否管理了数据生命周期？（分层、过期、压缩）
是否使用了高效计算选项？（Graviton、Serverless）

Step 4: Classify findings

步骤4：分类发现结果

For each finding, assign:

Severity: 🔴 High Risk Issue (HRI) | 🟡 Medium Risk Issue (MRI) | 🟢 Improvement opportunity
Pillar: Which pillar it belongs to
Impact: What could go wrong if not addressed
Effort: Low / Medium / High to remediate
AWS Services: Which services to use for remediation

Calibration guidance:

If the architecture already follows best practices (multi-AZ, WAF, canary deployments, encryption, monitoring), most findings should be 🟢 Improvement opportunities — not 🔴 HRI
A mature architecture should score 4–5 on most pillars. Resist over-flagging where good practices exist.
For well-architected systems, focus on advanced improvements: chaos engineering, multi-region DR, cost modeling, game days, sustainability gains
Acknowledge existing strengths explicitly before listing gaps

针对每个发现结果，分配以下属性：

严重程度：🔴 高风险问题（HRI）| 🟡 中风险问题（MRI）| 🟢 改进机会
所属支柱：该结果属于哪个支柱
影响：若不解决可能出现的问题
修复工作量：低 / 中 / 高
AWS服务：修复可使用的AWS服务

校准指南：

如果架构已遵循最佳实践（多可用区、WAF、金丝雀部署、加密、监控），大多数发现结果应为🟢改进机会——而非🔴高风险问题
成熟的架构在大多数支柱上应得4–5分。对于已采用良好实践的部分，避免过度标记
对于架构完善的系统，聚焦进阶改进：混沌工程、多区域灾难恢复、成本建模、故障演练、可持续性提升
在列出差距前，明确认可现有优势

Step 5: Produce the report

步骤5：生成报告

Output a structured report:

markdown

undefined

输出结构化报告：

markdown

undefined

Well-Architected Review: {Workload Name}

Well-Architected 评审：{工作负载名称}

Summary

摘要

Date: {date}
Pillars reviewed: 6/6
Lens applied: {lens or "General"}
Findings: {X} HRI, {Y} MRI, {Z} Improvements

日期：{date}
评审支柱数：6/6
适用视角：{lens或"通用"}
发现结果：{X}个高风险问题，{Y}个中风险问题，{Z}个改进机会

Pillar Scorecard

支柱评分卡

Pillar	Score (1-5)	Key Gap
Operational Excellence	{score}	{gap}
Security	{score}	{gap}
Reliability	{score}	{gap}
Performance Efficiency	{score}	{gap}
Cost Optimization	{score}	{gap}
Sustainability	{score}	{gap}

支柱	评分（1-5）	关键差距
运营卓越	{score}	{gap}
安全性	{score}	{gap}
可靠性	{score}	{gap}
性能效率	{score}	{gap}
成本优化	{score}	{gap}
可持续性	{score}	{gap}

High Risk Issues

高风险问题

{For each HRI: pillar, description, impact, recommendation, effort, AWS services to use}

{每个高风险问题：所属支柱、描述、影响、建议、工作量、可使用的AWS服务}

Medium Risk Issues

中风险问题

{For each MRI: pillar, description, impact, recommendation, effort, AWS services}

{每个中风险问题：所属支柱、描述、影响、建议、工作量、AWS服务}

Improvement Opportunities

改进机会

{For each improvement: pillar, description, benefit, recommendation, AWS services}

{每个改进项：所属支柱、描述、收益、建议、AWS服务}

Prioritized Remediation Plan

优先级修复计划

Quick Wins (< 1 week)

快速见效项（<1周）

{Low-effort, high-impact fixes — configuration changes, enabling services}

{低工作量、高影响的修复——配置变更、启用服务}

Foundation (1-4 weeks)

基础项（1-4周）

{Core architectural improvements — multi-AZ, monitoring, CI/CD}

{核心架构改进——多可用区、监控、CI/CD}

Strategic (1-3 months)

战略项（1-3个月）

{Major changes — re-architecture, multi-region, compliance programs}

{重大变更——重构、多区域部署、合规项目}

Next Steps

下一步行动

{Concrete actions the team should take this week}

undefined

{团队本周应采取的具体行动}

undefined

Step 6: Offer follow-up

步骤6：提供后续支持

After delivering the report, offer:

Would you like me to:

Deep-dive into any specific pillar?

Create an implementation plan for a specific finding?

Generate IaC templates to remediate an issue?

Compare alternative approaches for a recommendation?

Run a focused assessment (security, reliability, cost, etc.)?

交付报告后，提供以下选项：

您是否需要我：

深入分析特定支柱？

为特定发现结果制定实施计划？

生成IaC模板以修复问题？

对比建议方案的替代方法？

开展专项评估（安全、可靠性、成本等）？