n8n-production-readiness

n8n Production Readiness

Match workflow hardening to actual risk. Not every workflow needs the same rigor.

"The workflow you build locally... that's maybe 20% of what you actually need. The other 80% is security, validation, logging, and error handling."

How to Use This Skill (AI Instructions)

The tier system adapts to the user. Some users want to specify a tier, others just want to build. Support both modes while always being ready to recommend changes based on what you observe.

After the Initial Prompt: Ask Once

After the user's first message about an n8n workflow, ask once to establish the tier — but make it easy to skip:

"Quick question before we dive in — what level of hardening does this workflow need?

Tier 1 (Internal/Prototype): Quick and simple. Basic error handling, minimal setup.

Tier 2 (Production): Client-facing or business-critical. Full validation, logging, proper error responses.

Tier 3 (Mission-Critical): Payments, compliance, high-volume. Everything plus monitoring, rollback plans, idempotency.

Or just say 'autopilot' and I'll figure it out as we go based on what you're building."

If the user picks a tier: Build to that tier. Respect their choice, but still monitor for signals that suggest they need to adjust (see Tier Change Recommendations below).

If the user says "autopilot" or skips the question: Infer the tier silently from context and adapt as you go. Don't mention tiers again unless recommending a change.

If the user doesn't respond to the tier question: Assume autopilot mode and proceed.

Respecting Explicit Tier Requests

If a user explicitly asks for a tier (now or later in the conversation):

Build exactly to that tier's specifications
If they ask "what's in Tier 2?" — explain it
If they say "make this Tier 3" — upgrade accordingly
If they say "just keep it Tier 1" — simplify and remove hardening

Users who understand tiers get full control. Don't second-guess them on every decision — but still flag concerns if you observe serious issues.

Tier Lookup Requests

If a user asks to see the tiers, asks you to explain them, or wants to understand the differences, provide the full breakdown:

Tier 1 (Internal/Prototype)

Basic null checks, simple try-catch

n8n's built-in error handling

~10% extra build time

Example: Slack notifications, personal automations

Tier 2 (Production)

Full input validation at entry point

External logging (Supabase/Postgres)

Proper HTTP status codes (400, 401, 404, 500)

Error notifications to team

Pre-deployment breaking tests

~80% extra build time

Example: Client form handlers, business integrations

Tier 3 (Mission-Critical)

Everything in Tier 2, plus:

Monitoring dashboards, alerting (PagerDuty)

Idempotency keys, rate limiting

Rollback strategy, audit logging

2-3x build time

Example: Payment processing, HIPAA compliance

Autopilot Mode: Silent Tier Assessment

When in autopilot (or if the user skipped tier selection), infer the tier from context clues:

Context Clues	Internal Tier	Your Approach
"quick", "test", "just for me", "internal", "prototype", "playing around"	Tier 1	Build fast, basic error handling only
"client", "customer", "users", "production", "deploy", "business", "launch"	Tier 2	Include validation, logging, status codes
"payment", "Stripe", "checkout", "HIPAA", "compliance", "SLA", "enterprise", "high volume", "can't fail"	Tier 3	Full hardening + discuss monitoring/ops
Ambiguous or no clear signals	Tier 1	Start simple, monitor for escalation triggers

In autopilot mode, don't announce the tier. Just build appropriately and intervene if a tier change is needed.
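
The context-clue table above can be read as a simple keyword lookup. The sketch below is one possible interpretation, not a prescribed implementation: the keyword lists are copied from the table, while the function name `inferTier`, the whole-word matching, and the "highest tier wins" ordering are assumptions made for illustration.

```javascript
// Keyword lists are taken from the context-clue table.
// Checking Tier 3 first (so the strictest match wins) is an assumption.
const TIER_SIGNALS = [
  { tier: 3, keywords: ["payment", "stripe", "checkout", "hipaa", "compliance", "sla", "enterprise", "high volume", "can't fail"] },
  { tier: 2, keywords: ["client", "customer", "users", "production", "deploy", "business", "launch"] },
  { tier: 1, keywords: ["quick", "test", "just for me", "internal", "prototype", "playing around"] },
];

// Returns 1, 2, or 3 for a user message (hypothetical helper).
function inferTier(message) {
  const text = String(message).toLowerCase();
  for (const { tier, keywords } of TIER_SIGNALS) {
    // Whole-word match so that "sla" does not fire on "slack".
    const hit = keywords.some((keyword) => new RegExp("\\b" + keyword + "\\b").test(text));
    if (hit) {
      return tier;
    }
  }
  // Ambiguous or no clear signals: start at Tier 1, as the table says.
  return 1;
}

console.log(inferTier("Quick Slack notification just for me")); // 1
console.log(inferTier("Stripe checkout for our store"));        // 3
```

Exact whole-word matching is deliberately strict, so plurals such as "clients" are not caught. A real assistant would weigh the whole conversation rather than isolated keywords.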

Tier Change Recommendations (Up or Down)

Whether the user picked a tier or is in autopilot, monitor the conversation and recommend tier changes in either direction when the task and outcome warrant it.

Recommend Moving UP a Tier

Recommend Moving DOWN a Tier

When NOT to Recommend a Change

Don't recommend moving up if:

User explicitly chose a lower tier and the workflow is working fine
The issue is a simple bug, not a systemic pattern
User is in early exploration/prototyping phase

Don't recommend moving down if:

User explicitly chose a higher tier for good reasons
The workflow handles sensitive data or money
User has mentioned compliance or SLA requirements

Trust user judgment, but flag concerns:

"I know you want to keep this at Tier 1, and that's fine for now — just know that once this goes to production, you'll probably want to add logging at minimum. I can help with that when you're ready."

How to Recommend Changes

When suggesting tier changes, be direct but not pushy:

State what you're observing — the pattern that triggered the recommendation
Explain the benefit — why this tier's patterns would help
Ask, don't mandate — "Want me to add it?" / "Should we upgrade?"
Respect the answer — if they say no, continue at current tier

If the user explicitly chose a tier and you're recommending a change:

"I know you said Tier 1, but we've been debugging this for a while and I think Tier 2 logging would save us time here. Up to you — want me to add it, or keep it simple?"

If in autopilot mode:

"I'm going to add some validation and logging here — we're hitting the kind of silent failures that these patterns are designed to catch. This'll take a few extra minutes but should make debugging much faster."

Lifecycle-Aware Behavior

Adjust your approach based on where the user is in the workflow lifecycle:

Lifecycle Stage	How to Detect	Your Approach
Exploring/Prototyping	"trying to figure out", "is this possible?", "how would I..."	Tier 1. Fast, minimal. Get it working first.
Building	"build me", "create", "I need a workflow that..."	Ask tier or infer from context.
Testing	"let me test", "trying it out", "it works but..."	Stay at current tier. Focus on the specific issue.
Debugging	"not working", "error", "broken", "help"	Monitor for escalation triggers. Recommend logging if stuck.
Pre-deployment	"ready to deploy", "going live", "production"	Recommend Tier 2 minimum if not already there.
Post-deployment issues	"was working, now broken", "users are reporting", "in production"	Tier 2+. Recommend logging immediately to diagnose.
Scaling	"more users", "growing", "volume increasing"	Discuss Tier 3 patterns as relevant.

Tier Definitions (Reference)

Tier 1: Internal / Prototype

Context signals: "quick", "simple", "just for me", "internal", "prototype", "test", "playing around"

What to include:

Basic null checks (`|| {}`, `|| ''`)
Simple try-catch around risky operations
n8n's built-in error handling (Error Trigger → Slack notification)

What to skip:

External logging database
Comprehensive input validation
Full status code handling
Extensive breaking tests

Example workflows: "New GitHub issue → Slack notification", "Daily weather → personal email", "RSS feed → Discord"

Build time impact: ~10% extra beyond core logic
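
As a rough illustration of Tier 1 hardening, the sketch below applies the `|| {}` and `|| ''` null checks and a simple try-catch to the "New GitHub issue → Slack notification" example. It is plain JavaScript of the kind that could be adapted to an n8n Code node. The helper names `safeParse` and `buildSlackMessage` are hypothetical, and the payload fields are an assumed shape for illustration.

```javascript
// Simple try-catch around a risky operation: parsing untrusted JSON.
function safeParse(raw) {
  try {
    return JSON.parse(raw);
  } catch (err) {
    // Tier 1: fall back to an empty object instead of crashing the run.
    return {};
  }
}

// Basic null checks with `|| {}` and `|| ''` so a missing field never throws.
function buildSlackMessage(payload) {
  const issue = (payload || {}).issue || {};
  const title = issue.title || "";
  const url = issue.html_url || "";
  const author = (issue.user || {}).login || "unknown";
  return `New issue by ${author}: ${title} ${url}`.trim();
}

const raw = '{"issue":{"title":"Bug","html_url":"https://example.com/1","user":{"login":"ana"}}}';
console.log(buildSlackMessage(safeParse(raw)));        // New issue by ana: Bug https://example.com/1
console.log(buildSlackMessage(safeParse("not json"))); // New issue by unknown:
```

This is intentionally minimal: it keeps the workflow from failing on bad input but does not validate or log anything, which is what Tier 1 chooses to skip.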

Tier 2: Production / Client-Facing

Context signals: "client", "customer", "production", "deploy", "users will...", "business", "launch", "going live"

Escalation triggers: Debugging going in circles, silent failures, user mentions deployment, "works sometimes" issues

What to include:

Full entry-point validation (user exists? auth valid? data shaped right?)
Explicit null vs empty string handling
External logging to Supabase/Postgres
Proper HTTP status codes (400, 401, 403, 404, 500)
Error notifications to team (Slack/email)
Pre-deployment breaking tests
Test database before production

What to skip:

Real-time monitoring dashboards
Automated rollback
Rate limiting / idempotency (unless high volume)

Example workflows: "Customer form → CRM + email sequence", "Payment webhook → order fulfillment", "AI chatbot for client website"

Build time impact: ~80% extra beyond core logic (the 80/20 rule)
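
To make the entry-point validation idea concrete, here is a minimal sketch that checks auth, data shape, and user existence, and maps each failure to an HTTP status code. The request shape, the field names `userId` and `email`, and the function `validateRequest` are assumptions for illustration. A real workflow would look users up in its own data store and would also cover 403 and 500.

```javascript
// Validates an incoming request at the entry point (hypothetical shape:
// { headers: {...}, body: {...} }) and returns a status code plus detail.
function validateRequest(request, knownUserIds) {
  const headers = (request || {}).headers || {};
  const body = (request || {}).body;

  // Auth valid? Here only presence is checked; token verification is out of scope.
  if (!headers.authorization) {
    return { status: 401, error: "Missing Authorization header" };
  }

  // Data shaped right? Reject anything that is not a plain JSON object.
  if (body === null || typeof body !== "object" || Array.isArray(body)) {
    return { status: 400, error: "Body must be a JSON object" };
  }
  if (typeof body.userId !== "string" || body.userId === "") {
    return { status: 400, error: "userId must be a non-empty string" };
  }
  if (typeof body.email !== "string" || !body.email.includes("@")) {
    return { status: 400, error: "email must be a string containing @" };
  }

  // User exists?
  if (!knownUserIds.has(body.userId)) {
    return { status: 404, error: "Unknown user" };
  }

  return { status: 200, data: { userId: body.userId, email: body.email } };
}

const users = new Set(["u1"]);
const good = { headers: { authorization: "Bearer abc" }, body: { userId: "u1", email: "a@b.co" } };
console.log(validateRequest(good, users).status);                     // 200
console.log(validateRequest({ headers: {}, body: {} }, users).status); // 401
```

Returning a plain `{ status, error }` object keeps the check independent of any particular response node, so the same function can back both the workflow and its pre-deployment breaking tests.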

Tier 3: Mission-Critical / High-Volume

Context signals: "payment", "Stripe", "checkout", "HIPAA", "compliance", "SLA", "enterprise", "high volume", "can't fail", "audit"

Escalation triggers: Volume/performance questions, "what if X goes down?", financial transactions, compliance mentions

What to include:

Everything from Tier 2, plus:
Real-time monitoring dashboard (Grafana, Datadog)
Automated alerting with escalation (PagerDuty)
Idempotency keys for duplicate request handling
Rate limiting to prevent cascade failures
Request queuing for traffic spikes
Rollback strategy (workflow versioning, feature flags)
Audit logging for compliance
Regular chaos testing (simulate failures)
Documented runbooks for incident response

Example workflows: "Stripe payment → inventory + fulfillment + accounting", "HIPAA-compliant patient data sync", "High-traffic API gateway"

Build time impact: 2-3x the core logic development time
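
Of the Tier 3 items, idempotency keys are perhaps the easiest to show in a few lines. The sketch below assumes the caller supplies a unique key per logical request and remembers the result for that key, so a retried or duplicated request does not run the side effect twice. The name `createIdempotentHandler` is hypothetical. The in-memory `Map` is a stand-in: a real deployment would presumably need a shared, durable store with expiry.

```javascript
// Wraps a processing function so each idempotency key is processed at most once.
function createIdempotentHandler(processFn) {
  // In-memory store for illustration only; it is lost on restart and not shared.
  const results = new Map();

  return function handle(idempotencyKey, payload) {
    if (results.has(idempotencyKey)) {
      // Duplicate request: return the stored result without re-running processFn.
      return { replayed: true, result: results.get(idempotencyKey) };
    }
    const result = processFn(payload);
    results.set(idempotencyKey, result);
    return { replayed: false, result };
  };
}

let chargeCount = 0;
const charge = createIdempotentHandler((payload) => {
  chargeCount += 1;
  return { charged: payload.amount };
});

charge("order-42", { amount: 100 });
charge("order-42", { amount: 100 }); // duplicate delivery of the same webhook
console.log(chargeCount); // 1
```

This version is synchronous and does not guard against two identical requests arriving at the same moment. Handling that race would need an atomic "insert if absent" in the backing store.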

Tier Escalation Triggers Summary

You've outgrown Tier 1 when:

You're debugging the same workflow for the third time
You find yourself saying "it works sometimes"
You can't tell what data the workflow actually received
Someone other than you will use or depend on it
A failure would cause more than minor annoyance

You've outgrown Tier 2 when:

You're processing 10,000+ requests/day
You're handling payments or sensitive PII
You have contractual SLAs
A 1-hour outage would cause significant revenue loss or legal exposure
You're asking "what happens if [external service] goes down?"

Downgrade is okay too:

Built Tier 2 for a prototype, realized it's overkill? Strip it back.
The goal is right-sized investment, not maximum hardening.
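
The "outgrown Tier 2" list above can be sketched as a small checklist function. The thresholds come from the list. The profile field names and the function `outgrownTier2Reasons` are hypothetical, and treating any single match as enough to start the conversation is an assumption.

```javascript
// Returns the reasons (possibly none) why a workflow may have outgrown Tier 2.
function outgrownTier2Reasons(profile) {
  const p = profile || {};
  const reasons = [];
  if ((p.requestsPerDay || 0) >= 10000) {
    reasons.push("processing 10,000+ requests/day");
  }
  if (p.handlesPaymentsOrPii) {
    reasons.push("handling payments or sensitive PII");
  }
  if (p.hasContractualSla) {
    reasons.push("contractual SLAs");
  }
  if (p.outageIsCostly) {
    reasons.push("a 1-hour outage would cause significant revenue loss or legal exposure");
  }
  return reasons;
}

console.log(outgrownTier2Reasons({ requestsPerDay: 25000, hasContractualSla: true }));
// [ 'processing 10,000+ requests/day', 'contractual SLAs' ]
```

Returning the reasons rather than a bare boolean fits the recommendation style described earlier: state what was observed, then ask before upgrading.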

The 80/20 Rule by Tier

Component	Tier 1	Tier 2	Tier 3
Core logic	70%	20%	10%
Validation	10%	20%	15%
Error handling	10%	20%	20%
Logging	5%	20%	20%
Testing	5%	20%	15%
Monitoring/Ops	—	—	20%

The workflow logic is the easy part. The hard part is everything else — but only invest in "everything else" proportional to your risk.
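
As a worked example of the table above, the sketch below splits a total build budget across components using the table's percentages. The percentages are copied from the table. Reading them as shares of total effort, and the names `EFFORT_SHARE_PERCENT` and `allocateHours`, are assumptions for illustration.

```javascript
// Percentages per tier, copied from the table ("—" is treated as 0).
const EFFORT_SHARE_PERCENT = {
  1: { coreLogic: 70, validation: 10, errorHandling: 10, logging: 5, testing: 5, monitoringOps: 0 },
  2: { coreLogic: 20, validation: 20, errorHandling: 20, logging: 20, testing: 20, monitoringOps: 0 },
  3: { coreLogic: 10, validation: 15, errorHandling: 20, logging: 20, testing: 15, monitoringOps: 20 },
};

// Splits a total hour budget across components for the given tier.
function allocateHours(totalHours, tier) {
  const shares = EFFORT_SHARE_PERCENT[tier];
  if (!shares) {
    throw new RangeError(`Unknown tier: ${tier}`);
  }
  const hours = {};
  for (const [component, percent] of Object.entries(shares)) {
    hours[component] = (totalHours * percent) / 100;
  }
  return hours;
}

console.log(allocateHours(40, 2).coreLogic); // 8
console.log(allocateHours(40, 1).coreLogic); // 28
```

With a 40-hour budget, core logic gets 28 hours at Tier 1 but only 8 at Tier 2, which is the 80/20 split the section describes.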

Pre-Deployment Checklists

Tier 1 Checklist

Basic null/undefined checks on critical fields
Try-catch around external API calls
Error Trigger workflow sends Slack/email on failure
Tested manually with happy path

Tier 2 Checklist

Validation: Every input field is validated
Null handling: Explicitly handle `null` vs empty string vs undefined
Type checking: Verify data types match expectations
Logging: External logging configured at entry, decisions, output, errors
Error responses: Proper HTTP status codes for all failure modes
Error notifications: Team gets alerted on failures (Slack, email)
Empty data test: Workflow handles empty inputs gracefully
Wrong type test: Workflow rejects malformed data
Auth test: Missing/invalid auth returns 401/403
Downstream failure test: External service failures handled
Test database: All tests run against test environment first
Documentation: Workflow purpose and data flow documented
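
The "Null handling" and "Empty data test" items in the checklist above are easy to blur together, because `|| ''` treats `null`, `undefined`, and the empty string identically. The sketch below shows one way to keep them apart. The function `classifyField` and its return labels are hypothetical.

```javascript
// Distinguishes the three "nothing here" cases that `||` fallbacks collapse.
function classifyField(record, fieldName) {
  const value = (record || {})[fieldName];
  if (value === undefined) {
    return "undefined"; // field was never sent
  }
  if (value === null) {
    return "null"; // field was sent and explicitly cleared
  }
  if (value === "") {
    return "empty"; // field was sent as an empty string
  }
  return "present";
}

console.log(classifyField({ name: "" }, "name"));   // empty
console.log(classifyField({ name: null }, "name")); // null
console.log(classifyField({}, "name"));             // undefined
```

Which of the three cases counts as an error depends on the workflow. The point of the checklist item is that the choice is made explicitly instead of by accident.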

Tier 3 Checklist

Summary: Adaptive Tier Management

Ask once after initial prompt — let user pick tier or choose autopilot
Respect explicit tier choices — if they ask for a tier, build to it
Support tier lookups — explain tiers when asked
Infer silently in autopilot — don't mention tiers unless recommending a change
Monitor for tier change triggers — both UP (more hardening needed) and DOWN (over-engineered)
Recommend, don't mandate — ask permission, respect the answer
Trust user judgment — but flag serious concerns even if they decline

The goal: Users who know the tiers get control. Users who don't know them get invisible guidance. The AI adapts to the user, the task, and the project, recommending more hardening when complexity demands it and simplification when the hardening is getting in the way.
