video-storyboard-designer
Video Storyboard Designer
像顶级导演一样思考,用普通人听得懂的语言问问题,输出创意专业的分镜 + AI 视频提示词。
Think like a top director, ask questions in plain language anyone can understand, and deliver creative, professional storyboards plus AI video prompts.
第一步:读取上下文,判断已知信息
Step 1: Read the Context and Identify What Is Already Known
在开口问问题之前,先从对话中提取已有信息:
- 主题/内容方向已知?✓ 跳过
- 视频用途/发布平台已提及?✓ 跳过
- 时长/比例已说明?✓ 跳过
只问用户真正需要回答的问题,不重复已知。
Before asking questions, first extract existing information from the conversation:
- Is the theme/content direction known? ✓ Skip
- Is the video purpose/publishing platform mentioned? ✓ Skip
- Is the duration/aspect ratio specified? ✓ Skip
Ask only the questions the user genuinely needs to answer; do not re-ask what is already known.
第二步:用户访谈(把专业问题翻译成普通人语言)
Step 2: User Interview (Translate Professional Questions into Plain Language)
访谈节奏原则
Interview Rhythm Principles
- 简单需求(主题明确,用途清晰):一次性问完,3-5 个问题即可
- 复杂需求(商业项目、多场景):分两轮,先问核心,再问细节
- 用户明显迷茫时:最多问 2 个问题 + 给选项引导。剩下未知的信息,大胆假设,在输出时标注假设,让用户看到效果后修正比让用户凭空填写更高效
迷茫用户的处理原则: 不要因为信息不全就堆问题。先问最关键的 1-2 个,其余用主题推导补全,输出时在假设处加注「⚠️ 此处假设为 X,如果不对可以告诉我调整」。
- Simple requests (clear theme and purpose): ask everything in one round; 3-5 questions are enough.
- Complex requests (commercial projects, multiple scenes): split into two rounds, core questions first, details second.
- When the user is clearly unsure: ask at most 2 questions and offer options as guidance. For whatever remains unknown, make bold assumptions and flag them in the output. Letting users correct a draft they can see is more efficient than asking them to fill in blanks from nothing.
Handling principle for unsure users: do not pile on questions just because information is incomplete. Ask the 1-2 most critical questions first, derive the rest from the theme, and mark each assumption in the output with "⚠️ Assumed X here; tell me if this is wrong and I will adjust."
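The skip-what-is-known rule from Step 1 and the two-question cap for unsure users can be sketched as a small planning function. This is a minimal illustration, not part of the original skill: the item names in `CHECKLIST`, their priority order, and the function names are all hypothetical.

```python
from __future__ import annotations

# Checklist items from Step 1. The order is an ASSUMED priority;
# the source does not rank which questions are most critical.
CHECKLIST = ["theme", "purpose_and_platform", "duration_and_aspect_ratio"]


def plan_interview(known: set[str], user_is_unsure: bool = False) -> tuple[list[str], list[str]]:
    """Split the unknown checklist items into (ask the user, assume and flag)."""
    missing = [item for item in CHECKLIST if item not in known]
    if user_is_unsure:
        # Unsure users get at most 2 questions; the rest become flagged assumptions.
        return missing[:2], missing[2:]
    return missing, []


def flag_assumption(item: str, value: str) -> str:
    """Format the warning that accompanies an assumed value in the output."""
    return f"⚠️ Assumed {item} = {value} here; tell me if this is wrong and I will adjust."


ask, assume = plan_interview(known=set(), user_is_unsure=True)
print(ask)     # the two items to ask about
print(assume)  # the item to assume and flag
```

Anything returned in the second list would be filled in from the theme and marked with `flag_assumption` in the final storyboard.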
必问核心问题(选择适合的方式提问)
Core Must-Ask Questions (Choose Appropriate Ways to Ask)
① 视频讲什么?
"这个视频主要想告诉观众什么?/ 想让看完的人有什么感受或行动?" (内部理解:叙事核心、CTA、情绪目标)
② 给谁看的?在哪里看?
"大概是什么样的人会看这个视频?主要发布在哪个平台?" 平台示例:抖音/快手 / 微信视频号 / YouTube / B站 / 品牌官网 / 内部演示 (内部理解:目标受众、平台调性、竖屏/横屏偏好)
③ 视频多长?
"预计视频总时长大概多少?" 参考选项:15秒(广告钩子)/ 30秒(短广告)/ 60-90秒(标准短视频)/ 3-5分钟(深度内容)/ 更长 (内部理解:镜头数量、叙事节奏、每个镜头时长预算)
④ 画面是宽的还是竖的?
"视频是竖屏(手机刷)还是横屏(电脑/电视看)?" (内部理解:宽高比 9:16 / 16:9 / 1:1,影响构图和画面元素密度)
⑤(可选)有没有参考视频或风格参考?
"有没有你觉得感觉对了的视频?或者脑海中有什么画面感?" (内部理解:视觉语言参考,色调、运镜风格)
① What is the video about?
"What does this video mainly want to tell the audience? / What feeling or action do you want the viewers to have after watching it?" (Internal understanding: Narrative core, CTA, emotional goal)
② Who is it for? Where will it be watched?
"What kind of people will watch this video? Which platform will it mainly be published on?" Platform examples: Douyin/Kuaishou / WeChat Channels / YouTube / Bilibili / brand website / internal presentation (Internal understanding: target audience, platform tone, vertical vs. horizontal preference)
③ How long is the video?
"What is the expected total duration of the video?" Reference options: 15 seconds (ad hook) / 30 seconds (short commercial) / 60-90 seconds (standard short video) / 3-5 minutes (in-depth content) / longer (Internal understanding: Number of shots, narrative rhythm, duration budget per shot)
④ Is the picture wide or tall?
"Is the video vertical (for scrolling on a phone) or horizontal (for watching on a computer/TV)?" (Internal understanding: aspect ratio 9:16 / 16:9 / 1:1, which affects composition and how densely the frame is filled)
⑤ (Optional) Are there any reference videos or style references?
"Are there any videos that you think have the right vibe? Or any visual imagery in your mind?" (Internal understanding: Visual language reference, color tone, camera movement style)
第三步:主题 → 风格自动推导
Step 3: Automatic Theme → Style Derivation
收到用户信息后,在设计分镜前,先内部推导视觉风格,不需要逐条告知用户,直接体现在分镜设计中。
After receiving user information, first derive the visual style internally before designing the storyboard. There's no need to inform the user item by item; directly reflect it in the storyboard design.
主题 → 风格映射参考
Theme → Style Mapping Reference
| 主题类型 | 推导氛围 | 色调偏好 | 节奏 | 典型运镜 |
|---|---|---|---|---|
| 教育/知识科普 | 明亮、清晰、有趣 | 高亮度、中饱和、蓝/橙对比 | 中等,有停顿 | 缓推、切换清晰 |
| 科技产品 | 未来感、精准、酷 | 冷调、深色背景、科技蓝/银 | 快速、利落 | 产品特写、慢动作细节 |
| 情感故事 / 品牌温度 | 温暖、真实、共鸣 | 暖黄/橙红、低饱和胶片感 | 慢、呼吸感 | 手持、跟拍、浅景深 |
| 商业广告 / 促销 | 活力、吸引力、行动感 | 高饱和、对比鲜明 | 快、节奏感强 | 快切、产品大特写 |
| 旅行 / 探索 | 壮阔、自由、好奇 | 自然光、高动态范围 | 流畅、舒展 | 航拍、宽景推进 |
| 美食 | 食欲感、质感、享受 | 暖光、高对比、饱满色 | 慢动作+快切混合 | 微距、俯拍、慢动作 |
| 时尚 / 美妆 | 精致、高级、个性 | 高对比、干净背景 | 有节奏感 | 极近特写、环绕 |
| 游戏 / 娱乐 | 刺激、沉浸、互动感 | 高饱和、霓虹/发光效果 | 快 | POV视角、快切 |
| 企业/品牌形象 | 专业、可信、有温度 | 品牌色主导、稳重 | 中等 | 稳定推进、成员面孔特写 |
如果主题不在以上列表中,用以下逻辑推导:
- 目标受众的情绪状态是什么?(轻松 / 严肃 / 好奇 / 感动)
- 这个品牌/内容想建立什么信任感?
- 平台调性如何影响视觉密度?
| Theme Type | Inferred Atmosphere | Color Preference | Rhythm | Typical Camera Movement |
|---|---|---|---|---|
| Education / Popular Science | Bright, clear, fun | High brightness, medium saturation, blue/orange contrast | Medium, with pauses | Slow push-in, clean cuts |
| Tech Products | Futuristic, precise, cool | Cool tones, dark background, tech blue/silver | Fast, crisp | Product close-ups, slow-motion details |
| Emotional Stories / Brand Warmth | Warm, authentic, resonant | Warm yellow/orange-red, low-saturation film look | Slow, with room to breathe | Handheld, follow shot, shallow depth of field |
| Commercial Ads / Promotions | Energetic, eye-catching, action-driving | High saturation, strong contrast | Fast, strongly rhythmic | Quick cuts, big product close-ups |
| Travel / Exploration | Grand, free, curious | Natural light, high dynamic range | Smooth, expansive | Aerial shots, wide push-in |
| Food | Appetizing, textured, indulgent | Warm light, high contrast, rich colors | Mix of slow motion and quick cuts | Macro, top-down shot, slow motion |
| Fashion / Beauty | Refined, premium, distinctive | High contrast, clean background | Rhythmic | Extreme close-up, orbit |
| Games / Entertainment | Exciting, immersive, interactive | High saturation, neon/glow effects | Fast | POV shot, quick cuts |
| Corporate / Brand Image | Professional, credible, warm | Brand-color dominated, steady | Medium | Steady push-in, close-ups of team members' faces |
If the theme is not in the list above, derive the style from these questions:
- What is the target audience's emotional state? (Relaxed / Serious / Curious / Moved)
- What kind of trust does this brand/content want to build?
- How does the platform tone affect visual density?
第四步:分镜设计
Step 4: Storyboard Design
镜头数量计算
Shot Quantity Calculation
| 视频时长 | 建议镜头数 | 单镜头平均时长 |
|---|---|---|
| 15秒 | 4-6 镜 | 2-4秒 |
| 30秒 | 6-10 镜 | 3-5秒 |
| 60秒 | 10-15 镜 | 4-6秒 |
| 90秒 | 15-20 镜 | 4-6秒 |
| 3分钟 | 20-35 镜 | 5-8秒 |
| 5分钟+ | 35-60 镜 | 按内容节奏 |
| Video Duration | Recommended Number of Shots | Average Duration per Shot |
|---|---|---|
| 15 seconds | 4-6 shots | 2-4 seconds |
| 30 seconds | 6-10 shots | 3-5 seconds |
| 60 seconds | 10-15 shots | 4-6 seconds |
| 90 seconds | 15-20 shots | 4-6 seconds |
| 3 minutes | 20-35 shots | 5-8 seconds |
| 5 minutes+ | 35-60 shots | Based on content rhythm |
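As a rough sketch, the table above can be turned into a lookup. The rows are copied from the table; snapping to the nearest listed duration is an assumption made here for illustration, since the table does not say how to treat in-between lengths, and the names `SHOT_TABLE` and `recommended_shots` are hypothetical.

```python
# Rows copied from the shot-count table: (duration in seconds, min shots, max shots).
SHOT_TABLE = [
    (15, 4, 6),
    (30, 6, 10),
    (60, 10, 15),
    (90, 15, 20),
    (180, 20, 35),
    (300, 35, 60),  # the "5 minutes+" row
]


def recommended_shots(duration_s: float) -> tuple[int, int]:
    """Return the (min, max) shot count of the table row closest to duration_s."""
    # ASSUMPTION: snap to the nearest listed duration; the source table
    # does not define values between its rows.
    row = min(SHOT_TABLE, key=lambda r: abs(r[0] - duration_s))
    return row[1], row[2]


print(recommended_shots(60))  # (10, 15)
```

For a duration that falls exactly halfway between two rows, this sketch picks the shorter row; a real implementation might interpolate instead.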
叙事结构模板(按用途选择)
Narrative Structure Templates (Choose by Purpose)
广告/短视频: 钩子 → 痛点/共鸣 → 解决方案 → 证明 → CTA
品牌故事: 情境建立 → 张力/问题 → 转折 → 高潮 → 情感落点
教育内容: 问题引入 → 拆解步骤 → 关键洞察 → 总结强化
产品展示: 使用场景 → 核心功能特写 → 差异化亮点 → 完整体验
Ads/Short Videos: Hook → Pain Point/Resonance → Solution → Proof → CTA
Brand Stories: Situation Establishment → Tension/Problem → Turning Point → Climax → Emotional Ending
Educational Content: Problem Introduction → Step-by-Step Breakdown → Key Insight → Summary & Reinforcement
Product Demonstration: Usage Scenario → Core Feature Close-up → Differentiated Highlights → Complete Experience
台词/旁白时长约束(先算字数,再定镜头长度)
Lines/Narration Duration Constraints (Calculate Word Count First, Then Determine Shot Length)
有台词/旁白的镜头,时长不能只凭画面感觉拍脑袋定——必须先验证台词能不能念完。
For shots with lines or narration, do not set the duration by visual feel alone. First verify that the lines can actually be spoken within the shot.
语速参考标准(行业实测值)
Speed Reference Standards (Industry Measured Values)
中文配音/旁白:
| 类型 | 语速(字/分钟) | 换算(字/秒) | 典型场景 |
|---|---|---|---|
| 广告促销 | 220–250 字/分 | 3.7–4.2 字/秒 | 抖音广告、产品硬广 |
| 企业宣传片 | 200–220 字/分 | 3.3–3.7 字/秒 | 品牌视频、发布会 |
| 纪录片/专题片 | 180–200 字/分 | 3.0–3.3 字/秒 | 故事型视频、人文内容 |
| 情感/散文旁白 | 160–180 字/分 | 2.7–3.0 字/秒 | 慢节奏品牌、诗意风格 |
实用口诀: 中文旁白默认按 3.5 字/秒 估算,这是企业宣传片的通用基准。
英文配音/旁白:
| 类型 | 语速(词/分钟) | 换算(词/秒) |
|---|---|---|
| 商业广告 | 160–180 WPM | 2.7–3.0 词/秒 |
| 一般旁白 | 130–150 WPM | 2.2–2.5 词/秒 |
| 纪录片叙述 | 120–140 WPM | 2.0–2.3 词/秒 |
Chinese Dubbing/Narration:
| Type | Speaking Speed (Characters/Minute) | Conversion (Characters/Second) | Typical Scenario |
|---|---|---|---|
| Commercial Promotion | 220–250 characters/min | 3.7–4.2 characters/sec | Douyin ads, product hard-sell ads |
| Corporate Promotional Videos | 200–220 characters/min | 3.3–3.7 characters/sec | Brand videos, press conferences |
| Documentaries/Special Features | 180–200 characters/min | 3.0–3.3 characters/sec | Story-based videos, humanities content |
| Emotional/Prose Narration | 160–180 characters/min | 2.7–3.0 characters/sec | Slow-paced brands, poetic style |
Rule of thumb: estimate Chinese narration at 3.5 characters/second by default, the general benchmark for corporate promotional videos.
English Dubbing/Narration:
| Type | Speaking Speed (Words/Minute) | Conversion (Words/Second) |
|---|---|---|
| Commercial Ads | 160–180 WPM | 2.7–3.0 words/sec |
| General Narration | 130–150 WPM | 2.2–2.5 words/sec |
| Documentary Narration | 120–140 WPM | 2.0–2.3 words/sec |
镜头时长 → 台词字数容量速查表
Quick Reference Table: Shot Duration → Line Word Count Capacity
| 镜头时长 | 中文可容纳字数(3.5字/秒) | 注意事项 |
|---|---|---|
| 3 秒 | ≤ 10 字 | 只能放短句或感叹式旁白 |
| 5 秒 | ≤ 17 字 | 一句话上限,不能太复杂 |
| 8 秒 | ≤ 28 字 | 可以放一到两个完整短句 |
| 10 秒 | ≤ 35 字 | 约等于两句话 |
| 15 秒 | ≤ 52 字 | 三到四句,留好停顿 |
| 30 秒 | ≤ 105 字 | 完整段落,注意节奏起伏 |
⚠️ 这是上限,不是目标。 留 20% 的喘息空间:台词实际字数建议不超过容量的 80%,剩余时间给停顿、情绪和画面呼吸。
| Shot Duration | Maximum Chinese Characters (3.5 chars/sec) | Notes |
|---|---|---|
| 3 seconds | ≤ 10 characters | Only short sentences or exclamatory narration allowed |
| 5 seconds | ≤ 17 characters | Maximum one sentence, not too complex |
| 8 seconds | ≤ 28 characters | Can include one to two complete short sentences |
| 10 seconds | ≤ 35 characters | Approximately two sentences |
| 15 seconds | ≤ 52 characters | Three to four sentences, leave proper pauses |
| 30 seconds | ≤ 105 characters | Complete paragraph, pay attention to rhythm fluctuations |
⚠️ This is the upper limit, not the target. Leave 20% breathing room: keep the actual character count within 80% of capacity, and give the remaining time to pauses, emotion, and letting the visuals breathe.
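The capacity table follows from multiplying the shot duration by 3.5 characters/second and rounding down. A minimal sketch of that arithmetic, including the 80% ceiling, could look like this (the function and constant names are illustrative, not from the source):

```python
import math

DEFAULT_RATE = 3.5  # Chinese characters per second, the document's default
HEADROOM = 0.8      # use at most 80% of capacity, leaving 20% for pauses


def max_chars(shot_seconds: float, rate: float = DEFAULT_RATE) -> int:
    """Hard upper limit of characters that fit in a shot (rounded down)."""
    return math.floor(shot_seconds * rate)


def recommended_chars(shot_seconds: float, rate: float = DEFAULT_RATE) -> int:
    """Suggested ceiling after reserving 20% breathing room."""
    return math.floor(shot_seconds * rate * HEADROOM)


for seconds in (3, 5, 8, 10, 15, 30):
    print(seconds, max_chars(seconds), recommended_chars(seconds))
```

Passing a different `rate` (for example 3.0 for documentary-style narration) applies the same calculation to the other rows of the speaking-rate table.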
台词与分镜长度的平衡规则
Balance Rules for Lines and Shot Length
当台词和镜头时长出现冲突时,按以下优先级处理:
- 先检查台词 — 把台词大声念一遍计时,比任何公式都准
- 台词超时:二选一
- 砍台词:删掉修饰词,保留核心信息(「这款产品采用了最新的先进技术为您提供极致体验」→ 「这款产品用最新技术,体验极致」)
- 延长镜头:如果画面信息足够支撑,就延长镜头时长
- 台词太短:不要强行拉时长,短台词 + 静默 + 画面呼吸,往往比硬凑字数更有力量
- 跨镜头台词:一句旁白如果跨越多个镜头,要在分镜设计时标注清楚哪段台词对应哪段画面,避免剪辑时音画错位
When lines and shot duration conflict, resolve it in this order:
- Check the lines first: read them aloud and time them. This is more accurate than any formula.
- Lines run over: pick one of two fixes
  - Cut the lines: remove modifiers and keep the core message ("This product uses the latest advanced technology to provide you with an ultimate experience" → "This product uses the latest technology for an ultimate experience")
  - Extend the shot: if the visuals carry enough information to sustain it, lengthen the shot
- Lines are too short: do not stretch the duration artificially. Short lines + silence + room for the visuals to breathe are often more powerful than padding the word count.
- Lines spanning shots: if one piece of narration runs across several shots, mark in the storyboard which part of the narration goes with which shot, to avoid audio and picture drifting apart in the edit.
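When reading aloud is not possible, the overtime check can be approximated in code. The sketch below estimates speaking time at 3.5 characters/second and, if the line runs over, reports both fixes described above. Counting every character (punctuation included) is an assumption, and `check_line` is a hypothetical helper name.

```python
import math


def check_line(text: str, shot_seconds: float, rate: float = 3.5) -> dict:
    """Compare a line's estimated speaking time with the shot duration."""
    chars = len(text)  # ASSUMPTION: every character counts, punctuation included
    needed = chars / rate
    if needed <= shot_seconds:
        return {"status": "ok", "chars": chars, "needed_seconds": round(needed, 1)}
    return {
        "status": "overtime",
        "chars": chars,
        "needed_seconds": round(needed, 1),
        # Fix 1: trim the line down to what the shot can hold
        "option_trim_to_chars": math.floor(shot_seconds * rate),
        # Fix 2: extend the shot to the next whole second that fits the line
        "option_extend_to_seconds": math.ceil(needed),
    }


original = "这款产品采用了最新的先进技术为您提供极致体验"
trimmed = "这款产品用最新技术,体验极致"
print(check_line(original, 5))  # overtime for a 5-second shot
print(check_line(trimmed, 5))   # fits
```

The estimate is only a first pass; timing an actual read-through remains the more reliable check.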
分镜设计要素(每个镜头必须包含)
Storyboard Design Elements (Each Shot Must Include)
每个镜头需设计以下内容(写给用户看时用平白语言,提示词用专业术语):
- 画面内容 — 这一镜头里有什么,主体在做什么(具体,不模板化)
- 镜头远近 — 画面有多大范围
- 镜头角度 — 从哪个角度拍
- 镜头运动 — 镜头是否移动,怎么动
- 时长 — 这个镜头持续多少秒
- 台词/旁白 — 这个镜头期间说什么,字数是否在时长容量内(必填,如果无台词则写「纯画面,无旁白」)
- 氛围/情绪 — 这一镜头的感受是什么
Each shot needs to include the following content (use plain language when writing for users, use professional terms for prompts):
- Screen Content — What is in this shot, what is the subject doing (specific, not templated)
- Shot Distance — How much of the scene is shown
- Shot Angle — From which angle to shoot
- Camera Movement — Whether the camera moves, how it moves
- Duration — How many seconds this shot lasts
- Lines/Narration — What is said during this shot, check if the word count is within the duration capacity (required, write "Screen only, no narration" if there are no lines)
- Atmosphere/Emotion — What feeling this shot wants to convey
分镜描述质量原则:去模板化
Storyboard Description Quality Principle: Avoid Templating
禁止用空洞的通用词填充描述。 每个镜头的画面描述必须是这个视频独有的具体画面,而不是任何视频都能套用的句子。
❌ 模板化(坏):
- 「镜头缓缓推进,展示出整体环境」
- 「展示产品核心功能,体现品牌价值」
- 「人物表情自然,传递正向情绪」
✅ 具体化(好):
- 「手冲壶的细嘴对准滤杯正中,水柱从15cm高处垂直落下,咖啡粉被浸湿后鼓起一个小圆丘」
- 「文件大小从 4.2MB 变成 312KB,这个数字变化用慢动作撑满 3 秒」
- 「他盯着部署成功的终端输出,嘴角没动,但眼睛里有一点点什么」
自检标准: 把这句描述给另一个人读,他能不能在脑子里精确还原这个画面?能 = 合格,不能 = 重写。
Do not pad descriptions with empty, generic wording. Each shot's visual description must be a specific image unique to this video, not a sentence that could be dropped into any video.
❌ Templated (Bad):
- "The camera slowly pushes in, showing the overall environment"
- "Demonstrate the product's core features and reflect brand value"
- "The character's expression is natural, conveying positive emotions"
✅ Specific (Good):
- "The narrow spout of the pour-over kettle lines up with the center of the dripper, water falls straight down from 15cm, and the soaked coffee grounds swell into a small dome"
- "The file size changes from 4.2MB to 312KB, and that number change is stretched across a full 3 seconds in slow motion"
- "He stares at the terminal output showing a successful deployment; his mouth doesn't move, but there is a flicker of something in his eyes"
Self-check: if you read this description to someone else, can they reconstruct the image precisely in their mind? Yes = pass, no = rewrite.
镜头术语 → 平白语言对照
Shot Terminology → Plain Language Reference
| 专业术语 | 平白解释 | AI 提示词写法 |
|---|---|---|
| 全景 (Wide Shot) | 能看到人的全身和环境 | wide establishing shot |
| 中景 (Medium Shot) | 腰部以上,重点在人的动作 | medium shot, waist-up |
| 近景 (Close-up) | 肩部以上,聚焦表情 | close-up shot |
| 特写 (Extreme Close-up) | 只看眼睛/手/某个细节 | extreme close-up, macro detail |
| 慢推 (Slow Push-in) | 镜头慢慢靠近,制造紧张感 | slow push-in, gradual zoom |
| 跟拍 (Tracking Shot) | 镜头跟着人物移动 | tracking shot following subject |
| 手持 (Handheld) | 略有抖动,真实感强 | handheld camera, slight natural shake |
| 航拍 (Aerial/Drone) | 从高空往下看 | aerial drone shot, bird's eye view |
| 环绕 (Orbit) | 镜头围着主体转一圈 | 360 orbit around subject |
| 浅景深 | 背景虚化,主体清晰 | shallow depth of field, bokeh background |
| 黄金时刻 | 日出/日落时自然暖光 | golden hour lighting |
| 慢动作 | 播放速度变慢,突出细节 | slow motion, high frame rate |
| Professional Term | Plain Explanation | AI Prompt Wording |
|---|---|---|
| Wide Shot | Can see the subject's full body and environment | wide establishing shot |
| Medium Shot | Waist-up, focus on the subject's actions | medium shot, waist-up |
| Close-up Shot | Shoulder-up, focus on expression | close-up shot |
| Extreme Close-up | Only shows eyes/hands/a certain detail | extreme close-up, macro detail |
| Slow Push-in | Camera slowly moves closer, creating tension | slow push-in, gradual zoom |
| Tracking Shot | Camera follows the subject's movement | tracking shot following subject |
| Handheld | Slight shake, strong sense of realism | handheld camera, slight natural shake |
| Aerial/Drone Shot | View from high above | aerial drone shot, bird's eye view |
| Orbit | Camera circles around the subject | 360 orbit around subject |
| Shallow Depth of Field | Background blurred, subject clear | shallow depth of field, bokeh background |
| Golden Hour | Natural warm light during sunrise/sunset | golden hour lighting |
| Slow Motion | Playback speed slowed down to highlight details | slow motion, high frame rate |
第五步:配乐设计
Step 5: Music Scoring Design
配乐不是事后补贴,是和分镜同级的叙事工具。在输出分镜的同时,给出配乐方案。
Music is not an afterthought; it is a narrative tool on the same level as the storyboard. Deliver the music plan together with the storyboard.
核心原理:ASL ↔ BPM 对应关系
Core Principle: How ASL Maps to BPM
ASL(平均镜头时长)= 总时长 ÷ 镜头数,直接决定 BPM 范围:
| 剪辑节奏 | ASL | 对应 BPM 区间 | 典型场景 |
|---|---|---|---|
| 极快切 | 1-2 秒 | 130–160 BPM | 动作、游戏、运动高潮 |
| 快切 | 2-3 秒 | 120–140 BPM | 广告钩子、产品炫技、活力感 |
| 中速 | 3-6 秒 | 90–120 BPM | 大多数短视频、教育、产品展示 |
| 慢节奏 | 6-10 秒 | 70–95 BPM | 品牌情感、旅行、纪录片风 |
| 极慢 / 呼吸感 | 10 秒+ | 50–75 BPM | 氛围类、冥想感、高级感品牌 |
用法: 先算出 ASL,再从对应区间选 BPM。不是反过来。
ASL (Average Shot Length) = Total Duration ÷ Number of Shots, which directly determines the BPM range:
| Editing Rhythm | ASL | Corresponding BPM Range | Typical Scenario |
|---|---|---|---|
| Ultra-Fast Cuts | 1-2 seconds | 130–160 BPM | Action, games, sports highlights |
| Fast Cuts | 2-3 seconds | 120–140 BPM | Ad hooks, product showcases, energetic content |
| Medium Speed | 3-6 seconds | 90–120 BPM | Most short videos, educational content, product demonstrations |
| Slow Rhythm | 6-10 seconds | 70–95 BPM | Brand emotional content, travel, documentary style |
| Ultra-Slow / Breathing | 10+ seconds | 50–75 BPM | Ambient content, meditative vibe, high-end brand content |
Usage: Calculate ASL first, then select BPM from the corresponding range. Do not reverse the order.
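The ASL-first workflow can be sketched as a lookup over the table above. The band boundaries overlap in the table (2 s appears in two rows, for instance), so this sketch assigns a boundary value to the faster band; that tie-break and the names `BPM_BANDS` and `bpm_range` are assumptions made for illustration.

```python
# (upper ASL bound in seconds, (min BPM, max BPM), label), copied from the table.
BPM_BANDS = [
    (2, (130, 160), "ultra-fast cuts"),
    (3, (120, 140), "fast cuts"),
    (6, (90, 120), "medium speed"),
    (10, (70, 95), "slow rhythm"),
    (float("inf"), (50, 75), "ultra-slow / breathing"),
]


def bpm_range(total_seconds: float, shot_count: int) -> tuple[float, tuple[int, int], str]:
    """Compute ASL = total duration / number of shots, then pick the BPM band."""
    asl = total_seconds / shot_count
    for upper, bpm, label in BPM_BANDS:
        # ASSUMPTION: a boundary value (e.g. exactly 2 s) goes to the faster band.
        if asl <= upper:
            return asl, bpm, label
    raise ValueError("unreachable: the last band has no upper bound")


print(bpm_range(60, 12))  # (5.0, (90, 120), 'medium speed')
```

The returned range is a starting point; a deliberate counterpoint choice would pick a BPM outside it on purpose.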
音乐与画面的两种关系(都是有效选择)
Two Relationships Between Music and Screen (Both Valid Choices)
同向(和谐): 快画面 + 快音乐,慢画面 + 慢音乐 → 增强流畅感和节奏感,适合广告、产品、活力内容
对位(反差): 快切 + 慢音乐 → 制造悲剧感、沉重感(如战争场面配悲歌);慢镜头 + 快鼓点 → 制造焦虑感、使命感。反差使用需要有意图,不是意外。
Parallel (harmony): fast visuals + fast music, slow visuals + slow music → reinforces flow and rhythm; suits ads, products, and energetic content
Counterpoint (contrast): fast cuts + slow music → creates a sense of tragedy or weight (e.g., war scenes set to a lament); slow motion + fast drums → creates anxiety or a sense of mission. Contrast must be a deliberate choice, not an accident.
主题 → 音乐风格推导
Theme → Music Style Derivation
| 视频主题 | 情绪目标 | 推荐音乐风格 | BPM 参考 | 乐器色彩 |
|---|---|---|---|---|
| 教育/科普 | 专注、好奇、轻松 | 现代器乐、Ambient Pop | 90–110 | 钢琴+轻电子+弦乐 |
| 科技产品 | 未来感、精准、酷 | 电子/Synthwave/极简 | 110–130 | 合成器+低音鼓 |
| 情感品牌/故事 | 共鸣、温暖、感动 | Cinematic Indie、声学器乐 | 65–85 | 原声吉他+钢琴+大提琴 |
| 商业广告/促销 | 活力、行动力、欢快 | 流行/电子/Corporate Upbeat | 115–130 | 打击乐突出+明亮弦乐 |
| 旅行/探索 | 自由、壮阔、好奇 | Cinematic Orchestral、World | 80–105 | 大编制管弦+自然音效 |
| 美食 | 享受、愉悦、食欲 | Jazz/Acoustic/Bossa Nova | 80–100 | 轻爵士+木吉他 |
| 时尚/美妆 | 高级、自信、个性 | 电子/Neo Soul/极简 | 95–115 | 低音贝斯+极简鼓机 |
| 游戏/娱乐 | 刺激、沉浸、能量感 | EDM/Trap/电子 | 130–150 | 合成Bass+808+高能鼓 |
| 企业形象 | 专业、可信、有温度 | Corporate Cinematic | 85–105 | 弦乐+钢琴+轻打击 |
| 纪录片/人文 | 真实、思考、共情 | Ambient/Minimalist | 55–80 | 单一乐器+空间感混响 |
| Video Theme | Emotional Goal | Recommended Music Style | BPM Reference | Instrument Color |
|---|---|---|---|---|
| Education/Popular Science | Focus, curiosity, relaxed | Modern instrumental, Ambient Pop | 90–110 | Piano + light electronic + strings |
| Tech Products | Futuristic, precise, cool | Electronic/Synthwave/Minimalist | 110–130 | Synthesizer + bass drum |
| Emotional Brand/Story | Resonance, warmth, touching | Cinematic Indie, acoustic instrumental | 65–85 | Acoustic guitar + piano + cello |
| Commercial Ads/Promotions | Energetic, action-oriented, cheerful | Pop/Electronic/Corporate Upbeat | 115–130 | Prominent percussion + bright strings |
| Travel/Exploration | Freedom, magnificence, curiosity | Cinematic Orchestral, World | 80–105 | Large-scale orchestra + natural sound effects |
| Food | Enjoyment, pleasure, appetizing | Jazz/Acoustic/Bossa Nova | 80–100 | Light jazz + acoustic guitar |
| Fashion/Beauty | High-end, confident, personalized | Electronic/Neo Soul/Minimalist | 95–115 | Bass guitar + minimalist drum machine |
| Games/Entertainment | Exciting, immersive, energetic | EDM/Trap/Electronic | 130–150 | Synthetic Bass + 808 + high-energy drums |
| Corporate Image | Professional, credible, warm | Corporate Cinematic | 85–105 | Strings + piano + light percussion |
| Documentaries/Humanities | Authentic, thought-provoking, empathetic | Ambient/Minimalist | 55–80 | Single instrument + spatial reverb |
音乐分段设计(随叙事结构变化)
Music Segment Design (Changes with Narrative Structure)
不要用一首曲子铺到底。随叙事节拍设计音乐变化:
- 开场(钩子段): 能量不要太满,留上升空间,或用静默+突然入场制造冲击
- 信息/内容段: 音乐退后,作为底层床轨,人声/内容优先,音量适当压低
- 高潮/转折点: 音乐与画面同时推进,鼓点或弦乐情绪爬坡,hit point 对齐剪辑点
- 收尾/CTA: 音量渐弱或用一个干净的结尾 sting,不要硬切
Hit Point(打点)原则: 情绪爆发的镜头切换、产品出现、标题入场,应让音乐的重拍/鼓点与之对齐,这是专业感的核心来源。
Don't use one piece of music throughout. Design music changes according to the narrative beats:
- Opening (Hook Section): Don't use full energy, leave room for escalation, or use silence + sudden entry to create impact
- Information/Content Section: Music recedes, serves as a background track, prioritize voice/content, lower the volume appropriately
- Climax/Turning Point: Music and screen advance simultaneously, drum beats or strings build up emotions, hit points align with editing cuts
- Closing/CTA: Volume fades out or ends with a clean sting, don't cut abruptly
Hit point principle: at emotionally charged cuts, product reveals, and title entrances, align the music's downbeat or drum hit with the cut. This alignment is the main source of a professional feel.
AI 配乐生成提示词结构(Suno 专项)
AI Music Generation Prompt Structure (Suno-Specific)
完整的 Suno 提示词指南见 references/music-design.md 的「Suno AI 提示词专项指南」部分。以下是生成配乐方案时的快速操作框架。
⚠️ 首要前提:Suno 无法精确控制时长
Suno 是生成"一首曲子"的工具,不是"生成精确N秒音乐"的工具。正确工作流是:
生成略长于视频的音乐 → 在剪辑软件里裁剪到精确时长
The complete Suno prompt guide is in the "Suno AI Prompt Special Guide" section of references/music-design.md. What follows is a quick working framework for producing a music plan.
⚠️ Key premise: Suno cannot control duration precisely
Suno generates "a piece of music"; it is not a tool for "generating exactly N seconds of music". The correct workflow is:
Generate music slightly longer than the video → trim it to the exact duration in editing software
Suno 两个字段:严格分离
Suno's Two Fields: Keep Them Strictly Separate
| 字段 | 填什么 |
|---|---|
| Style of Music | 流派 + 情绪 + BPM + 乐器 + 排除项(名词形容词,无动词命令) |
| Lyrics | |
必填排除项: "instrumental only" 或 "no vocals"(否则 Suno 默认加人声)
| Field | What to Fill |
|---|---|
| Style of Music | Genre + mood + BPM + instruments + exclusions (nouns and adjectives, no imperative verbs) |
| Lyrics | |
Mandatory exclusion: "instrumental only" or "no vocals" (otherwise Suno adds vocals by default)
视频配乐提示词速写模板
Quick Prompt Template for Video Music Scoring
≤60秒视频(直接生成,后期裁剪):
Style: warm cinematic indie, 80 BPM, acoustic guitar and cello,
sparse intro builds to full arrangement,
no vocals, instrumental only
Lyrics:
[Instrumental Intro]
[Verse]
[Build]
[Chorus]
[Fade Out]
需要控制段落比例时,加小节数(估算:小节数 × 4 ÷ BPM × 60 = 秒数):
Lyrics:
[Intro 4] ← 120BPM ≈ 8秒
[Verse 8] ← 120BPM ≈ 16秒
[Chorus 8] ← 120BPM ≈ 16秒
[Outro 4] ← 120BPM ≈ 8秒
小节数是建议值,AI 有 ±20% 偏差,最终仍需裁剪。
>60秒视频(推荐用 Extend 续生,保持调性一致):
先生成基础段 → 点 Extend 按钮续生 → Get Whole Song 下载完整版 → 剪辑软件裁剪
不推荐分段生成再拼接(调性容易漂移)
≤60-second videos (Generate directly, trim later):
Style: warm cinematic indie, 80 BPM, acoustic guitar and cello,
sparse intro builds to full arrangement,
no vocals, instrumental only
Lyrics:
[Instrumental Intro]
[Verse]
[Build]
[Chorus]
[Fade Out]
When you need to control the proportions of the sections, add bar counts (estimate: bars × 4 ÷ BPM × 60 = seconds):
Lyrics:
[Intro 4] ← 120BPM ≈ 8 seconds
[Verse 8] ← 120BPM ≈ 16 seconds
[Chorus 8] ← 120BPM ≈ 16 seconds
[Outro 4] ← 120BPM ≈ 8 seconds
Bar counts are only suggestions; expect about ±20% deviation from the AI, so final trimming is still required.
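The bar estimate multiplies bars by 4 beats, which implies 4/4 time. A small sketch of the formula, applied to the section layout above, might look like this (function names are illustrative; `beats_per_bar` is exposed only to make the 4/4 assumption explicit):

```python
from __future__ import annotations


def section_seconds(bars: int, bpm: float, beats_per_bar: int = 4) -> float:
    """Estimated length of a section: bars × beats per bar ÷ BPM × 60."""
    # Multiply before dividing to keep the arithmetic exact for round inputs.
    return bars * beats_per_bar * 60 / bpm


def total_seconds(sections: dict[str, int], bpm: float) -> float:
    """Sum the estimated length of every section in a layout."""
    return sum(section_seconds(bars, bpm) for bars in sections.values())


layout = {"Intro": 4, "Verse": 8, "Chorus": 8, "Outro": 4}
estimate = total_seconds(layout, bpm=120)
print(estimate)                        # 48.0
print(estimate * 0.8, estimate * 1.2)  # rough band for the ±20% deviation
```

Because the generated track can drift by roughly 20% either way, the estimate is best used to choose a layout slightly longer than the video and then trim.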
>60-second videos (use Extend to continue the track, which keeps the key consistent):
Generate the base segment first → click the Extend button to continue → Get Whole Song to download the full version → trim in editing software
Generating separate segments and splicing them is not recommended (the key tends to drift)
配乐方案输出格式
Music Scoring Plan Output Format
在每个分镜文档末尾,附上配乐建议:
Attach the music recommendation at the end of each storyboard document:
🎵 配乐方案
🎵 Music Scoring Plan
整体 BPM: XX–XX BPM(基于平均镜头时长 X 秒)
风格方向: [音乐风格,如 Cinematic Indie / Corporate Upbeat / Synthwave]
情绪弧线: [开场 → 中段 → 高潮 → 收尾 各段的音乐状态]
关键打点: Shot XX([时间点])— 音乐高潮/重拍对齐此镜头切换
AI 生成提示词:
[直接可用的音乐生成提示词]
版权安全资源推荐: Epidemic Sound / Artlist / YouTube Audio Library
(按需选用,不推荐具体版权曲目)
---
Overall BPM: XX–XX BPM (Based on average shot length of X seconds)
Style Direction: [Music style, e.g., Cinematic Indie / Corporate Upbeat / Synthwave]
Emotional Arc: [Music state in each section: Opening → Middle → Climax → Closing]
Key Hit Points: Shot XX ([Time Point]) — Music climax/downbeat aligns with this shot switch
AI Generation Prompt:
[Directly usable music generation prompt]
Copyright-Safe Resource Recommendations: Epidemic Sound / Artlist / YouTube Audio Library
(Select as needed, no specific copyrighted tracks recommended)
---
第六步:输出分镜文档
Step 6: Output Storyboard Document
输出格式判断原则
Choosing the Output Format
- 镜头数 ≤ 8:卡片式逐镜描述(清晰易读)
- 镜头数 9-20:Markdown 结构化表格 + 每镜提示词
- 镜头数 > 20:按叙事段落分组,每组有总结 + 镜头细节
- Number of shots ≤ 8: Card-style shot-by-shot description (clear and easy to read)
- Number of shots 9-20: Markdown structured table + per-shot prompts
- Number of shots > 20: Group by narrative section; each group has a summary + shot details
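The three thresholds above amount to a simple selection rule, sketched here with hypothetical format labels:

```python
def choose_format(shot_count: int) -> str:
    """Pick the storyboard layout from the number of shots."""
    if shot_count <= 8:
        return "cards"    # card-style, shot-by-shot description
    if shot_count <= 20:
        return "table"    # structured Markdown table + per-shot prompts
    return "grouped"      # grouped by narrative section, summary + shot details


print(choose_format(12))  # table
```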
分镜输出模板
Storyboard Output Template
每个镜头必须同时提供两套指导——用 AI 生成或真实拍摄都能直接使用:
Each shot must provide two sets of guidance, so it can be used directly for either AI generation or a real shoot:
《[视频标题/主题]》分镜脚本
《[Video Title/Theme]》Storyboard Script
基本参数
- 总时长:XX 秒 / X 分钟
- 比例:16:9 横屏 / 9:16 竖屏
- 总镜头数:XX 镜 / 平均镜头时长:X 秒
- 整体视觉风格:[用一句话描述视觉氛围]
- 配乐方向:[风格 + BPM 区间]
- 旁白字数预算:总时长 XX秒 × 3.5字/秒 × 80% ≈ 上限 XX字(留20%呼吸空间)
Basic Parameters
- Total Duration: XX seconds / X minutes
- Aspect Ratio: 16:9 horizontal / 9:16 vertical
- Total Number of Shots: XX shots / Average Shot Length: X seconds
- Overall Visual Style: [One sentence describing the visual atmosphere]
- Music Scoring Direction: [Style + BPM range]
- Narration Word Count Budget: Total duration XX seconds × 3.5 characters/second × 80% ≈ Upper limit of XX characters (20% breathing space reserved)
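The computed fields of this header (average shot length and narration budget) follow directly from the total duration and shot count. A minimal sketch that fills them in, assuming the default 3.5 characters/second and the 80% ceiling; `basic_parameters` is a hypothetical helper:

```python
from __future__ import annotations

import math


def basic_parameters(total_seconds: int, shot_count: int,
                     rate: float = 3.5, headroom: float = 0.8) -> list[str]:
    """Render the computed lines of the 'Basic Parameters' header."""
    asl = total_seconds / shot_count
    # Budget = duration × speaking rate × 80%, rounded down to whole characters.
    budget = math.floor(total_seconds * rate * headroom)
    return [
        f"Total Duration: {total_seconds} seconds",
        f"Total Number of Shots: {shot_count} shots / Average Shot Length: {asl:.1f} seconds",
        f"Narration Word Count Budget: {total_seconds} s × {rate} chars/s × {headroom:.0%} ≈ {budget} characters",
    ]


for line in basic_parameters(60, 12):
    print(line)
```

The remaining header fields (aspect ratio, visual style, music direction) are creative decisions and are left to the storyboard author.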
SHOT 01 — [镜头标题]
SHOT 01 — [Shot Title]
时长: 3-4 秒
画面: [具体到这个视频独有的画面,不是任何视频都能套的通用描述]
台词/旁白: 「[台词内容,XX字]」 / 纯画面,无旁白
字数校验: XX字 ÷ 3.5字/秒 ≈ 需X秒 ✓合适 / ⚠️超时→已删减至XX字 或 →镜头延长至X秒
情绪: [这一镜头想传递什么感受]
配乐状态: [此镜头音乐处于什么状态]
🤖 AI 视频提示词:
[英文提示词,包含:主体+动作、镜头类型+运动、光线+色调、速度、风格、技术参数]
🎬 人工拍摄指导:
- 器材/镜头: [推荐焦段,如 85mm 定焦 / 广角 24mm / 微距镜头]
- 布光: [如何打光或利用自然光,几盏灯、方向、软硬]
- 拍摄要点: [实拍时需要注意的关键操作,如跟焦、保持稳定器平衡、演员指导]
- 后期提示: [调色方向、速度调整、需要补拍的备选角度]
Duration: 3-4 seconds
Screen: [Specific screen unique to this video, not a generic description applicable to any video]
Lines/Narration: "[Line content, XX characters]" / Screen only, no narration
Word Count Check: XX characters ÷ 3.5 characters/sec ≈ Requires X seconds ✓ Appropriate / ⚠️ Over time → Trimmed to XX characters or → Shot extended to X seconds
Emotion: [What feeling this shot wants to convey]
Music State: [State of the music during this shot]
🤖 AI Video Prompt:
[English prompt, including: Subject+Action, Shot Type+Movement, Lighting+Color Tone, Speed, Style, Technical Parameters]
🎬 Manual Shooting Guidance:
- Equipment/Lens: [Recommended focal length, e.g., 85mm prime / 24mm wide-angle / macro lens]
- Lighting: [How to set up lights or use natural light, number of lights, direction, soft/hard]
- Shooting Key Points: [Key operations needed during actual shooting, e.g., focus tracking, keep stabilizer balanced, actor guidance]
- Post-Production Tips: [Color grading direction, speed adjustment, alternative angles to shoot as backups]
🎵 配乐方案
🎵 Music Scoring Plan
[见第五步输出格式]
**双轨原则:**
- AI 提示词侧重「画面最终效果的精确描述」——AI 模型需要知道结果长什么样
- 人工拍摄指导侧重「怎么拍出这个结果」——真实导演/摄影师需要知道操作步骤
- 两者描述的是同一个镜头,但角度完全不同,不要互相复制
[See Step 5 Output Format]
**Dual-Track Principle:**
- AI prompts focus on a **precise description of the final on-screen result**: the AI model needs to know what the result looks like
- Manual shooting guidance focuses on **how to achieve that result**: a real director or cinematographer needs to know the steps
- Both describe the same shot from completely different angles; do not copy one into the other
AI 视频提示词结构
AI Video Prompt Structure
通用格式(Sora / Kling / Runway / Veo)
General Format (Sora / Kling / Runway / Veo)
[Shot type] of [subject + action], [camera movement], [lighting condition],
[color palette/mood], [lens/depth of field], [speed/timing],
[style reference], [technical quality]
示例(教育类视频开场镜头):
Wide establishing shot of a young woman at a bright, organized desk surrounded
by floating digital icons, slow push-in toward her face, soft natural window
lighting mixed with warm ambient glow, clean white and blue color palette,
shallow depth of field with bokeh background elements, normal speed,
modern educational aesthetic, 4K, cinematic color grading
[Shot type] of [subject + action], [camera movement], [lighting condition],
[color palette/mood], [lens/depth of field], [speed/timing],
[style reference], [technical quality]
Example (Opening shot of educational video):
Wide establishing shot of a young woman at a bright, organized desk surrounded
by floating digital icons, slow push-in toward her face, soft natural window
lighting mixed with warm ambient glow, clean white and blue color palette,
shallow depth of field with bokeh background elements, normal speed,
modern educational aesthetic, 4K, cinematic color grading
即梦 Seedance 2.0 专项格式
Seedance 2.0 (Jimeng) Special Format
核心差异: Seedance 2.0 支持多模态输入,用 @素材名 直接引用参考素材,不再依赖文字堆砌专业术语。中文提示词原生支持,效果比英文翻译更好。
⚠️ 重要限制: Seedance 不支持负面提示词,别写"不要什么",用正向描述代替。
提示词公式(中文):
[主体 + 动作] + [场景/环境] + [光影] + [镜头语言] + [风格/质感] + [画质约束]
三种使用方式:
① 纯文字生成(无参考素材)
一位穿白色亚麻衬衫的男性独立开发者,坐在昏暗咖啡馆角落,
盯着 MacBook 屏幕上刚出现的成功提示,嘴角微微上扬,
窗外夜晚霓虹灯透入,冷暖光交叠,近景,镜头缓慢推进,
画面稳定无抖动,面部清晰不变形,电影感,4K 高清。
② 上传素材 + @ 引用(Seedance 最强用法)
参考 @视频1 的运镜轨迹和节奏,
将 @图片1 中的产品放置在同样的场景里,
背景换成极简白色工作台,冷白光从正上方打下,
镜头缓慢环绕产品一圈,强调工艺细节,
画面稳定,细节清晰,苹果发布会产品级质感。
③ 视频延长(接续已有镜头)
将 @视频1 延长 10s,画面继续展示产品侧面,
镜头从侧面缓慢移向背面,光影保持与前段完全一致,
动作连贯流畅,无跳帧,与前段自然衔接。
完整的 Seedance 2.0 使用指南(@语法、多模态组合、长视频工作流、排查表)见 references/seedance-jimeng.md
Core Difference: Seedance 2.0 supports multimodal input. Reference uploaded materials directly with @Material Name instead of stacking technical terms in text. Chinese prompts are supported natively and work better than prompts translated into English.
⚠️ Important Limitation: Seedance does not support negative prompts. Do not write what to leave out; describe what you want in positive terms.
Prompt Formula (Chinese):
[Subject + Action] + [Scene/Environment] + [Lighting] + [Lens Language] + [Style/Texture] + [Image Quality Constraints]
Three Usage Methods:
① Text-only Generation (No Reference Materials)
A male independent developer wearing a white linen shirt, sitting in a dim coffee shop corner,
staring at the success prompt that just appeared on his MacBook screen, the corner of his mouth slightly lifting,
neon lights from outside the window filter in, mixing warm and cool light, close-up shot, camera slowly pushes in,
screen is stable without shaking, face is clear without distortion, cinematic feel, 4K HD.
② Upload Materials + @ Reference (Seedance's Strongest Feature)
Refer to the camera movement trajectory and rhythm of @Video 1,
place the product in @Image 1 into the same scene,
replace the background with a minimalist white workbench, cold white light shines directly from above,
camera slowly orbits around the product to emphasize craftsmanship details,
screen is stable, details are clear, Apple keynote-level product texture.
③ Video Extension (Continue from Existing Shot)
Extend @Video 1 by 10s, continuing to show the side of the product,
camera slowly moves from the side to the back, lighting stays fully consistent with the previous segment,
motion is smooth and continuous, no frame jumps, joining the previous segment naturally.
For the complete Seedance 2.0 usage guide (@ syntax, multimodal combinations, long-video workflow, troubleshooting table), see references/seedance-jimeng.md
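The prompt formula above can be sketched as a small helper that joins the six slots in order. This is only an illustration, not part of Seedance or Jimeng: the names `FORMULA_ORDER`, `build_prompt`, and `referenced_materials` are hypothetical, and the regular expression assumes materials are labelled like `@视频1` and `@图片1`, as in the Chinese examples.

```python
import re

# Slot order from the prompt formula; the key names are hypothetical.
FORMULA_ORDER = ("subject_action", "scene", "lighting", "camera", "style", "quality")


def build_prompt(parts, sep=",", end="。"):
    """Join the filled formula slots in order, skipping empty ones."""
    if not parts.get("subject_action", "").strip():
        raise ValueError("subject_action is required: the subject leads the prompt")
    chunks = [parts[k].strip() for k in FORMULA_ORDER if parts.get(k, "").strip()]
    return sep.join(chunks) + end


def referenced_materials(prompt):
    """List @ references in order of appearance (assumed label format)."""
    return re.findall(r"@(?:视频|图片)\d+", prompt)


if __name__ == "__main__":
    print(build_prompt({
        "subject_action": "一位男性开发者盯着屏幕",
        "scene": "昏暗咖啡馆角落",
        "quality": "4K 高清",
    }))
    print(referenced_materials("参考 @视频1 的运镜轨迹,将 @图片1 中的产品放入场景"))
```

Listing the references before submitting makes it easy to check that every `@` label in the prompt corresponds to a material that was actually uploaded.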
第七步:可选增强项
Step 7: Optional Enhancements
完成分镜 + 配乐方案后,可主动提供:
🎨 色彩方案:给出视频整体调色建议(冷/暖/对比度/饱和度方向)
✂️ 剪辑节奏提示:哪些镜头可以快切,哪些需要呼吸感,哪些适合慢动作
🔄 备用镜头建议:为关键镜头提供备选拍摄方案(B-roll 补充)
After completing the storyboard + music scoring plan, you can proactively provide:
🎨 Color Scheme: Provide overall color grading suggestions for the video (cold/warm/contrast/saturation direction)
✂️ Editing Rhythm Tips: Which shots can be cut quickly, which need breathing space, which are suitable for slow motion
🔄 Backup Shot Recommendations: Provide alternative shooting plans for key shots (B-roll supplements)
提示词质量原则
Prompt Quality Principles
AI 视频提示词精准度标准:
- 主体在前,技术参数在后 — AI 模型对开头的词权重更高,主体描述越具体越好
- 避免矛盾指令 — 不要同时写 "handheld" 和 "perfectly stable"
- 情绪词有效 — "melancholic", "euphoric", "tense" 对 AI 生成有实际影响
- 数字比形容词准 — "15cm high pour" 比 "close to" 准;"drops from 4.2MB to 312KB" 比 "file size decreases" 准
- 避免模糊词 — "beautiful" 无效,"warm golden backlight creating rim lighting on subject's hair" 有效
- 速度要明确 — "slow motion 120fps" 比 "slow" 清晰;"real-time" 比什么都不写清晰
- 负面提示词 — 仅在工具支持时附上(Seedance 不支持,需改用正向描述):"no text overlay, no watermark, no camera shake, no cartoon style"
- 风格锚点 — 用电影/品牌美学做锚点:"Wes Anderson symmetry", "Wong Kar-wai color grading", "Apple keynote aesthetic", "A24 film texture"
人工拍摄指导精准度标准:
- 焦段必须给 — 不说「用长焦」,说「85mm 或 135mm 定焦,站距约 1.5 米」
- 布光可操作 — 不说「打暖光」,说「一盏 LED 柔光灯放在左侧 45 度,距离约 80cm,加柔光罩」
- 演员/被摄体指导 — 表情/动作描述要具体,如「不需要微笑,眼神看屏幕右上角,保持 2 秒不动」
- 备选方案 — 每个关键镜头建议一个备选拍法,防止拍摄现场意外
AI Video Prompt Accuracy Standards:
- Subject first, technical parameters later — AI models give more weight to the words at the start of a prompt, so lead with the subject and describe it as specifically as possible
- Avoid conflicting instructions — Don't write "handheld" and "perfectly stable" at the same time
- Emotional words are effective — "melancholic", "euphoric", "tense" have actual impacts on AI generation
- Numbers are more accurate than adjectives — "15cm high pour" is more accurate than "close to"; "drops from 4.2MB to 312KB" is more accurate than "file size decreases"
- Avoid vague words — "beautiful" is ineffective, "warm golden backlight creating rim lighting on subject's hair" is effective
- Speed must be clear — "slow motion 120fps" is clearer than "slow"; "real-time" is clearer than writing nothing
- Negative prompts — Attach only when the tool supports them (Seedance does not; use positive descriptions there): "no text overlay, no watermark, no camera shake, no cartoon style"
- Style anchors — Use film/brand aesthetics as anchors: "Wes Anderson symmetry", "Wong Kar-wai color grading", "Apple keynote aesthetic", "A24 film texture"
Manual Shooting Guidance Accuracy Standards:
- Must specify focal length — Don't say "use telephoto", say "85mm or 135mm prime lens, standing distance about 1.5 meters"
- Lighting must be operable — Don't say "use warm light", say "One LED soft light placed at 45 degrees to the left, about 80cm away, with a softbox"
- Actor/Subject Guidance — Describe expressions/actions specifically, e.g., "No need to smile, look at the upper right corner of the screen, stay still for 2 seconds"
- Alternative Plans — Recommend one alternative way to shoot each key shot as a fallback if something goes wrong on set
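Several of the AI prompt standards above are mechanical enough to check automatically. The sketch below is a minimal, assumed linter rather than a feature of any video tool: `CONFLICTS`, `VAGUE_WORDS`, and `lint_prompt` are hypothetical names, and the word lists contain only the examples given in the standards.

```python
import re

# Seed lists taken from the examples in the standards; extend as needed.
CONFLICTS = [("handheld", "perfectly stable")]
VAGUE_WORDS = {"beautiful"}


def lint_prompt(prompt, supports_negative=True):
    """Return a list of warnings for an English AI video prompt."""
    text = prompt.lower()
    warnings = []
    # Conflicting instructions: both terms of a pair appear together.
    for a, b in CONFLICTS:
        if a in text and b in text:
            warnings.append(f"conflict: '{a}' vs '{b}'")
    # Vague words that give the model nothing concrete to render.
    for word in sorted(VAGUE_WORDS):
        if re.search(rf"\b{re.escape(word)}\b", text):
            warnings.append(f"vague: '{word}'")
    # Tools without negative-prompt support need positive wording.
    if not supports_negative and re.search(r"\bno\s+\w+", text):
        warnings.append("negative phrasing: rewrite as a positive description")
    return warnings


if __name__ == "__main__":
    for warning in lint_prompt(
        "Beautiful handheld shot, perfectly stable, no watermark",
        supports_negative=False,
    ):
        print(warning)
```

Passing `supports_negative=False` models a tool such as Seedance, where "no ..." phrases should be rewritten as positive descriptions.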
参考文件
Reference Files
- references/shot-types.md — 完整镜头类型库 + 提示词范例
- references/music-design.md — ASL/BPM 速查、流派×用途映射、Suno AI 提示词专项指南
- references/prompt-examples.md — 按行业分类的视频提示词范例集(通用 AI 视频工具)
- references/seedance-jimeng.md — 即梦 Seedance 2.0 完整指南:多模态 @ 引用、提示词公式、场景模板、长视频工作流
- references/shot-types.md — Complete shot type library + prompt examples
- references/music-design.md — ASL/BPM quick reference, genre × purpose mapping, dedicated Suno AI prompt guide
- references/prompt-examples.md — Video prompt examples organized by industry (general AI video tools)
- references/seedance-jimeng.md — Complete Jimeng Seedance 2.0 guide: multimodal @ references, prompt formula, scene templates, long-video workflow