video-storyboard-designer

Video Storyboard Designer

Think like a top director, ask questions in plain language anyone can understand, and deliver creative, professional storyboards plus AI video prompts.

Step 1: Read the Context and Identify What Is Already Known

Before asking anything, extract the information already present in the conversation:
  • Is the theme or content direction known? ✓ Skip
  • Has the video's purpose or publishing platform been mentioned? ✓ Skip
  • Has the duration or aspect ratio been specified? ✓ Skip
Ask only the questions the user actually needs to answer; never re-ask what is already known.
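The skip logic above can be sketched in a few lines of Python. This is only an illustration: the dictionary keys and the question wording are hypothetical names chosen for the example, not part of the skill itself.

```python
# Hypothetical keys and question texts, for illustration only.
QUESTIONS = {
    "theme": "What is the video mainly trying to tell the audience?",
    "platform": "Who will watch it, and on which platform?",
    "duration": "Roughly how long should the video be?",
    "aspect_ratio": "Vertical (phone) or horizontal (computer/TV)?",
}


def remaining_questions(known: dict) -> list:
    """Return only the questions whose answers are not already known."""
    return [text for key, text in QUESTIONS.items() if not known.get(key)]


# A user who already stated the theme and duration gets two questions, not four.
to_ask = remaining_questions({"theme": "pour-over coffee", "duration": "30s"})
```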

Step 2: User Interview (Translate Professional Questions into Plain Language)

Interview Pacing Principles

  • Simple requests (clear theme and purpose): ask everything in one round; 3-5 questions are enough.
  • Complex requests (commercial projects, multiple scenes): split into two rounds, core questions first, then details.
  • When the user is clearly unsure: ask at most 2 questions and offer options to choose from. For whatever remains unknown, make bold assumptions and flag them in the output; letting the user correct a visible draft is more efficient than asking them to fill in blanks from nothing.
Handling unsure users: do not pile on questions just because information is incomplete. Ask the 1-2 most critical questions, derive the rest from the theme, and mark each assumption in the output with "⚠️ Assumed X here; tell me if this is wrong and I will adjust."

Core Must-Ask Questions (Choose a Suitable Way to Ask)

① What is the video about?
"What is this video mainly trying to tell the audience? What should viewers feel or do after watching it?" (Internal reading: narrative core, CTA, emotional goal)
② Who is it for, and where will they watch it?
"Roughly what kind of people will watch this video? Which platform will it mainly be published on?" Platform examples: Douyin / Kuaishou / WeChat Channels / YouTube / Bilibili / brand website / internal presentation (Internal reading: target audience, platform tone, vertical or horizontal preference)
③ How long is the video?
"About how long should the whole video be?" Reference options: 15 seconds (ad hook) / 30 seconds (short ad) / 60-90 seconds (standard short video) / 3-5 minutes (in-depth content) / longer (Internal reading: number of shots, narrative pacing, time budget per shot)
④ Is the frame wide or tall?
"Is the video vertical (scrolled on a phone) or horizontal (watched on a computer or TV)?" (Internal reading: aspect ratio 9:16 / 16:9 / 1:1, which affects composition and how densely the frame is filled)
⑤ (Optional) Any reference videos or style references?
"Is there a video whose feel seems right to you? Or an image you already have in mind?" (Internal reading: visual language reference, color tone, camera style)

Step 3: Automatic Theme → Style Derivation

After receiving the user's information and before designing the storyboard, derive the visual style internally. There is no need to walk the user through it item by item; let it show directly in the storyboard design.

Theme → Style Mapping Reference

| Theme Type | Inferred Atmosphere | Color Preference | Rhythm | Typical Camera Work |
| --- | --- | --- | --- | --- |
| Education / Popular Science | Bright, clear, fun | High brightness, medium saturation, blue/orange contrast | Medium, with pauses | Slow push-in, clean cuts |
| Tech Products | Futuristic, precise, cool | Cool tones, dark background, tech blue/silver | Fast, crisp | Product close-ups, slow-motion details |
| Emotional Stories / Brand Warmth | Warm, authentic, resonant | Warm yellow/orange-red, low-saturation film look | Slow, with room to breathe | Handheld, follow shots, shallow depth of field |
| Commercial Ads / Promotions | Energetic, attention-grabbing, action-driven | High saturation, strong contrast | Fast, strongly rhythmic | Quick cuts, big product close-ups |
| Travel / Exploration | Grand, free, curious | Natural light, high dynamic range | Smooth, expansive | Aerial shots, wide push-ins |
| Food | Appetizing, tactile, indulgent | Warm light, high contrast, rich colors | Mix of slow motion and quick cuts | Macro, top-down, slow motion |
| Fashion / Beauty | Refined, premium, distinctive | High contrast, clean background | Rhythmic | Extreme close-ups, orbit |
| Games / Entertainment | Exciting, immersive, interactive | High saturation, neon/glow effects | Fast | POV shots, quick cuts |
| Corporate / Brand Image | Professional, credible, warm | Brand-color dominated, steady | Medium | Steady push-ins, close-ups of team members' faces |

If the theme is not in the list above, derive the style with this logic:
  1. What is the target audience's emotional state? (relaxed / serious / curious / moved)
  2. What kind of trust does this brand or content want to build?
  3. How does the platform's tone affect visual density?

Step 4: Storyboard Design

Shot Count Calculation

| Video Duration | Recommended Shot Count | Average Shot Length |
| --- | --- | --- |
| 15 seconds | 4-6 shots | 2-4 seconds |
| 30 seconds | 6-10 shots | 3-5 seconds |
| 60 seconds | 10-15 shots | 4-6 seconds |
| 90 seconds | 15-20 shots | 4-6 seconds |
| 3 minutes | 20-35 shots | 5-8 seconds |
| 5 minutes+ | 35-60 shots | Set by content rhythm |
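As a rough sketch, the table can be turned into a lookup. The table only gives fixed durations, so how in-between durations are handled is an assumption of this example: any duration is snapped up to the next listed row, and anything beyond 3 minutes uses the "5 minutes+" row.

```python
# (row duration in seconds, min shots, max shots), copied from the table above.
SHOT_TABLE = [(15, 4, 6), (30, 6, 10), (60, 10, 15), (90, 15, 20), (180, 20, 35)]


def recommended_shots(duration_s: float) -> tuple:
    """Return (min, max) shots, snapping up to the next listed duration."""
    for limit, low, high in SHOT_TABLE:
        if limit >= duration_s:
            return low, high
    return 35, 60  # the "5 minutes+" row


low, high = recommended_shots(45)  # 45 s snaps up to the 60-second row
```

The resulting average shot length is simply the total duration divided by the chosen shot count, which is the value the music section later uses to pick a BPM range.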

Narrative Structure Templates (Choose by Purpose)

Ad / short video: Hook → Pain point or resonance → Solution → Proof → CTA
Brand story: Set the scene → Tension or problem → Turning point → Climax → Emotional landing
Educational content: Pose the question → Break down the steps → Key insight → Summary and reinforcement
Product showcase: Usage scenario → Core feature close-up → Differentiating highlights → Complete experience

Narration Timing Constraints (Count the Words First, Then Set the Shot Length)

For any shot with dialogue or narration, the duration cannot be set by gut feeling about the visuals alone; first verify that the lines can actually be spoken within the shot.

Speaking Rate Reference (Measured Industry Values)

Chinese voice-over / narration:

| Type | Rate (characters/minute) | Equivalent (characters/second) | Typical Use |
| --- | --- | --- | --- |
| Advertising / promotion | 220–250 | 3.7–4.2 | Douyin ads, hard-sell product ads |
| Corporate promotional video | 200–220 | 3.3–3.7 | Brand videos, launch events |
| Documentary / feature | 180–200 | 3.0–3.3 | Story-driven videos, humanities content |
| Emotional / prose narration | 160–180 | 2.7–3.0 | Slow-paced brands, poetic style |

Rule of thumb: estimate Chinese narration at 3.5 characters/second by default, the common baseline for corporate promotional videos.

English voice-over / narration:

| Type | Rate (words/minute) | Equivalent (words/second) |
| --- | --- | --- |
| Commercial ads | 160–180 WPM | 2.7–3.0 |
| General narration | 130–150 WPM | 2.2–2.5 |
| Documentary narration | 120–140 WPM | 2.0–2.3 |
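To show how these rates translate into time, here is a minimal estimator. The per-second values are midpoints of the ranges in the tables above, except the Chinese corporate rate, which uses the 3.5 default; the key names are made up for this example.

```python
# Characters (zh) or words (en) per second. Midpoints of the table ranges,
# an illustrative choice; ("zh", "corporate") uses the 3.5 default.
RATES = {
    ("zh", "ad"): 3.95,
    ("zh", "corporate"): 3.5,
    ("zh", "documentary"): 3.15,
    ("zh", "prose"): 2.85,
    ("en", "ad"): 2.85,
    ("en", "general"): 2.35,
    ("en", "documentary"): 2.15,
}


def narration_seconds(units: int, lang: str = "zh", style: str = "corporate") -> float:
    """Estimate speaking time for `units` characters (zh) or words (en)."""
    return units / RATES[(lang, style)]


seconds = narration_seconds(35)  # 35 Chinese characters at 3.5 chars/s
```

An estimate like this is only a first pass; reading the line aloud with a timer remains the more reliable check.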

Quick Reference: Shot Duration → Narration Capacity

| Shot Duration | Chinese Characters That Fit (at 3.5 characters/second) | Notes |
| --- | --- | --- |
| 3 seconds | ≤ 10 | Only a short phrase or an exclamation |
| 5 seconds | ≤ 17 | One sentence at most, and not a complex one |
| 8 seconds | ≤ 28 | One or two complete short sentences |
| 10 seconds | ≤ 35 | Roughly two sentences |
| 15 seconds | ≤ 52 | Three to four sentences; leave room for pauses |
| 30 seconds | ≤ 105 | A full paragraph; watch the rise and fall of the rhythm |

⚠️ This is a ceiling, not a target. Leave 20% headroom: keep the actual line at or under 80% of capacity, and give the remaining time to pauses, emotion, and letting the image breathe.

Rules for Balancing Narration and Shot Length

When the narration and the shot duration conflict, resolve it in this order:
  1. Check the line first: read it aloud and time it; this is more accurate than any formula.
  2. If the line runs long, pick one of two options:
    • Cut the line: drop modifiers and keep the core message ("This product uses the latest advanced technology to provide you with the ultimate experience" → "This product uses the latest technology for the ultimate experience").
    • Extend the shot: if the visuals carry enough information to justify it, lengthen the shot.
  3. If the line is too short: do not pad it to fill the time. A short line plus silence plus room for the image to breathe is often stronger than forcing in extra words.
  4. Lines that span shots: if one piece of narration runs across several shots, mark clearly in the storyboard which part of the line goes with which shot, to avoid audio and picture drifting apart in the edit.
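The capacity ceiling and the 80% guideline can be combined into one quick check before falling back on the rules above. This is a sketch under the document's default of 3.5 characters per second; the function name and return strings are invented for the example.

```python
def check_line(n_chars: int, shot_seconds: float,
               rate: float = 3.5, headroom: float = 0.2) -> str:
    """Classify a narration line against a shot's capacity.

    rate is characters per second; headroom is the share of capacity
    reserved for pauses (20% by default).
    """
    capacity = shot_seconds * rate
    if n_chars > capacity:
        return "over: cut the line or extend the shot"
    if n_chars > capacity * (1 - headroom):
        return "tight: fits, but eats into the breathing room"
    return "ok"


status = check_line(16, 5)  # 16 characters in a 5-second shot (capacity 17.5)
```

A result of "over" corresponds to rule 2 (cut or extend), while "tight" is a prompt to read the line aloud and decide whether the pacing still works.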

Storyboard Design Elements (Required for Every Shot)

Design the following for each shot (use plain language in what the user reads, and professional terms in the prompts):
  1. Visual content: what is in the shot and what the subject is doing (specific, not templated)
  2. Shot size: how much of the scene the frame covers
  3. Camera angle: where the camera shoots from
  4. Camera movement: whether the camera moves, and how
  5. Duration: how many seconds the shot lasts
  6. Dialogue/narration: what is said during the shot, and whether the character count fits the duration capacity (required; if there is none, write "Visuals only, no narration")
  7. Mood/emotion: what the shot should feel like

Storyboard Description Quality Principle: No Templates

Do not pad descriptions with empty, generic wording. Each shot description must be a concrete image unique to this video, not a sentence that could be dropped into any video.
❌ Templated (bad):
  • "The camera slowly pushes in, revealing the overall environment"
  • "Show the product's core features and convey brand value"
  • "The character's expression is natural and conveys positive emotion"
✅ Specific (good):
  • "The narrow spout of the pour-over kettle lines up with the center of the dripper; water falls vertically from 15 cm, and the wetted coffee grounds swell into a small dome"
  • "The file size drops from 4.2MB to 312KB, and that number change is stretched across 3 full seconds in slow motion"
  • "He stares at the terminal output of the successful deployment; his mouth does not move, but there is a faint something in his eyes"
Self-check: read the description to someone else. Can they reconstruct the image precisely in their head? Yes = pass; no = rewrite.

Shot Terms → Plain Language

| Professional Term | Plain Explanation | AI Prompt Wording |
| --- | --- | --- |
| Wide Shot | Shows the person's full body and the surroundings | wide establishing shot |
| Medium Shot | Waist up; the focus is on what the person is doing | medium shot, waist-up |
| Close-up | Shoulders up; the focus is on facial expression | close-up shot |
| Extreme Close-up | Only the eyes, hands, or a single detail | extreme close-up, macro detail |
| Slow Push-in | The camera moves in slowly, building tension | slow push-in, gradual zoom |
| Tracking Shot | The camera moves along with the subject | tracking shot following subject |
| Handheld | Slightly shaky, with a strong sense of realism | handheld camera, slight natural shake |
| Aerial / Drone | Looking down from high above | aerial drone shot, bird's eye view |
| Orbit | The camera circles the subject once | 360 orbit around subject |
| Shallow Depth of Field | Blurred background, sharp subject | shallow depth of field, bokeh background |
| Golden Hour | Natural warm light at sunrise or sunset | golden hour lighting |
| Slow Motion | Playback slowed down to bring out detail | slow motion, high frame rate |

Step 5: Music Design

Music is not an afterthought; it is a narrative tool on the same level as the storyboard. Deliver the music plan together with the storyboard.

Core Principle: How ASL Maps to BPM

ASL (average shot length) = total duration ÷ number of shots. It directly determines the BPM range:

| Editing Pace | ASL | BPM Range | Typical Use |
| --- | --- | --- | --- |
| Very fast cuts | 1-2 seconds | 130–160 BPM | Action, games, sports peaks |
| Fast cuts | 2-3 seconds | 120–140 BPM | Ad hooks, product showpieces, energetic content |
| Medium | 3-6 seconds | 90–120 BPM | Most short videos, education, product showcases |
| Slow | 6-10 seconds | 70–95 BPM | Emotional brand pieces, travel, documentary style |
| Very slow / breathing | 10 seconds+ | 50–75 BPM | Ambient pieces, meditative feel, premium brands |

Usage: calculate the ASL first, then pick a BPM from the matching range. Never the other way round.
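The "ASL first, then BPM" order can be sketched as a small function. The band limits come from the table above; because adjacent rows share a boundary value (2, 3, 6, and 10 seconds), this example assigns a boundary ASL to the faster band, which is an arbitrary choice.

```python
# (upper ASL bound in seconds, BPM low, BPM high), from the table above.
BPM_BANDS = [(2, 130, 160), (3, 120, 140), (6, 90, 120), (10, 70, 95)]


def bpm_range(total_seconds: float, shot_count: int) -> tuple:
    """Compute ASL = total duration / shot count, then look up the BPM band."""
    asl = total_seconds / shot_count
    for upper, low, high in BPM_BANDS:
        if upper >= asl:  # a boundary value falls into the faster band
            return low, high
    return 50, 75  # the "10 seconds+" row


low, high = bpm_range(60, 12)  # ASL = 5 s, the medium band
```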

Two Relationships Between Music and Picture (Both Are Valid Choices)

Parallel (harmony): fast visuals + fast music, slow visuals + slow music → strengthens flow and rhythm; suits ads, products, and energetic content.
Counterpoint (contrast): fast cuts + slow music → creates a sense of tragedy or weight (e.g., a battle scene under a lament); slow shots + fast drums → creates anxiety or a sense of mission. Contrast must be a deliberate choice, not an accident.

Theme → Music Style Derivation

| Video Theme | Emotional Goal | Recommended Music Style | BPM Reference | Instrumentation |
| --- | --- | --- | --- | --- |
| Education / Popular Science | Focused, curious, relaxed | Modern instrumental, Ambient Pop | 90–110 | Piano + light electronics + strings |
| Tech Products | Futuristic, precise, cool | Electronic / Synthwave / Minimal | 110–130 | Synthesizer + kick drum |
| Emotional Brand / Story | Resonant, warm, moving | Cinematic Indie, acoustic instrumental | 65–85 | Acoustic guitar + piano + cello |
| Commercial Ads / Promotions | Energetic, action-driven, upbeat | Pop / Electronic / Corporate Upbeat | 115–130 | Prominent percussion + bright strings |
| Travel / Exploration | Free, grand, curious | Cinematic Orchestral, World | 80–105 | Full orchestra + natural sound effects |
| Food | Enjoyment, pleasure, appetite | Jazz / Acoustic / Bossa Nova | 80–100 | Light jazz + acoustic guitar |
| Fashion / Beauty | Premium, confident, distinctive | Electronic / Neo Soul / Minimal | 95–115 | Bass + minimal drum machine |
| Games / Entertainment | Exciting, immersive, high-energy | EDM / Trap / Electronic | 130–150 | Synth bass + 808 + high-energy drums |
| Corporate Image | Professional, credible, warm | Corporate Cinematic | 85–105 | Strings + piano + light percussion |
| Documentary / Humanities | Authentic, reflective, empathetic | Ambient / Minimalist | 55–80 | Solo instrument + spacious reverb |

Music Segment Design (Follow the Narrative Structure)

Do not lay one unchanging track under the whole video. Shape the music to the narrative beats:
  • Opening (hook): keep the energy below full so there is room to rise, or use silence followed by a sudden entrance for impact.
  • Information/content section: the music steps back and serves as a bed track; voice and content take priority, with the music volume lowered accordingly.
  • Climax/turning point: music and picture push forward together; drums or strings build, and hit points line up with the cuts.
  • Ending/CTA: fade out, or close on a clean ending sting; do not hard-cut.
Hit point principle: at emotional-peak cuts, product reveals, and title entrances, align the music's downbeat or drum hit with the moment. This is the main source of a professional feel.

AI Music Prompt Structure (Suno-Specific)

The complete Suno prompt guide is in the "Suno AI Prompt Special Guide" (「Suno AI 提示词专项指南」) section of references/music-design.md. What follows is a quick working framework for producing a music plan.
⚠️ First premise: Suno cannot control duration precisely. Suno generates "a piece of music", not "exactly N seconds of music". The correct workflow is: generate music slightly longer than the video → trim it to the exact length in editing software.

The Two Suno Fields: Keep Them Strictly Separate

| Field | What Goes In |
| --- | --- |
| Style of Music | Genre + mood + BPM + instruments + exclusions (nouns and adjectives only, no verb commands) |
| Lyrics | [Structure tags] + optional bar count (e.g., [Verse 8]) + lyrics (for instrumental tracks, structure tags alone are enough) |

Required exclusions: instrumental only / no vocals (otherwise Suno adds vocals by default)
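One way to keep the Style of Music field disciplined is to assemble it from parts and always append the exclusions. The helper below is a hypothetical sketch that only builds a string; it does not call any Suno API, and the parameter names are invented for the example.

```python
def suno_style(genre: str, bpm: int, instruments: list, extras: tuple = ()) -> str:
    """Build a Style of Music string: genre, BPM, instruments, then exclusions."""
    parts = [genre, f"{bpm} BPM", " and ".join(instruments)]
    parts.extend(extras)
    # The exclusions always go last so they are never forgotten.
    parts.extend(["no vocals", "instrumental only"])
    return ", ".join(parts)


style = suno_style("warm cinematic indie", 80, ["acoustic guitar", "cello"])
```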

Quick Prompt Templates for Video Music

Videos of 60 seconds or less (generate directly, trim afterwards):
Style: warm cinematic indie, 80 BPM, acoustic guitar and cello,
       sparse intro builds to full arrangement,
       no vocals, instrumental only

Lyrics:
[Instrumental Intro]
[Verse]
[Build]
[Chorus]
[Fade Out]
To control the proportions of the sections, add bar counts (estimate, assuming 4/4 time: bars × 4 ÷ BPM × 60 = seconds):
Lyrics:
[Intro 4]        ← 120 BPM ≈ 8 seconds
[Verse 8]        ← 120 BPM ≈ 16 seconds
[Chorus 8]       ← 120 BPM ≈ 16 seconds
[Outro 4]        ← 120 BPM ≈ 8 seconds
Bar counts are only a hint; expect roughly ±20% deviation from the AI, so trimming is still required.
Videos longer than 60 seconds (use Extend to continue the track and keep the tonality consistent):
Generate the base segment → click the Extend button → use Get Whole Song to download the full version → trim in editing software
Generating separate segments and splicing them is not recommended (the tonality drifts easily).
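The bar-count estimate is easy to check in code. The sketch below assumes 4 beats per bar (4/4 time), which is what the "× 4" in the formula implies; the function names are illustrative.

```python
def bars_to_seconds(bars: int, bpm: float, beats_per_bar: int = 4) -> float:
    """Seconds = bars x beats per bar x 60 / BPM."""
    return bars * beats_per_bar * 60 / bpm


def section_plan(sections: dict, bpm: float) -> dict:
    """Map each structure tag to its estimated length in seconds."""
    return {name: bars_to_seconds(bars, bpm) for name, bars in sections.items()}


plan = section_plan({"Intro": 4, "Verse": 8, "Chorus": 8, "Outro": 4}, bpm=120)
total = sum(plan.values())  # estimated track length before trimming
```

Since the generated track can deviate from these estimates, the total is best treated as a planning number for choosing bar counts, not as a guarantee of the final length.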

Music Plan Output Format

Append the music recommendation to the end of every storyboard document:

🎵 Music Plan

Overall BPM: XX–XX BPM (based on an average shot length of X seconds)
Style direction: [music style, e.g., Cinematic Indie / Corporate Upbeat / Synthwave]
Emotional arc: [state of the music in each section: opening → middle → climax → closing]
Key hit points: Shot XX ([timestamp]): musical peak or downbeat aligned with this cut
AI generation prompt: [a ready-to-use music generation prompt]
Copyright-safe sources: Epidemic Sound / Artlist / YouTube Audio Library (choose as needed; do not recommend specific copyrighted tracks)

---

Step 6: Output the Storyboard Document

Choosing the Output Format

  • 8 shots or fewer: card-style, shot-by-shot descriptions (clear and easy to read)
  • 9-20 shots: structured Markdown table plus a prompt for each shot
  • More than 20 shots: group by narrative section, each group with a summary plus shot details
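The three thresholds amount to a simple decision function. The return labels here are placeholders chosen for the example, not names the skill defines.

```python
def output_format(shot_count: int) -> str:
    """Pick the storyboard layout from the number of shots."""
    if shot_count > 20:
        return "grouped"  # grouped by narrative section, with summaries
    if shot_count > 8:
        return "table"    # structured table plus a prompt per shot
    return "cards"        # card-style, shot-by-shot descriptions


layout = output_format(12)
```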

Storyboard Output Template

Every shot must come with two sets of guidance, so it can be used directly for either AI generation or a real shoot:

"[Video Title/Theme]" Storyboard Script

Basic Parameters
  • Total duration: XX seconds / X minutes
  • Aspect ratio: 16:9 horizontal / 9:16 vertical
  • Total shots: XX / Average shot length: X seconds
  • Overall visual style: [one sentence describing the visual atmosphere]
  • Music direction: [style + BPM range]
  • Narration budget: total duration XX seconds × 3.5 characters/second × 80% ≈ at most XX characters (20% breathing room reserved)

SHOT 01: [Shot Title]

Duration: 3-4 seconds
Visuals: [a concrete image unique to this video, not a generic description that fits any video]
Dialogue/narration: "[line, XX characters]" / Visuals only, no narration
Character-count check: XX characters ÷ 3.5 characters/second ≈ X seconds needed. ✓ Fits / ⚠️ Too long → trimmed to XX characters, or → shot extended to X seconds
Mood: [what this shot should make the viewer feel]
Music state: [what the music is doing during this shot]
🤖 AI video prompt: [English prompt covering: subject + action, shot type + movement, lighting + color tone, speed, style, technical parameters]
🎬 Live-shoot guidance:
  • Gear/lens: [recommended focal length, e.g., 85mm prime / 24mm wide-angle / macro lens]
  • Lighting: [how to light the shot or use natural light: number of lights, direction, soft or hard]
  • Shooting notes: [key operations to watch on set, e.g., pulling focus, keeping the stabilizer balanced, directing the actor]
  • Post-production notes: [grading direction, speed changes, alternative angles to shoot as backups]

🎵 Music Plan

[See the output format in Step 5]

Dual-track principle:
  • The AI prompt focuses on a precise description of the final image: the AI model needs to know what the result looks like.
  • The live-shoot guidance focuses on how to get that result: a real director or cinematographer needs to know the steps.
  • Both describe the same shot from completely different angles; do not copy one into the other.

AI Video Prompt Structure

General Format (Sora / Kling / Runway / Veo)

[Shot type] of [subject + action], [camera movement], [lighting condition],
[color palette/mood], [lens/depth of field], [speed/timing],
[style reference], [technical quality]
Example (opening shot of an educational video):
Wide establishing shot of a young woman at a bright, organized desk surrounded
by floating digital icons, slow push-in toward her face, soft natural window
lighting mixed with warm ambient glow, clean white and blue color palette,
shallow depth of field with bokeh background elements, normal speed,
modern educational aesthetic, 4K, cinematic color grading
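To keep the field order consistent from shot to shot, the general format can be filled in from structured fields. This is an illustrative sketch: the class and attribute names are invented here, and it only formats text; it does not call any video model.

```python
from dataclasses import dataclass


@dataclass
class ShotPrompt:
    """One field per slot of the general format, in the same order."""
    shot_type: str
    subject_action: str
    camera_movement: str = ""
    lighting: str = ""
    palette: str = ""
    lens: str = ""
    speed: str = ""
    style: str = ""
    quality: str = ""

    def render(self) -> str:
        # Subject first, technical parameters last; empty slots are skipped.
        head = f"{self.shot_type} of {self.subject_action}"
        rest = [self.camera_movement, self.lighting, self.palette,
                self.lens, self.speed, self.style, self.quality]
        return ", ".join([head] + [part for part in rest if part])


prompt = ShotPrompt(
    shot_type="Wide establishing shot",
    subject_action="a young woman at a bright, organized desk",
    camera_movement="slow push-in toward her face",
    speed="normal speed",
    quality="4K",
).render()
```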

Seedance 2.0 (Jimeng) Format

Core difference: Seedance 2.0 accepts multimodal input. Reference uploaded material directly with @material-name (@素材名) instead of piling up professional terms in text. Chinese prompts are supported natively and work better than prompts translated into English.
⚠️ Important limitation: Seedance does not support negative prompts. Do not write what to leave out; describe what you want in positive terms.
Prompt formula (Chinese):
[Subject + action] + [Scene/environment] + [Lighting] + [Camera language] + [Style/texture] + [Image-quality constraints]
Three ways to use it:
① Text-only generation (no reference material)
一位穿白色亚麻衬衫的男性独立开发者,坐在昏暗咖啡馆角落,
盯着 MacBook 屏幕上刚出现的成功提示,嘴角微微上扬,
窗外夜晚霓虹灯透入,冷暖光交叠,近景,镜头缓慢推进,
画面稳定无抖动,面部清晰不变形,电影感,4K 高清。
English rendering:
A male indie developer in a white linen shirt sits in the corner of a dim coffee shop,
staring at the success message that has just appeared on his MacBook screen, the corners of his mouth lifting slightly,
night-time neon from outside the window spills in, cool and warm light overlapping, close-up, the camera pushes in slowly,
stable image with no shake, face sharp and undistorted, cinematic feel, 4K HD.
② Upload material + @ reference (Seedance's strongest usage)
参考 @视频1 的运镜轨迹和节奏,
将 @图片1 中的产品放置在同样的场景里,
背景换成极简白色工作台,冷白光从正上方打下,
镜头缓慢环绕产品一圈,强调工艺细节,
画面稳定,细节清晰,苹果发布会产品级质感。
English rendering:
Follow the camera path and rhythm of @视频1 (video 1),
place the product from @图片1 (image 1) in the same scene,
change the background to a minimalist white workbench, with cool white light falling from directly above,
the camera slowly orbits the product once, emphasizing the craftsmanship details,
stable image, sharp details, Apple-keynote product-grade texture.
③ Video extension (continue an existing shot)
将 @视频1 延长 10s,画面继续展示产品侧面,
镜头从侧面缓慢移向背面,光影保持与前段完全一致,
动作连贯流畅,无跳帧,与前段自然衔接。
English rendering:
Extend @视频1 (video 1) by 10s, continuing to show the side of the product,
the camera moves slowly from the side toward the back, lighting kept exactly the same as the previous segment,
motion smooth and continuous, no skipped frames, joining the previous segment naturally.
The complete Seedance 2.0 guide (@ syntax, multimodal combinations, long-video workflow, troubleshooting table) is in references/seedance-jimeng.md
Step 7: Optional Enhancements

Once the storyboard and music plan are done, you can proactively offer:
🎨 Color scheme: overall grading suggestions for the video (cool/warm, contrast, saturation direction)
✂️ Editing rhythm notes: which shots can be cut fast, which need room to breathe, which suit slow motion
🔄 Backup shots: alternative shooting options for key shots (B-roll coverage)

Prompt Quality Principles

Precision standards for AI video prompts:
  1. Subject first, technical parameters last: AI models weight the opening words more heavily, so the more specific the subject description, the better.
  2. Avoid contradictory instructions: do not write "handheld" and "perfectly stable" in the same prompt.
  3. Emotion words work: "melancholic", "euphoric", and "tense" have a real effect on the output.
  4. Numbers beat adjectives: "15cm high pour" is more precise than "close to"; "drops from 4.2MB to 312KB" is more precise than "file size decreases".
  5. Avoid vague words: "beautiful" does nothing; "warm golden backlight creating rim lighting on subject's hair" works.
  6. State the speed explicitly: "slow motion 120fps" is clearer than "slow"; "real-time" is clearer than saying nothing.
  7. Negative prompts: add them when needed, e.g. "no text overlay, no watermark, no camera shake, no cartoon style" (not for Seedance, which does not support negative prompts).
  8. Style anchors: anchor the look to a film or brand aesthetic, e.g. "Wes Anderson symmetry", "Wong Kar-wai color grading", "Apple keynote aesthetic", "A24 film texture".
Precision standards for live-shoot guidance:
  1. Always give the focal length: not "use a telephoto", but "85mm or 135mm prime, about 1.5 meters from the subject".
  2. Make the lighting actionable: not "use warm light", but "one LED soft light at 45 degrees to the left, about 80cm away, with a softbox".
  3. Direct the actor or subject: describe expressions and actions concretely, e.g. "no need to smile; look at the upper right corner of the screen and hold still for 2 seconds".
  4. Backup plan: suggest one alternative way to shoot each key shot, in case something goes wrong on set.
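Two of the prompt rules above (no contradictory instructions, no vague words) lend themselves to a quick automated check. The sketch below is deliberately small: the first conflict pair and the word "beautiful" come from the list above, while the second pair is an assumption added for illustration.

```python
# Pairs that should not appear together in one prompt. Only the first pair
# comes from the principles above; the second is an illustrative assumption.
CONFLICTS = [("handheld", "perfectly stable"), ("slow motion", "real-time")]
VAGUE_WORDS = ["beautiful"]


def lint_prompt(prompt: str) -> list:
    """Return a list of human-readable issues found in a video prompt."""
    text = prompt.lower()
    issues = [f"conflict: '{a}' vs '{b}'"
              for a, b in CONFLICTS if a in text and b in text]
    issues += [f"vague word: '{w}'" for w in VAGUE_WORDS if w in text]
    return issues


issues = lint_prompt("Handheld close-up, perfectly stable, beautiful light")
```

Plain substring matching is crude (it cannot judge whether a description is specific enough), so a check like this can only catch the obvious cases; the self-check of reading the description to another person still applies.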

Reference Files

  • references/shot-types.md: complete shot-type library plus prompt examples
  • references/music-design.md: ASL/BPM quick reference, genre × use-case mapping, Suno AI prompt guide
  • references/prompt-examples.md: video prompt examples organized by industry (for general AI video tools)
  • references/seedance-jimeng.md: complete Jimeng Seedance 2.0 guide covering multimodal @ references, the prompt formula, scene templates, and the long-video workflow