avatar-video

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Avatar Video

Avatar视频

Create AI avatar videos with full control over avatars, voices, scripts, scenes, and backgrounds. Build single or multi-scene videos with exact configuration using HeyGen's

/v2/video/generate

API.

创建AI Avatar视频，可全面控制Avatar、语音、脚本、场景及背景。借助HeyGen的

/v2/video/generate

API，按照精确配置制作单场景或多场景视频。

Authentication

身份验证

All requests require the

X-Api-Key

header. Set the

HEYGEN_API_KEY

environment variable.

bash

curl -X GET "https://api.heygen.com/v2/avatars" \
  -H "X-Api-Key: $HEYGEN_API_KEY"

所有请求均需携带

X-Api-Key

请求头。请设置

HEYGEN_API_KEY

环境变量。

bash

curl -X GET "https://api.heygen.com/v2/avatars" \
  -H "X-Api-Key: $HEYGEN_API_KEY"

Tool Selection

工具选择

If HeyGen MCP tools are available (

mcp__heygen__*

), prefer them over direct HTTP API calls — they handle authentication and request formatting automatically.

Task	MCP Tool	Fallback (Direct API)
Check video status / get URL	`mcp__heygen__get_video`	`GET /v2/videos/{video_id}`
List account videos	`mcp__heygen__list_videos`	`GET /v2/videos`
Delete a video	`mcp__heygen__delete_video`	`DELETE /v2/videos/{video_id}`

Video generation (

POST /v2/video/generate

) and avatar/voice listing are done via direct API calls — see reference files below.

若HeyGen MCP工具可用（

mcp__heygen__*

），优先使用此类工具而非直接调用HTTP API——它们会自动处理身份验证和请求格式。

任务	MCP工具	备选方案（直接调用API）
查询视频状态/获取视频链接	`mcp__heygen__get_video`	`GET /v2/videos/{video_id}`
列出账号下的视频	`mcp__heygen__list_videos`	`GET /v2/videos`
删除视频	`mcp__heygen__delete_video`	`DELETE /v2/videos/{video_id}`

视频生成（

POST /v2/video/generate

）以及Avatar/语音列表查询需通过直接调用API完成——详见下方参考文档。

Default Workflow

默认工作流程

List avatars —
```
GET /v2/avatars
```
→ pick an avatar, preview it, note
```
avatar_id
```
and
```
default_voice_id
```
. See avatars.md
List voices (if needed) —
```
GET /v2/voices
```
→ pick a voice matching the avatar's gender/language. See voices.md
Write the script — Structure scenes with one concept each. See scripts.md
Generate the video —
```
POST /v2/video/generate
```
with avatar, voice, script, and background per scene. See video-generation.md
Poll for completion —
```
GET /v2/videos/{video_id}
```
until status is
```
completed
```
. See video-status.md

列出Avatar — 调用
```
GET /v2/avatars
```
→ 选择一个Avatar，预览它，记录
```
avatar_id
```
和
```
default_voice_id
```
。详见avatars.md
列出语音（如需） — 调用
```
GET /v2/voices
```
→ 选择与Avatar性别/语言匹配的语音。详见voices.md
编写脚本 — 为每个场景构建单一主题的脚本。详见scripts.md
生成视频 — 调用
```
POST /v2/video/generate
```
，传入每个场景对应的Avatar、语音、脚本及背景。详见video-generation.md
轮询完成状态 — 调用
```
GET /v2/videos/{video_id}
```
，直到状态变为
```
completed
```
。详见video-status.md

Quick Reference

快速参考

Task	Read
List and preview avatars	avatars.md
List and select voices	voices.md
Write and structure scripts	scripts.md
Generate video (single or multi-scene)	video-generation.md
Add custom backgrounds	backgrounds.md
Add captions / subtitles	captions.md
Add text overlays	text-overlays.md
Create transparent WebM video	video-generation.md (WebM section)
Use templates	templates.md
Create avatar from photo	photo-avatars.md
Check video status / download	video-status.md
Upload assets (images, audio)	assets.md
Use with Remotion	remotion-integration.md
Set up webhooks	webhooks.md

任务	参考文档
列出并预览Avatar	avatars.md
列出并选择语音	voices.md
编写并结构化脚本	scripts.md
生成视频（单场景或多场景）	video-generation.md
添加自定义背景	backgrounds.md
添加字幕	captions.md
添加文字叠加层	text-overlays.md
创建透明WebM视频	video-generation.md（WebM章节）
使用模板	templates.md
从照片创建Avatar	photo-avatars.md
查询视频状态/下载视频	video-status.md
上传资源（图片、音频）	assets.md
与Remotion集成	remotion-integration.md
设置Webhook	webhooks.md

When to Use This Skill vs Create Video

本技能与Create Video技能的适用场景对比

This skill is for precise control — you choose the avatar, write the exact script, configure each scene.

If the user just wants to describe a video idea and let AI handle the rest (script, avatar, visuals), use the create-video skill instead.

User Says	Create Video Skill	This Skill
"Make me a video about X"	✓
"Create a product demo"	✓
"I want avatar Y to say exactly Z"		✓
"Multi-scene video with different backgrounds"		✓
"Transparent WebM for compositing"		✓
"Use this specific voice for my script"		✓
"Batch generate videos with exact specs"		✓

本技能适用于精准控制场景——由你选择Avatar、编写精准脚本、配置每个场景。

若用户仅希望描述视频创意，并交由AI处理其余部分（脚本、Avatar、视觉效果），则应使用create-video技能。

用户需求	Create Video技能	本技能
"帮我制作一个关于X的视频"	✓
"创建一个产品演示视频"	✓
"我想要Avatar Y精准朗读内容Z"		✓
"制作多场景且每个场景背景不同的视频"		✓
"创建用于合成的透明WebM视频"		✓
"为我的脚本使用指定的语音"		✓
"按照精确规格批量生成视频"		✓

Reference Files

参考文档

Core Video Creation

核心视频创建

references/avatars.md - Listing avatars, styles, avatar_id selection
references/voices.md - Listing voices, locales, speed/pitch
references/scripts.md - Writing scripts, pauses, pacing
references/video-generation.md - POST /v2/video/generate and multi-scene videos

references/avatars.md - 列出Avatar、风格，选择
```
avatar_id
```
references/voices.md - 列出语音、区域设置、语速/音调
references/scripts.md - 编写脚本、停顿、节奏
references/video-generation.md - 调用
```
POST /v2/video/generate
```
及多场景视频制作

Video Customization

视频自定义

references/backgrounds.md - Solid colors, images, video backgrounds
references/text-overlays.md - Adding text with fonts and positioning
references/captions.md - Auto-generated captions and subtitles

references/backgrounds.md - 纯色、图片、视频背景
references/text-overlays.md - 添加带字体和位置设置的文字
references/captions.md - 自动生成字幕

Advanced Features

高级功能

references/templates.md - Template listing and variable replacement
references/photo-avatars.md - Creating avatars from photos
references/webhooks.md - Webhook endpoints and events

references/templates.md - 模板列表及变量替换
references/photo-avatars.md - 从照片创建Avatar
references/webhooks.md - Webhook端点及事件

Integration

集成

references/remotion-integration.md - Using HeyGen in Remotion compositions

references/remotion-integration.md - 在Remotion合成中使用HeyGen

Foundation

基础功能

references/video-status.md - Polling patterns and download URLs
references/assets.md - Uploading images, videos, audio
references/dimensions.md - Resolution and aspect ratios
references/quota.md - Credit system and usage limits

references/video-status.md - 轮询模式及下载链接
references/assets.md - 上传图片、视频、音频
references/dimensions.md - 分辨率及宽高比
references/quota.md - 积分系统及使用限制

Best Practices

最佳实践

Preview avatars before generating — Download
```
preview_image_url
```
so the user can see the avatar before committing
Use avatar's default voice — Most avatars have a
```
default_voice_id
```
pre-matched for natural results
Fallback: match gender manually — If no default voice, ensure avatar and voice genders match
Use test mode for development — Set
```
test: true
```
to avoid consuming credits (output will be watermarked)
Set generous timeouts — Video generation often takes 5-15 minutes, sometimes longer
Validate inputs — Check avatar and voice IDs exist before generating

生成前预览Avatar — 下载
```
preview_image_url
```
，让用户在确认生成前查看Avatar效果
使用Avatar的默认语音 — 大多数Avatar都有预先匹配的
```
default_voice_id
```
，可获得更自然的效果
备选方案：手动匹配性别 — 若没有默认语音，确保Avatar与语音的性别匹配
开发时使用测试模式 — 设置
```
test: true
```
以避免消耗积分（输出视频将带有水印）
设置充足的超时时间 — 视频生成通常需要5-15分钟，有时会更长
验证输入信息 — 生成前检查Avatar和语音ID是否存在