aliyun-wan-i2v
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseWan 2.7 Image-to-Video
Wan 2.7 图生视频
Validation
验证
bash
mkdir -p output/aliyun-wan-i2v
python -m py_compile skills/ai/video/aliyun-wan-i2v/scripts/generate_i2v.py && echo "py_compile_ok" > output/aliyun-wan-i2v/validate.txtPass criteria: command exits 0 and is generated.
output/aliyun-wan-i2v/validate.txtbash
mkdir -p output/aliyun-wan-i2v
python -m py_compile skills/ai/video/aliyun-wan-i2v/scripts/generate_i2v.py && echo "py_compile_ok" > output/aliyun-wan-i2v/validate.txt通过标准:命令执行退出码为0,且生成文件。
output/aliyun-wan-i2v/validate.txtOutput And Evidence
输出与证据
- Save task IDs, polling responses, and final video URLs to .
output/aliyun-wan-i2v/ - Keep at least one end-to-end run log for troubleshooting.
- 将任务ID、轮询响应和最终视频URL保存至目录。
output/aliyun-wan-i2v/ - 保留至少一份端到端运行日志用于故障排查。
Prerequisites
前置条件
- Install SDK (recommended in a venv):
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope- Set in your environment, or add
DASHSCOPE_API_KEYtodashscope_api_key.~/.alibabacloud/credentials
- 安装SDK(建议在虚拟环境中进行):
bash
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope- 在环境变量中设置,或在
DASHSCOPE_API_KEY中添加~/.alibabacloud/credentials。dashscope_api_key
Critical model names
关键模型名称
- — supports first-frame, first+last frame, video continuation, and audio-driven generation
wan2.7-i2v
- — 支持首帧生成、首尾帧插值、视频续播和音频驱动生成
wan2.7-i2v
Capabilities
功能特性
| Capability | Description | Required media types |
|---|---|---|
| First-frame video | Generate video from a single image | |
| First+last frame | Interpolate video between two images | |
| Video continuation | Extend an existing video clip | |
| Audio-driven | Drive video with audio (lip-sync, rhythm) | |
| 功能 | 描述 | 所需媒体类型 |
|---|---|---|
| 首帧视频生成 | 从单张图像生成视频 | |
| 首尾帧插值 | 在两张图像之间生成过渡视频 | |
| 视频续播 | 扩展现有视频片段 | |
| 音频驱动 | 用音频驱动视频生成(唇形同步、节奏匹配) | |
API endpoint (async only)
API端点(仅支持异步)
POST https://dashscope.aliyuncs.com/api/v1/services/aigc/video-generation/video-synthesisRequired headers:
Authorization: Bearer $DASHSCOPE_API_KEYContent-Type: application/jsonX-DashScope-Async: enable
Singapore endpoint: replace with .
dashscope.aliyuncs.comdashscope-intl.aliyuncs.comPOST https://dashscope.aliyuncs.com/api/v1/services/aigc/video-generation/video-synthesis必填请求头:
Authorization: Bearer $DASHSCOPE_API_KEYContent-Type: application/jsonX-DashScope-Async: enable
新加坡区域端点:将替换为。
dashscope.aliyuncs.comdashscope-intl.aliyuncs.comNormalized interface
标准化接口
Request
请求参数
- (string, optional) — up to 5000 characters, describes desired video content
prompt - (string, optional) — up to 500 characters
negative_prompt - (array, required) — media objects with
mediaandtypefields:url- :
type|first_frame|last_frame|driving_audiofirst_clip - : public URL (HTTP/HTTPS) or OSS temporary URL
url
- (string, optional) —
resolutionor720P(default:1080P)1080P - (integer, optional) — video length in seconds, range [2, 15] (default: 5)
duration - (boolean, optional) — AI prompt rewriting (default: true)
prompt_extend - (boolean, optional) — add "AI generated" watermark (default: false)
watermark - (integer, optional) — range [0, 2147483647]
seed
- (字符串,可选)—— 最多5000字符,描述期望的视频内容
prompt - (字符串,可选)—— 最多500字符
negative_prompt - (数组,必填)—— 包含
media和type字段的媒体对象:url- :
type|first_frame|last_frame|driving_audiofirst_clip - : 公共URL(HTTP/HTTPS)或OSS临时URL
url
- (字符串,可选)——
resolution或720P(默认值:1080P)1080P - (整数,可选)—— 视频时长(秒),范围[2, 15](默认值:5)
duration - (布尔值,可选)—— AI提示词重写(默认值:true)
prompt_extend - (布尔值,可选)—— 添加"AI生成"水印(默认值:false)
watermark - (整数,可选)—— 范围[0, 2147483647]
seed
Media input limits
媒体输入限制
Images (first_frame, last_frame):
- Formats: JPEG, JPG, PNG (no transparency), BMP, WEBP
- Resolution: [240, 8000] pixels per side
- Aspect ratio: 1:8 to 8:1
- Max size: 20MB
Audio (driving_audio):
- Formats: wav, mp3
- Duration: 2-30s
- Max size: 15MB
- Auto-truncated to value if longer
duration
Video (first_clip):
- Formats: mp4, mov
- Duration: 2-10s
- Resolution: [240, 4096] pixels per side
- Aspect ratio: 1:8 to 8:1
- Max size: 100MB
图像(first_frame、last_frame):
- 格式:JPEG、JPG、PNG(无透明通道)、BMP、WEBP
- 分辨率:每边[240, 8000]像素
- 宽高比:1:8至8:1
- 最大大小:20MB
音频(driving_audio):
- 格式:wav、mp3
- 时长:2-30秒
- 最大大小:15MB
- 如果时长超过值,将自动截断
duration
视频(first_clip):
- 格式:mp4、mov
- 时长:2-10秒
- 分辨率:每边[240, 4096]像素
- 宽高比:1:8至8:1
- 最大大小:100MB
Response (task creation)
响应(任务创建)
- (string) — use for polling, valid 24 hours
output.task_id - (string) — PENDING | RUNNING | SUCCEEDED | FAILED | CANCELED
output.task_status - (string)
request_id
- (字符串)—— 用于轮询,有效期24小时
output.task_id - (字符串)—— PENDING | RUNNING | SUCCEEDED | FAILED | CANCELED
output.task_status - (字符串)
request_id
Response (task result)
响应(任务结果)
- (string) — generated video URL
output.video_url - (string) — original prompt
output.orig_prompt - (string) — rewritten prompt (if prompt_extend enabled)
output.actual_prompt - (integer)
usage.video_count - (integer) — duration in seconds
usage.video_duration
- (字符串)—— 生成的视频URL
output.video_url - (字符串)—— 原始提示词
output.orig_prompt - (字符串)—— 重写后的提示词(若开启prompt_extend)
output.actual_prompt - (整数)
usage.video_count - (整数)—— 视频时长(秒)
usage.video_duration
Quick start (Python + HTTP)
快速开始(Python + HTTP)
python
import os
import json
import time
import requests
API_KEY = os.getenv("DASHSCOPE_API_KEY")
BASE_URL = "https://dashscope.aliyuncs.com/api/v1"
def create_i2v_task(req: dict) -> str:
"""Create an image-to-video task and return task_id."""
payload = {
"model": "wan2.7-i2v",
"input": {
"prompt": req.get("prompt", ""),
"media": req["media"],
},
"parameters": {
"resolution": req.get("resolution", "1080P"),
"duration": req.get("duration", 5),
"prompt_extend": req.get("prompt_extend", True),
"watermark": req.get("watermark", False),
},
}
if req.get("negative_prompt"):
payload["input"]["negative_prompt"] = req["negative_prompt"]
if req.get("seed") is not None:
payload["parameters"]["seed"] = req["seed"]
resp = requests.post(
f"{BASE_URL}/services/aigc/video-generation/video-synthesis",
headers={
"Authorization": f"Bearer {API_KEY}",
"Content-Type": "application/json",
"X-DashScope-Async": "enable",
},
json=payload,
)
resp.raise_for_status()
data = resp.json()
return data["output"]["task_id"]
def poll_task(task_id: str, interval: int = 15) -> dict:
"""Poll until task completes. Returns final response."""
while True:
resp = requests.get(
f"{BASE_URL}/tasks/{task_id}",
headers={"Authorization": f"Bearer {API_KEY}"},
)
resp.raise_for_status()
data = resp.json()
status = data["output"]["task_status"]
if status in ("SUCCEEDED", "FAILED", "CANCELED"):
return data
time.sleep(interval)python
import os
import json
import time
import requests
API_KEY = os.getenv("DASHSCOPE_API_KEY")
BASE_URL = "https://dashscope.aliyuncs.com/api/v1"
def create_i2v_task(req: dict) -> str:
"""创建图生视频任务并返回task_id。"""
payload = {
"model": "wan2.7-i2v",
"input": {
"prompt": req.get("prompt", ""),
"media": req["media"],
},
"parameters": {
"resolution": req.get("resolution", "1080P"),
"duration": req.get("duration", 5),
"prompt_extend": req.get("prompt_extend", True),
"watermark": req.get("watermark", False),
},
}
if req.get("negative_prompt"):
payload["input"]["negative_prompt"] = req["negative_prompt"]
if req.get("seed") is not None:
payload["parameters"]["seed"] = req["seed"]
resp = requests.post(
f"{BASE_URL}/services/aigc/video-generation/video-synthesis",
headers={
"Authorization": f"Bearer {API_KEY}",
"Content-Type": "application/json",
"X-DashScope-Async": "enable",
},
json=payload,
)
resp.raise_for_status()
data = resp.json()
return data["output"]["task_id"]
def poll_task(task_id: str, interval: int = 15) -> dict:
"""轮询直到任务完成,返回最终响应。"""
while True:
resp = requests.get(
f"{BASE_URL}/tasks/{task_id}",
headers={"Authorization": f"Bearer {API_KEY}"},
)
resp.raise_for_status()
data = resp.json()
status = data["output"]["task_status"]
if status in ("SUCCEEDED", "FAILED", "CANCELED"):
return data
time.sleep(interval)Media combination examples
媒体组合示例
python
undefinedpython
undefinedFirst-frame only
仅使用首帧
media = [{"type": "first_frame", "url": "https://example.com/image.jpg"}]
media = [{"type": "first_frame", "url": "https://example.com/image.jpg"}]
First + last frame interpolation
首尾帧插值
media = [
{"type": "first_frame", "url": "https://example.com/start.jpg"},
{"type": "last_frame", "url": "https://example.com/end.jpg"},
]
media = [
{"type": "first_frame", "url": "https://example.com/start.jpg"},
{"type": "last_frame", "url": "https://example.com/end.jpg"},
]
Audio-driven from first frame
基于首帧的音频驱动
media = [
{"type": "first_frame", "url": "https://example.com/face.jpg"},
{"type": "driving_audio", "url": "https://example.com/speech.mp3"},
]
media = [
{"type": "first_frame", "url": "https://example.com/face.jpg"},
{"type": "driving_audio", "url": "https://example.com/speech.mp3"},
]
Video continuation
视频续播
media = [{"type": "first_clip", "url": "https://example.com/clip.mp4"}]
undefinedmedia = [{"type": "first_clip", "url": "https://example.com/clip.mp4"}]
undefinedError handling
错误处理
| Error | Likely cause | Action |
|---|---|---|
| 401/403 | Missing or invalid | Check env var or credentials file |
400 | Unsupported resolution, bad duration, missing media | Validate parameters |
| "does not support synchronous calls" | Missing | Add required header |
| 429 | Rate limit or quota | Retry with backoff |
| 错误 | 可能原因 | 处理措施 |
|---|---|---|
| 401/403 | | 检查环境变量或凭据文件 |
400 | 不支持的分辨率、无效时长、缺失媒体 | 验证参数合法性 |
| "does not support synchronous calls" | 缺少 | 添加必填请求头 |
| 429 | 速率限制或配额不足 | 退避重试 |
Output location
输出位置
- Default output:
output/aliyun-wan-i2v/videos/ - Override base dir with .
OUTPUT_DIR
- 默认输出目录:
output/aliyun-wan-i2v/videos/ - 可通过环境变量覆盖基础目录。
OUTPUT_DIR
Anti-patterns
反模式
- Do not use model names other than .
wan2.7-i2v - Do not call this API synchronously — async header is required.
- Do not pass duplicate media types (e.g., two entries).
first_frame - Video URLs expire after 24 hours; download and persist immediately.
- Do not use this API for video editing — use instead.
aliyun-wan-videoedit
- 请勿使用以外的模型名称。
wan2.7-i2v - 请勿同步调用此API —— 必须添加异步请求头。
- 请勿传入重复的媒体类型(例如两个条目)。
first_frame - 视频URL有效期为24小时;请立即下载并持久化保存。
- 请勿使用此API进行视频编辑 —— 请使用。
aliyun-wan-videoedit
Workflow
工作流程
- Confirm user intent: first-frame, first+last frame, video continuation, or audio-driven.
- Prepare media array with correct types and valid URLs.
- Create async task and poll for results.
- Download and save generated video before URL expiration.
- 确认用户需求:首帧生成、首尾帧插值、视频续播或音频驱动。
- 准备包含正确类型和有效URL的媒体数组。
- 创建异步任务并轮询结果。
- 在URL过期前下载并保存生成的视频。
References
参考资料
- See for full HTTP API details.
references/api_reference.md - See for source links.
references/sources.md
- 完整HTTP API详情请查看。
references/api_reference.md - 源码链接请查看。
references/sources.md