p-video-avatar

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Pruna P-Video-Avatar

Pruna P-Video-Avatar

Generate talking head avatar videos from a single portrait image via inference.sh CLI.
P-Video-Avatar is the fastest and most cost-effective avatar video model available. Quality on par with Veo 3.0, 18x faster and 6x cheaper than alternatives like Fabric, OmniHuman, and HeyGen.
通过inference.sh CLI,仅用一张肖像图片即可生成说话头像视频。
P-Video-Avatar是目前速度最快、性价比最高的头像视频生成模型。画质与Veo 3.0相当,比Fabric、OmniHuman和HeyGen等竞品快18倍,成本低6倍。

Quick Start

快速开始

Requires inference.sh CLI (
belt
). Install instructions
bash
belt login
需要inference.sh CLI(
belt
)。安装说明
bash
belt login

Generate avatar from portrait + text script

从肖像图片+文本脚本生成头像视频

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Hello, welcome to our product demo!", "voice": "Zephyr (Female)" }'
undefined
belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Hello, welcome to our product demo!", "voice": "Zephyr (Female)" }'
undefined

Full Workflow: Generate Portrait + Avatar Video

完整工作流:生成肖像图片+头像视频

Use Pruna P-Image to generate the portrait, then P-Video-Avatar to animate it:
bash
undefined
使用Pruna P-Image生成肖像图片,再用P-Video-Avatar使其动起来:
bash
undefined

1. Generate a portrait image with P-Image

1. 使用P-Image生成肖像图片

belt app run pruna/p-image --input '{ "prompt": "professional headshot portrait of a young woman, neutral background, looking at camera, studio lighting, photorealistic", "aspect_ratio": "9:16" }'
belt app run pruna/p-image --input '{ "prompt": "professional headshot portrait of a young woman, neutral background, looking at camera, studio lighting, photorealistic", "aspect_ratio": "9:16" }'

2. Use the generated image URL to create the avatar video

2. 使用第一步生成的图片URL创建头像视频

belt app run pruna/p-video-avatar --input '{ "image": "<image-url-from-step-1>", "voice_script": "Hi there! Let me walk you through our latest features.", "voice": "Zephyr (Female)", "resolution": "720p" }'
undefined
belt app run pruna/p-video-avatar --input '{ "image": "<image-url-from-step-1>", "voice_script": "Hi there! Let me walk you through our latest features.", "voice": "Zephyr (Female)", "resolution": "720p" }'
undefined

Examples

示例

Text Script with Voice Selection

选择音色的文本脚本

bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "voice_script": "Welcome to our product walkthrough. Today I will show you three key features.",
  "voice": "Puck (Male)",
  "voice_language": "English (US)",
  "resolution": "720p"
}'
bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "voice_script": "Welcome to our product walkthrough. Today I will show you three key features.",
  "voice": "Puck (Male)",
  "voice_language": "English (US)",
  "resolution": "720p"
}'

Audio-Driven Avatar

音频驱动头像

Provide your own audio file instead of using built-in TTS:
bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "audio": "https://speech.mp3"
}'
When both
audio
and
voice_script
are provided, audio takes priority.
提供自定义音频文件,替代内置TTS:
bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "audio": "https://speech.mp3"
}'
当同时提供
audio
voice_script
时,音频文件优先级更高。

1080p with Custom Style

1080p分辨率+自定义风格

bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "voice_script": "This is exciting news for our community!",
  "voice": "Aoede (Female)",
  "voice_prompt": "Enthusiastic and energetic tone, slightly faster pace",
  "video_prompt": "The person is presenting on stage with dramatic lighting",
  "resolution": "1080p"
}'
bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "voice_script": "This is exciting news for our community!",
  "voice": "Aoede (Female)",
  "voice_prompt": "Enthusiastic and energetic tone, slightly faster pace",
  "video_prompt": "The person is presenting on stage with dramatic lighting",
  "resolution": "1080p"
}'

Multilingual Content

多语言内容

bash
undefined
bash
undefined

Spanish

西班牙语

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Bienvenidos a nuestra demostración de producto.", "voice": "Kore (Female)", "voice_language": "Spanish" }'
belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "Bienvenidos a nuestra demostración de producto.", "voice": "Kore (Female)", "voice_language": "Spanish" }'

Japanese

日语

belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "こんにちは、製品デモへようこそ。", "voice": "Leda (Female)", "voice_language": "Japanese" }'
undefined
belt app run pruna/p-video-avatar --input '{ "image": "https://portrait.jpg", "voice_script": "こんにちは、製品デモへようこそ。", "voice": "Leda (Female)", "voice_language": "Japanese" }'
undefined

Reproducible Generation

可复现生成

bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "voice_script": "Consistent results every time.",
  "seed": 42
}'
bash
belt app run pruna/p-video-avatar --input '{
  "image": "https://portrait.jpg",
  "voice_script": "Consistent results every time.",
  "seed": 42
}'

Available Voices

可用音色

Female: Zephyr, Kore, Leda, Aoede, Callirrhoe, Autonoe, Despina, Erinome, Laomedeia, Achernar, Gacrux, Pulcherrima, Vindemiatrix, Sulafat
Male: Puck, Charon, Fenrir, Orus, Enceladus, Iapetus, Umbriel, Algenib, Algieba, Schedar, Achird, Zubenelgenubi, Sadachbia, Sadaltager, Alnilam, Rasalgethi
女性音色: Zephyr, Kore, Leda, Aoede, Callirrhoe, Autonoe, Despina, Erinome, Laomedeia, Achernar, Gacrux, Pulcherrima, Vindemiatrix, Sulafat
男性音色: Puck, Charon, Fenrir, Orus, Enceladus, Iapetus, Umbriel, Algenib, Algieba, Schedar, Achird, Zubenelgenubi, Sadachbia, Sadaltager, Alnilam, Rasalgethi

Supported Languages

支持语言

English (US), English (UK), Spanish, French, German, Italian, Portuguese (Brazil), Japanese, Korean, Hindi
英语(美国)、英语(英国)、西班牙语、法语、德语、意大利语、葡萄牙语(巴西)、日语、韩语、印地语

Parameters

参数说明

ParameterTypeDefaultDescription
image
filerequiredPortrait image (jpg, jpeg, png, webp)
voice_script
string-Text for the person to say
audio
file-Audio file (overrides voice_script)
voice
string"Zephyr (Female)"Voice selection
voice_language
string"English (US)"Output language
resolution
string"720p"720p or 1080p
video_prompt
string"The person is talking."Control avatar behavior and background
voice_prompt
string"Say the following."Control tone, pacing, emotion
seed
intrandomReproducible generation
disable_safety_filter
booltrueDisable content filter
disable_prompt_upsampling
boolfalseSkip prompt enhancement
参数类型默认值描述
image
文件必填肖像图片(jpg、jpeg、png、webp格式)
voice_script
字符串-头像需要说的文本内容
audio
文件-音频文件(优先级高于voice_script)
voice
字符串"Zephyr (Female)"音色选择
voice_language
字符串"English (US)"输出语言
resolution
字符串"720p"分辨率可选720p或1080p
video_prompt
字符串"The person is talking."控制头像动作与背景
voice_prompt
字符串"Say the following."控制语气、语速、情绪
seed
整数随机值用于生成可复现的结果
disable_safety_filter
布尔值true禁用内容过滤
disable_prompt_upsampling
布尔值false跳过提示词优化

Pricing

定价

ResolutionPrice
720p$0.025 per second of output video
1080p$0.045 per second of output video
Example: 30-second 720p video = $0.75
分辨率价格
720p每秒钟输出视频0.025美元
1080p每秒钟输出视频0.045美元
示例:30秒720p视频 = 0.75美元

Free Launch Weekend

免费启动周末

P-Video-Avatar is completely free from Thursday May 1, 2026 4:00 PM CET through Sunday May 4, 2026 11:59 PM CET. All costs are on us during this window — no billing, no limits on resolution.
P-Video-Avatar在2026年5月1日星期四欧洲中部时间16:00至2026年5月4日星期日欧洲中部时间23:59期间完全免费。 在此期间所有费用由我们承担——无需计费,分辨率无限制。

Competitive Advantage

竞争优势

FeatureP-Video-AvatarFabric 1.0OmniHuman 1.5HeyGen Avatar 4
Speed (per sec of video)~1.83s/s~34s/s (18x slower)~28s/s (15x slower)~26s/s (14x slower)
Cost per second$0.025$0.14 (5.6x more)$0.16 (6.4x more)$0.075 (3x more)
Built-in TTSYesYesNoYes
Dynamic BackgroundYesYesNoYes
1080p SupportYesNoNoYes
特性P-Video-AvatarFabric 1.0OmniHuman 1.5HeyGen Avatar 4
速度(每生成1秒视频所需时间)~1.83秒/秒~34秒/秒(慢18倍)~28秒/秒(慢15倍)~26秒/秒(慢14倍)
每秒成本0.025美元0.14美元(贵5.6倍)0.16美元(贵6.4倍)0.075美元(贵3倍)
内置TTS
动态背景
支持1080p

Use Cases

适用场景

  • Marketing: Product demos, UGC-style ads with AI presenters
  • Education: Course videos, explainers, tutoring content
  • Localization: Dub content across 10 languages from one image
  • Social Media: Consistent virtual influencer content
  • Corporate: Training, onboarding, announcements
  • Gaming: Character avatars, NPC dialogue videos
  • Customer Support: Personalized video responses
  • 营销领域:产品演示、AI主持人风格的UGC广告
  • 教育领域:课程视频、讲解内容、辅导素材
  • 本地化:通过一张图片将内容配音成10种语言
  • 社交媒体:持续产出虚拟网红内容
  • 企业场景:培训、入职引导、公告
  • 游戏领域:角色头像、NPC对话视频
  • 客户支持:个性化视频回复

Tips

使用技巧

  • Use high-quality portrait photos (front-facing, good lighting)
  • The output video aspect ratio matches the input image
  • Use
    video_prompt
    to control dynamic backgrounds and body language
  • Use
    voice_prompt
    to control speaking style, emotion, and pacing
  • Keep videos under 3 minutes for best visual consistency
  • Generate portraits with
    pruna/p-image
    using aspect ratio
    9:16
    for vertical avatar videos
  • 使用高质量肖像照片(正面朝向、光线良好)
  • 输出视频的宽高比与输入图片一致
  • 使用
    video_prompt
    控制动态背景和肢体语言
  • 使用
    voice_prompt
    控制说话风格、情绪和语速
  • 视频时长控制在3分钟以内以获得最佳视觉一致性
  • 使用
    pruna/p-image
    生成宽高比为9:16的肖像图片,用于竖版头像视频

Related Pruna Models

相关Pruna模型

bash
undefined
bash
undefined

Generate portrait images

生成肖像图片

belt app run pruna/p-image --input '{"prompt": "professional headshot portrait"}'
belt app run pruna/p-image --input '{"prompt": "professional headshot portrait"}'

General video generation

通用视频生成

belt app run pruna/p-video --input '{"prompt": "cinematic scene"}'
belt app run pruna/p-video --input '{"prompt": "cinematic scene"}'

Image editing

图片编辑

belt app run pruna/p-image-edit --input '{"prompt": "change background", "image": "https://photo.jpg"}'
undefined
belt app run pruna/p-image-edit --input '{"prompt": "change background", "image": "https://photo.jpg"}'
undefined

Related Skills

相关技能

bash
undefined
bash
undefined

Full platform skill (all 250+ apps)

全平台技能(包含250+应用)

npx skills add inference-sh/skills@infsh-cli
npx skills add inference-sh/skills@infsh-cli

Pruna video generation

Pruna视频生成

npx skills add inference-sh/skills@p-video
npx skills add inference-sh/skills@p-video

Pruna image generation

Pruna图片生成

npx skills add inference-sh/skills@p-image
npx skills add inference-sh/skills@p-image

All video generation models

所有视频生成模型

npx skills add inference-sh/skills@ai-video-generation
npx skills add inference-sh/skills@ai-video-generation

Image generation (for creating portraits)

图片生成(用于创建肖像)

npx skills add inference-sh/skills@ai-image-generation

Browse all Pruna apps: `belt app list --namespace pruna`
npx skills add inference-sh/skills@ai-image-generation

浏览所有Pruna应用:`belt app list --namespace pruna`

Documentation

文档