Generate talking head avatar videos with Pruna P-Video-Avatar via the inference.sh CLI. Turn a portrait image into a realistic speaking video with built-in TTS. Up to 18x faster and up to 6x cheaper than the competing models compared below. Models: P-Video-Avatar, P-Image (for portrait generation). Capabilities: text-to-avatar, audio-driven avatars, 30 voices, 10 languages, 720p/1080p, built-in TTS, dynamic backgrounds, full-body control. Use for: AI presenters, product demos, explainer videos, virtual influencers, marketing, education, multilingual content, UGC, gaming avatars. Triggers: avatar video, talking head, ai avatar, p-video-avatar, pruna avatar, video avatar, ai presenter, digital human, virtual presenter, lipsync, talking avatar, ai spokesperson, heygen alternative, synthesia alternative, veed alternative, fabric alternative, omnihuman alternative
npx skill4agent add infsh-skills/skills p-video-avatar

Requires the inference.sh CLI (belt); see its install instructions.

belt login
# Generate avatar from portrait + text script
belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"voice_script": "Hello, welcome to our product demo!",
"voice": "Zephyr (Female)"
}'

# 1. Generate a portrait image with P-Image
belt app run pruna/p-image --input '{
"prompt": "professional headshot portrait of a young woman, neutral background, looking at camera, studio lighting, photorealistic",
"aspect_ratio": "9:16"
}'
# 2. Use the generated image URL to create the avatar video
belt app run pruna/p-video-avatar --input '{
"image": "<image-url-from-step-1>",
"voice_script": "Hi there! Let me walk you through our latest features.",
"voice": "Zephyr (Female)",
"resolution": "720p"
}'

belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"voice_script": "Welcome to our product walkthrough. Today I will show you three key features.",
"voice": "Puck (Male)",
"voice_language": "English (US)",
"resolution": "720p"
}'

belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"audio": "https://speech.mp3"
}'

When audio is provided, it overrides voice_script.

belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"voice_script": "This is exciting news for our community!",
"voice": "Aoede (Female)",
"voice_prompt": "Enthusiastic and energetic tone, slightly faster pace",
"video_prompt": "The person is presenting on stage with dramatic lighting",
"resolution": "1080p"
}'

# Spanish
belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"voice_script": "Bienvenidos a nuestra demostración de producto.",
"voice": "Kore (Female)",
"voice_language": "Spanish"
}'
# Japanese
belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"voice_script": "こんにちは、製品デモへようこそ。",
"voice": "Leda (Female)",
"voice_language": "Japanese"
}'

belt app run pruna/p-video-avatar --input '{
"image": "https://portrait.jpg",
"voice_script": "Consistent results every time.",
"seed": 42
}'

| Parameter | Type | Default | Description |
|---|---|---|---|
| `image` | file | required | Portrait image (jpg, jpeg, png, webp) |
| `voice_script` | string | - | Text for the person to say |
| `audio` | file | - | Audio file (overrides voice_script) |
| `voice` | string | "Zephyr (Female)" | Voice selection |
| `voice_language` | string | "English (US)" | Output language |
| `resolution` | string | "720p" | 720p or 1080p |
| `video_prompt` | string | "The person is talking." | Control avatar behavior and background |
| `voice_prompt` | string | "Say the following." | Control tone, pacing, emotion |
| `seed` | int | random | Reproducible generation |
| | bool | true | Disable content filter |
| | bool | false | Skip prompt enhancement |

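The constraints in the parameter table above can be checked locally before a job is submitted. The following is a minimal sketch: `check_avatar_input` is a hypothetical helper, not part of the belt CLI, and the rule that at least one of `voice_script` or `audio` must be set is an assumption inferred from the examples rather than something the table states.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check (not part of the belt CLI).
# Usage: check_avatar_input <image> <voice_script> <audio> [resolution]
check_avatar_input() {
  local image="$1"
  local voice_script="$2"
  local audio="$3"
  local resolution="${4:-720p}"   # table default

  # image is the only parameter marked as required
  if [ -z "$image" ]; then
    echo "image is required" >&2
    return 1
  fi

  # Assumption: the avatar needs at least one speech source
  if [ -z "$voice_script" ] && [ -z "$audio" ]; then
    echo "provide voice_script or audio" >&2
    return 1
  fi

  # The table lists exactly two resolutions
  case "$resolution" in
    720p|1080p) ;;
    *) echo "resolution must be 720p or 1080p" >&2; return 1 ;;
  esac

  # Per the table, audio overrides voice_script when both are given
  if [ -n "$voice_script" ] && [ -n "$audio" ]; then
    echo "note: audio overrides voice_script" >&2
  fi

  echo "ok"
}

check_avatar_input "https://portrait.jpg" "Hello, welcome to our product demo!" "" 720p
```

The function only inspects its arguments and never contacts the service, so it is safe to run in CI before calling `belt app run`.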
| Resolution | Price |
|---|---|
| 720p | $0.025 per second of output video |
| 1080p | $0.045 per second of output video |
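
Since pricing is per second of output video, the cost of a clip is its duration multiplied by the rate for its resolution. A small sketch of that arithmetic follows; `estimate_cost` is a hypothetical helper name, and the rates are copied from the table above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: estimate the cost in USD of one clip.
# Usage: estimate_cost <resolution> <seconds>
estimate_cost() {
  local resolution="$1"
  local seconds="$2"
  local rate

  # Per-second rates taken from the pricing table
  case "$resolution" in
    720p)  rate=0.025 ;;
    1080p) rate=0.045 ;;
    *) echo "unknown resolution: $resolution" >&2; return 1 ;;
  esac

  awk -v r="$rate" -v s="$seconds" 'BEGIN { printf "%.3f\n", r * s }'
}

estimate_cost 720p 30    # prints 0.750
estimate_cost 1080p 60   # prints 2.700
```

A 30-second clip at 720p therefore costs about $0.75, and a one-minute clip at 1080p about $2.70.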

| Feature | P-Video-Avatar | Fabric 1.0 | OmniHuman 1.5 | HeyGen Avatar 4 |
|---|---|---|---|---|
| Generation time (per second of output video) | ~1.83s | ~34s (18x slower) | ~28s (15x slower) | ~26s (14x slower) |
| Cost per second | $0.025 | $0.14 (5.6x more) | $0.16 (6.4x more) | $0.075 (3x more) |
| Built-in TTS | Yes | Yes | No | Yes |
| Dynamic Background | Yes | Yes | No | Yes |
| 1080p Support | Yes | No | No | Yes |
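
The speed row above can be turned into a rough wait-time estimate: multiply the clip length by the seconds of generation needed per second of video. The sketch below assumes the approximate figures from the table hold for a given job, which may not be the case in practice; `estimate_generation_time` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: rough generation time in seconds.
# Usage: estimate_generation_time <video_seconds> [seconds_per_video_second]
# The default of 1.83 is the P-Video-Avatar figure from the comparison table.
estimate_generation_time() {
  local video_seconds="$1"
  local per_second="${2:-1.83}"
  awk -v v="$video_seconds" -v r="$per_second" 'BEGIN { printf "%.1f\n", v * r }'
}

estimate_generation_time 60      # P-Video-Avatar: prints 109.8
estimate_generation_time 60 34   # Fabric 1.0 figure: prints 2040.0
```

By these figures a one-minute video takes roughly two minutes to generate with P-Video-Avatar, compared with about 34 minutes at the Fabric 1.0 rate.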
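
Use video_prompt to control avatar behavior and background, and voice_prompt to control tone, pacing, and emotion. To create a source portrait, use pruna/p-image, for example with a 9:16 aspect ratio.

# Generate portrait images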
belt app run pruna/p-image --input '{"prompt": "professional headshot portrait"}'
# General video generation
belt app run pruna/p-video --input '{"prompt": "cinematic scene"}'
# Image editing
belt app run pruna/p-image-edit --input '{"prompt": "change background", "image": "https://photo.jpg"}'

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# Pruna video generation
npx skills add inference-sh/skills@p-video
# Pruna image generation
npx skills add inference-sh/skills@p-image
# All video generation models
npx skills add inference-sh/skills@ai-video-generation
# Image generation (for creating portraits)
npx skills add inference-sh/skills@ai-image-generation

belt app list --namespace pruna