Asset preprocessing for HyperFrames compositions — text-to-speech narration (Kokoro), audio/video transcription (Whisper), and background removal for transparent overlays (u2net). Use when generating voiceover from text, transcribing speech for captions, removing the background from a video or image to use as a transparent overlay, choosing a TTS voice or whisper model, or chaining these (TTS → transcribe → captions). Each command downloads its own model on first run.
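npx skill4agent add heygen-com/hyperframes hyperframes-media

Three commands are covered: `tts`, `transcribe`, and `remove-background`. Downloaded models are cached under `~/.cache/hyperframes/`.

`hyperframes tts` generates narration from text:

npx hyperframes tts "Text here" --voice af_nova --output narration.wav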
npx hyperframes tts script.txt --voice bf_emma --output narration.wav
npx hyperframes tts --list # all 54 voices

The default voice is `af_heart`.

| Content type | Voice | Why |
|---|---|---|
| Product demo | | Warm, professional |
| Tutorial / how-to | | Neutral, easy to follow |
| Marketing / promo | | Energetic or authoritative |
| Documentation | | Clear British English, formal |
| Casual / social | | Approachable, natural |
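When narration is generated from a script rather than typed by hand, the `tts` invocations can be assembled programmatically. The sketch below only builds the argument list and does not run anything. The prefix table is an assumption based on Kokoro's voice-naming convention (first letter of the voice ID encodes the language), and `lang_for_voice` and `tts_command` are hypothetical helper names, not part of the CLI.

```python
from __future__ import annotations

# Assumed mapping from the first letter of a Kokoro voice ID to a --lang code.
PREFIX_TO_LANG = {
    "a": "en-us", "b": "en-gb", "e": "es", "f": "fr-fr", "h": "hi",
    "i": "it", "j": "ja", "p": "pt-br", "z": "zh",
}


def lang_for_voice(voice: str) -> str:
    """Return the language code implied by a voice ID such as 'ef_dora'."""
    try:
        return PREFIX_TO_LANG[voice[0]]
    except (IndexError, KeyError):
        raise ValueError(f"unrecognized voice ID: {voice!r}") from None


def tts_command(text: str, voice: str, output: str,
                lang: str | None = None) -> list[str]:
    """Build (but do not run) the argv for one narration clip."""
    lang_for_voice(voice)  # fail early on a voice ID with an unknown prefix
    argv = ["npx", "hyperframes", "tts", text,
            "--voice", voice, "--output", output]
    if lang is not None:
        argv += ["--lang", lang]  # only passed when the caller asks for it
    return argv
```

The returned list can be handed to `subprocess.run(argv, check=True)` once the command looks right.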
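The first letter of a voice ID identifies its language: `a`, `b`, `e`, `f`, `h`, `i`, `j`, `p`, `z`. `--lang` sets the language explicitly.

npx hyperframes tts "La reunión empieza a las nueve" --voice ef_dora --output es.wav
npx hyperframes tts "今日はいい天気ですね" --voice jf_alpha --output ja.wav

Accepted `--lang` values: `en-us`, `en-gb`, `es`, `fr-fr`, `hi`, `it`, `pt-br`, `ja`, `zh`.

Requires `espeak-ng`:

brew install espeak-ng
apt-get install espeak-ng

Speed values: `0.7-0.8`, `1.0`, `1.1-1.2`, `1.5+`.

Text can also be read from a `.txt` file.

Python dependencies: `kokoro-onnx` and `soundfile`.

pip install kokoro-onnx soundfile

Models are cached in `~/.cache/hyperframes/tts/`.

`transcribe` writes `transcript.json`:

npx hyperframes transcribe audio.mp3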
npx hyperframes transcribe video.mp4 --model small --language es
npx hyperframes transcribe subtitles.srt # import existing
npx hyperframes transcribe subtitles.vtt
npx hyperframes transcribe openai-response.json

Models ending in `.en` (`small.en`, `medium.en`) are English-only. For other languages use `--model small --language <code>`, not a `.en` model such as `--model small.en`. When `--language` names a non-English language, choose `small` instead of `small.en`.

| Model | Size | Speed | When to use |
|---|---|---|---|
| | 75 MB | Fastest | Quick previews, testing pipeline |
| | 142 MB | Fast | Short clips, clear audio |
| | 466 MB | Moderate | Default; most content |
| | 1.5 GB | Slow | Important content, noisy audio, music |
| | 3.1 GB | Slowest | Production quality |
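Whisper's `.en` checkpoints are English-only, so pairing one with a non-English `--language` is an easy mistake to make in a script. A minimal sketch of a guard, assuming the `--model` and `--language` flags shown in the commands above; `transcribe_command` is a hypothetical helper, and its default of `small` mirrors the example command rather than any documented CLI default.

```python
from __future__ import annotations


def transcribe_command(media: str, language: str | None = None,
                       model: str = "small") -> list[str]:
    """Build (but do not run) the argv for a transcription job."""
    # English-only checkpoints cannot transcribe other languages faithfully.
    if model.endswith(".en") and language not in (None, "en"):
        raise ValueError(
            f"{model} is English-only; use a multilingual model "
            f"with --language {language}"
        )
    argv = ["npx", "hyperframes", "transcribe", media, "--model", model]
    if language is not None:
        argv += ["--language", language]
    return argv
```

Raising instead of silently swapping the model keeps the caller aware of which checkpoint will be downloaded.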
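For important content, noisy audio, or music, step up to `medium`.

Each word in the transcript has an `id` (`w0`, `w1`, ...):

[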
{ "id": "w0", "text": "Hello", "start": 0.0, "end": 0.5 },
{ "id": "w1", "text": "world.", "start": 0.6, "end": 1.2 }
]

`remove-background` produces a transparent overlay from a video or image:

npx hyperframes remove-background avatar.mp4 -o transparent.webm # default: VP9 alpha WebM
npx hyperframes remove-background avatar.mp4 -o transparent.mov # ProRes 4444 (editing)
npx hyperframes remove-background portrait.jpg -o cutout.png # single-image cutout
npx hyperframes remove-background avatar.mp4 -o transparent.webm --device cpu
npx hyperframes remove-background --info # detected providers

The `u2net_human_seg` model is cached in `~/.cache/hyperframes/background-removal/models/`.

| Format | When |
|---|---|
| `.webm` | Default. Compositions play this directly via `<video>`. |
| `.mov` | Editing in DaVinci/Premiere/FCP. Large files. |
| `.png` | Single-image cutout (still subject, layered over a backdrop). |
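The output extension decides the container, so a batch script can pick it from the intended use. This is a sketch only: the three use-case labels and the `-cutout` file suffix are illustrative assumptions, and `remove_background_command` is a hypothetical helper that builds the argv without running it.

```python
from __future__ import annotations

from pathlib import Path

# Extension per use case, following the remove-background examples above.
EXT_FOR_USE = {"composition": ".webm", "editing": ".mov", "still": ".png"}


def remove_background_command(source: str,
                              use: str = "composition") -> list[str]:
    """Build (but do not run) the argv for one background-removal job."""
    if use not in EXT_FOR_USE:
        raise ValueError(f"unknown use {use!r}; expected one of {sorted(EXT_FOR_USE)}")
    src = Path(source)
    # Keep the output next to the source, with a suffix marking it as a cutout.
    output = src.with_name(f"{src.stem}-cutout{EXT_FOR_USE[use]}")
    return ["npx", "hyperframes", "remove-background", source, "-o", str(output)]
```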
In a `hyperframes` composition, reference the `.webm` from a `<video>` element.

TTS → transcribe → captions:

npx hyperframes tts script.txt --voice af_heart --output narration.wav
npx hyperframes transcribe narration.wav # → transcript.json
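For the captions step, the word-level transcript can be grouped into timed cues. The sketch below assumes `transcript.json` is a top-level list of words with `text`, `start`, and `end` in seconds, as in the sample output; grouping by a fixed word count and emitting SRT are illustrative choices, and `srt_time` and `words_to_srt` are hypothetical helpers.

```python
from __future__ import annotations


def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02}:{minutes:02}:{secs:02},{ms:03}"


def words_to_srt(words: list[dict], max_words: int = 6) -> str:
    """Group consecutive words into cues of at most max_words each."""
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        text = " ".join(w["text"] for w in chunk)
        # A cue spans from its first word's start to its last word's end.
        span = f"{srt_time(chunk[0]['start'])} --> {srt_time(chunk[-1]['end'])}"
        cues.append(f"{len(cues) + 1}\n{span}\n{text}\n")
    return "\n".join(cues)
```

Typical use would be to load the file with `json.load` and write the result of `words_to_srt(words)` to `captions.srt`.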