Search Results: TTS

Found 156 Skills

AI & Machine Learning | inference-skills/skills

ai-avatar-video

Create AI avatar and talking head videos via inference.sh CLI. Recommended: P-Video-Avatar (fastest, cheapest, built-in TTS). Also: OmniHuman, Fabric, PixVerse. Capabilities: audio-driven avatars, text-to-avatar, lipsync videos, talking head generation, virtual presenters. Use for: AI presenters, explainer videos, virtual influencers, dubbing, marketing videos. Triggers: ai avatar, talking head, lipsync, avatar video, virtual presenter, ai spokesperson, audio driven video, heygen alternative, synthesia alternative, talking avatar, lip sync, video avatar, ai presenter, digital human

🇺🇸 | English | Translated

177.2k

AI & Machine Learning | inference-sh/skills

text-to-speech

Convert text to natural speech with DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: DIA TTS (conversational), Kokoro TTS, Chatterbox, Higgs Audio, VibeVoice (podcasts). Capabilities: text-to-speech, voice cloning, multi-speaker dialogue, podcast generation, expressive speech. Use for: voiceovers, audiobooks, podcasts, accessibility, video narration, IVR, voice assistants. Triggers: text to speech, tts, voice generation, ai voice, speech synthesis, voice over, generate speech, ai narrator, voice cloning, text to audio, elevenlabs alternative, voice ai, ai voiceover, speech generator, natural voice

🇺🇸 | English | Translated

872

AI & Machine Learning | skill-zero/s

ai-voice-cloning

AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capabilities: multiple voices, emotions, accents, long-form narration, conversation. Use for: voiceovers, audiobooks, podcasts, video narration, accessibility. Triggers: voice cloning, tts, text to speech, ai voice, voice generation, voice synthesis, voice over, narration, speech synthesis, ai narrator, elevenlabs alternative, natural voice, realistic speech, voice ai

🇺🇸 | English | Translated

829

AI & Machine Learning | skill-zero/s

talking-head-production

Talking head video production with AI avatars, lipsync, and voiceover. Covers portrait requirements, audio quality, OmniHuman, PixVerse lipsync, Dia TTS. Use for: spokesperson videos, course content, social media, presentations, demos. Triggers: talking head, avatar video, lipsync, lip sync, ai spokesperson, virtual presenter, ai presenter, omnihuman, talking avatar, video presenter, ai talking head, presenter video, ai face video

🇺🇸 | English | Translated

810

AI & Machine Learning | skill-zero/s

dialogue-audio

Multi-speaker dialogue audio creation with Dia TTS. Covers speaker tags, emotion control, pacing, conversation flow, and post-production. Use for: podcasts, audiobooks, explainers, character dialogue, conversational content. Triggers: dialogue audio, multi speaker, conversation audio, dia tts, two speakers, podcast audio, character voices, voice acting, dialogue generation, conversation tts, multi voice, speaker tags, dialogue recording

🇺🇸 | English | Translated

732

AI & Machine Learning | martinholovsky/claude-ski...

text-to-speech

Expert skill for implementing text-to-speech with Kokoro TTS. Covers voice synthesis, audio generation, performance optimization, and secure handling of generated audio for JARVIS voice assistant.

🇺🇸 | English | Translated

Automation | jykim/claude-obsidian-ski...

markdown-video

Convert Deckset-format markdown slides with speaker notes to presentation video with TTS narration. Use when user requests to create video from slides, generate presentation video, or convert slides to MP4 format.

🇺🇸 | English | Translated

8 scripts/Checked

AI & Machine Learning | michaelboeding/skills

voice-generation

Use this skill for AI text-to-speech generation. Triggers include: "generate voice", "create audio", "text to speech", "TTS", "read this aloud", "generate narration", "create voiceover", "synthesize speech", "podcast audio", "dialogue audio", "multi-speaker", "audiobook". Supports Google Gemini TTS, ElevenLabs, and OpenAI TTS.

🇺🇸 | English | Translated

3 scripts/Checked

AI & Machine Learning | cinience/alicloud-skills

alicloud-ai-audio-tts-realtime

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.

🇺🇸 | English | Translated

1 scripts/Checked

AI & Machine Learning | notedit/happy-skills

tts-skill

MiniMax TTS API - Text-to-Speech, Voice Cloning, Voice Design

🇨🇳 | Chinese | Translated

1 scripts/Checked

Automation | zrt-ai-lab/opencode-skill...

video-creator

Video creation skill. Combines images and audio to generate videos, with support for TTS dubbing, fade-in/fade-out transitions, subtitles, an outro, and background music (BGM). Triggered when users mention phrases like 'generate video', 'make video', 'educational video', 'image-to-video', 'create video account content', 'dubbed video', 'image-text integrated video', 'ancient poetry video', 'story video'. Includes the full workflow of image generation → dubbing → video synthesis, so there is no need to call image-service separately.

🇨🇳 | Chinese | Translated

5 scripts/Checked

AI & Machine Learning | tavus-engineering/tavus-s...

tavus-cvi-persona

Configure Tavus CVI personas with custom LLMs, TTS engines, perception, and turn-taking. Use when customizing AI behavior, bringing your own LLM, configuring voice/TTS, enabling vision with Raven, or tuning conversation flow with Sparrow.

🇺🇸 | English | Translated