Search Results: TTS

Found 156 Skills

AI & Machine Learning | cinience/alicloud-skills

alicloud-ai-audio-tts-voice-design

Voice design workflows with Alibaba Cloud Model Studio Qwen TTS VD models. Use when creating custom synthetic voices from text descriptions and using them for speech synthesis.

1 script / Checked

AI & Machine Learning | cinience/alicloud-skills

aliyun-qwen-tts-realtime

Use when real-time speech synthesis is needed with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.

1 script / Checked

AI & Machine Learning | aradotso/trending-skills

parlor-on-device-ai

On-device, real-time multimodal AI voice and vision assistant powered by Gemma 4 E2B and Kokoro TTS, running entirely locally via FastAPI WebSocket server.

AI & Machine Learning | qwencloud/qwencloud-ai

qwencloud-audio-tts

[QwenCloud] Synthesize speech from text with Qwen TTS models. TRIGGER when: user wants to convert text to speech, create voiceovers, generate audio narration, read text aloud, build TTS applications, mentions speech synthesis/voice generation/audio output from text, or explicitly invokes this skill by name (e.g. use qwencloud-audio-tts). DO NOT TRIGGER when: user wants speech recognition/ASR, text generation without audio, non-Qwen audio tasks.

4 scripts / Checked

Tools & Utilities | winsorllc/upgraded-carniv...

voice-output

Speak text aloud using system TTS (say command on macOS/Linux) or browser TTS via Chrome DevTools Protocol. Use when: (1) job completes and you want to announce results, (2) user asks to hear something spoken, (3) notifications that need audio alerts, (4) accessibility - reading content aloud.

5 scripts / Attention

AI & Machine Learning | freestylefly/canghe-skill...

flyworks-avatar-video

Generate videos using Flyworks (a.k.a. HiFly) Digital Humans. Create talking-photo videos from images, use public avatars with TTS, or clone voices for custom audio.

1 script / Checked

AI & Machine Learning | sickn33/antigravity-aweso...

voice-ai-engine-development

Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis, with interrupt handling and multi-provider support.

5 scripts / Checked

AI & Machine Learning | gaelic-ghost/productivity...

talktomepy-tts

Deprecated legacy TalkToMePy TTS skill retained for backward compatibility. Prefer successor speech workflows in [gaelic-ghost/a11y-skills](https://github.com/gaelic-ghost/a11y-skills).

1 script / Attention

AI & Machine Learning | chanjing-ai/chan-skills

chanjing-tts-voice-clone

Use the Chanjing TTS API to synthesize speech from text using a user-provided voice.

AI & Machine Learning | minimax-ai/skills

minimax-multimodal-toolkit

MiniMax multimodal model skill — use MiniMax Multi-Modal models for speech, music, video, and image. Create voice, music, video, and images with MiniMax AI: TTS (text-to-speech, voice cloning, voice design, multi-segment), music (songs, instrumentals), video (text-to-video, image-to-video, start-end frame, subject reference, templates, long-form multi-scene), image (text-to-image, image-to-image with character reference), and media processing (convert, concat, trim, extract). Use when the user mentions MiniMax, multimodal generation, or wants speech/music/video/image AI, MiniMax APIs, or FFmpeg workflows alongside MiniMax outputs.

9 scripts / Attention

AI & Machine Learning | mckruz/comfyui-expert

comfyui-voice-pipeline

Generate character voices using TTS, voice cloning, and lip-sync tools. Supports Chatterbox, F5-TTS, TTS Audio Suite, RVC, and ElevenLabs. Use when creating speech audio for characters or syncing audio to video.

AI & Machine Learning | cinience/alicloud-skills

aliyun-cosyvoice-voice-clone

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.

1 script / Checked