Search Results: tts

Found 196 Skills

AI & Machine Learningcinience/alicloud-skills

aliyun-qwen-tts-realtime

Use when real-time speech synthesis is needed with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningqwencloud/qwencloud-ai

qwencloud-audio-tts

[QwenCloud] Synthesize speech from text with Qwen TTS models. TRIGGER when: user wants to convert text to speech, create voiceovers, generate audio narration, read text aloud, build TTS applications, mentions speech synthesis/voice generation/audio output from text, or explicitly invokes this skill by name (e.g. use qwencloud-audio-tts). DO NOT TRIGGER when: user wants speech recognition/ASR, text generation without audio, non-Qwen audio tasks.

🇺🇸|EnglishTranslated

4 scripts/Checked

AI & Machine Learningwinsorllc/upgraded-carniv...

elevenlabs-tts

Convert text to speech using ElevenLabs API. Use when you need to generate voice audio for messages, narrations, or accessibility.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningalsk1992/cloddsbot

tts

Text-to-speech synthesis with ElevenLabs and system voices

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learninggaelic-ghost/productivity...

talktomepy-tts

Deprecated legacy TalkToMePy TTS skill retained for backward compatibility. Prefer successor speech workflows in [gaelic-ghost/a11y-skills](https://github.com/gaelic-ghost/a11y-skills).

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningframersai/agentos-skills

google-cloud-tts

Text-to-speech synthesis via Google Cloud Text-to-Speech API — MP3 output, configurable language and voice, voice listing.

🇺🇸|EnglishTranslated

AI & Machine Learningakrindev/google-studio-sk...

gemini-tts

Generate speech from text using Google Gemini TTS models via scripts/. Use for text-to-speech, audio generation, voice synthesis, multi-speaker conversations, and creating audio content. Supports multiple voices and streaming. Triggers on "text to speech", "TTS", "generate audio", "voice synthesis", "speak this text".

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningjarmen423/skills

qwen3-tts

Build text-to-speech applications using Qwen3-TTS, a powerful speech generation system supporting voice clone, voice design, and custom voice synthesis. Use when creating TTS applications, generating speech from text, cloning voices from audio samples, designing new voices via natural language descriptions, or fine-tuning TTS models. Supports 10 languages (Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian).

🇺🇸|EnglishTranslated

Testing & QAcinience/alicloud-skills

alicloud-ai-audio-tts-voice-clone-test

Minimal voice cloning TTS smoke test for Model Studio Qwen TTS VC.

🇺🇸|EnglishTranslated

AI & Machine Learningaradotso/trending-skills

moss-tts-nano-speech

Expert skill for using MOSS-TTS-Nano, a 0.1B parameter multilingual real-time TTS model that runs on CPU with voice cloning support.

🇺🇸|EnglishTranslated

AI & Machine Learningxiaomimimo/mimo-skills

mimo-v2-5-tts

MiMo V2.5 TTS Text-to-Speech. Generate speech using Xiaomi MiMo V2.5 TTS series models. This skill is activated when text needs to be converted to speech, voice messages need to be sent, content needs to be read aloud, or when users request 'speak it out' or 'voice reply'. It supports three modes: preset voice, voice design, and voice cloning, as well as natural language control and director mode. It also supports style tag control for tone, emotion, and dialect, and preset voices support singing.

🇨🇳|ChineseTranslated

4 scripts/Attention

AI & Machine Learningsmallnest/goal-workflow

listenhub-tts

Convert text to speech (TTS) using the ListenHub API. Three modes are supported: Quick Synthesis (/v1/tts), Multi-role Script (/v1/speech), and Long Text Streaming Synthesis (/v1/flow-speech/episodes). If no voice is specified, automatically retrieve the voice list for user selection, with chat-girl-105-cn (Xiaoman) as the default. Use when user says: "tts", "text to speech", "语音合成", "文字转语音", "朗读", "生成语音", "生成音频", "转音频", "text to audio"

🇨🇳|ChineseTranslated