Search Results: text-to-speech

Found 89 Skills

AI & Machine Learningframersai/agentos-skills

google-cloud-tts

Text-to-speech synthesis via Google Cloud Text-to-Speech API — MP3 output, configurable language and voice, voice listing.

🇺🇸|EnglishTranslated

Frontend Developmentdaffy0208/ai-dev-standard...

voice-interface-builder

Expert in building voice interfaces, speech recognition, and text-to-speech systems

🇺🇸|EnglishTranslated

AI & Machine Learningakrindev/google-studio-sk...

gemini-tts

Generate speech from text using Google Gemini TTS models via scripts/. Use for text-to-speech, audio generation, voice synthesis, multi-speaker conversations, and creating audio content. Supports multiple voices and streaming. Triggers on "text to speech", "TTS", "generate audio", "voice synthesis", "speak this text".

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningitechmeat/llm-code

inworld

Inworld TTS API. Covers voice cloning, audio markups, timestamps. Keywords: text-to-speech, visemes.

🇺🇸|EnglishTranslated

AI & Machine Learningmodelslab/skills

modelslab-audio-generation

Generate speech, music, and sound effects using ModelsLab's v7 Voice API. Supports text-to-speech, speech-to-text, speech-to-speech, music generation, sound effects, dubbing, song extension, and song inpainting via ElevenLabs and Inworld models.

🇺🇸|EnglishTranslated

Backend Developmentteam-telnyx/skills

telnyx-voice-media-go

Play audio files, use text-to-speech, and record calls. Use when building IVR systems, playing announcements, or recording conversations. This skill provides Go SDK examples.

🇺🇸|EnglishTranslated

AI & Machine Learningvm0-ai/vm0-skills

minimax

MiniMax API via curl. Use this skill for Chinese LLM chat, text-to-speech, and AI video generation.

🇺🇸|EnglishTranslated

AI & Machine Learningdeepgram/skills

docs

Find the right Deepgram documentation for any task. Use whenever someone needs help locating docs, understanding which API to use, or wants to ask questions about Deepgram. Covers all product areas: speech-to-text, text-to-speech, voice agents, audio intelligence, and self-hosted deployments.

🇺🇸|EnglishTranslated

AI & Machine Learningdeepgram/skills

starters

Clone a ready-to-run Deepgram demo app and start building on top of it. Use whenever someone wants a quick working demo, needs to prototype with Deepgram, or is starting a new project that uses speech-to-text, text-to-speech, voice agents, audio intelligence, or live streaming. Match the user's language, framework, and desired Deepgram feature to the right starter.

🇺🇸|EnglishTranslated

AI & Machine Learningjakerains/agentskills

elevenlabs

Complete ElevenLabs AI audio platform: text-to-speech (TTS), speech-to-text (STT/Scribe), voice cloning, voice design, sound effects, music generation, dubbing, voice changer, voice isolator, and conversational voice agents. Use when working with audio generation, voice synthesis, transcription, audio processing, or building voice-enabled applications. Triggers: generate speech, clone voice, transcribe audio, create sound effects, compose music, dub video, change voice, isolate vocals, build voice agent, ElevenLabs API/SDK/CLI/MCP.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningsickn33/antigravity-aweso...

podcast-generation

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creatio...

🇺🇸|EnglishTranslated

AI & Machine Learningiamzhihuix/happy-claude-s...

happy-audio-gen

Universal AI voice / text-to-speech skill supporting OpenAI TTS (gpt-4o-mini-tts, tts-1), ElevenLabs multilingual TTS with voice cloning, Bailian Qwen TTS (qwen-tts / qwen3-tts-vd with voice-design custom voices, long-text chunking built in), MiniMax speech-02-hd, SiliconFlow CosyVoice / SenseVoice, and PlayHT 2.0. Use this skill whenever the user asks to read text aloud, synthesize speech, generate narration, create voice-over, dub a script, or turn any text into audio (mp3 / wav / ogg / flac). Typical phrases include "read this aloud", "generate voice for ...", "create a narration of ...", "tts this", "把这段念出来", "做个配音", "合成语音", or mentions of voices / TTS model names like Alloy, Ash, Cherry, Rachel, CosyVoice, PlayHT. Always use this skill even if the user does not specify a provider — pick one from EXTEND.md defaults or available env keys.

🇺🇸|EnglishTranslated

10 scripts/Attention