Search Results: audio-generation

Found 32 Skills

audio-generation

Guide to audio generation and understanding in MassGen. Covers text-to-speech, music, sound effects, and audio understanding across ElevenLabs and OpenAI backends.

🇺🇸 | English (Translated)

AI & Machine Learning | modelslab/skills

modelslab-audio-generation

Generate speech, music, and sound effects using ModelsLab's v7 Voice API. Supports text-to-speech, speech-to-text, speech-to-speech, music generation, sound effects, dubbing, song extension, and song inpainting via ElevenLabs and Inworld models.

AI & Machine Learning | davila7/claude-code-templ...

audiocraft-audio-generation

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.

AI & Machine Learning | postplusai/postplus-skill...

audio-generation

Control audio generation requests before execution. Use this when the user asks for TTS, persona voice, voice change, translated dub, cloned voice take, podcast audio, or lip-sync audio handoff and the skill must classify the request before handing execution to voice-batch-runner or a video workflow.

AI & Machine Learning | framersai/agentos-skills

audio-generation

Music and sound effects generation — 8 providers with fallback chains, user-configurable preferences, local and cloud options.

AI & Machine Learning | elevenlabs/skills

text-to-speech

Convert text to speech using ElevenLabs voice AI. Use when generating audio from text, creating voiceovers, building voice apps, or synthesizing speech in 70+ languages.

AI & Machine Learning | glebis/claude-skills

elevenlabs-tts

This skill converts text to high-quality audio files using the ElevenLabs API. Use this skill when users request text-to-speech generation, audio narration, or voice synthesis with customizable voice parameters (stability, similarity boost) and voice presets (rachel, adam, bella, elli, josh, arnold, ava).

1 script | Checked

AI & Machine Learning | affaan-m/everything-claud...

fal-ai-media

Unified media generation via fal.ai MCP — image, video, and audio. Covers text-to-image (Nano Banana), text/image-to-video (Seedance, Kling, Veo 3), text-to-speech (CSM-1B), and video-to-audio (ThinkSound). Use when the user wants to generate images, videos, or audio with AI.

AI & Machine Learning | cinience/alicloud-skills

alicloud-ai-audio-tts

Generate human-like speech audio with Model Studio DashScope Qwen TTS (qwen3-tts-flash). Use when converting text to speech, producing voice lines for short drama/news videos, or documenting TTS request/response fields for DashScope.

1 script | Checked

Tools & Utilities | dkyazzentwatwa/chatgpt-sk...

sound-effects-generator

Generate audio tones, noise, DTMF signals, and simple sound effects programmatically. Export to WAV or MP3 format.

1 script | Checked

AI & Machine Learning | omer-metin/skills-for-ant...

ai-music-audio

Comprehensive patterns for AI-powered audio generation including text-to-music, voice synthesis, text-to-speech, sound effects, and audio manipulation using MusicGen, Bark, ElevenLabs, and more. Use when any of the following is mentioned: music generation, text to music, AI music, voice cloning, text to speech, TTS API, ElevenLabs, MusicGen, Bark, audio synthesis, sound effects generation, voice synthesis, AudioCraft.

AI & Machine Learning | heygen-com/skills

text-to-speech

Generate speech audio from text using HeyGen's Starfish TTS model. Use when: (1) Generating standalone speech audio files from text, (2) Converting text to speech with voice selection, speed, and pitch control, (3) Creating audio for voiceovers, narration, or podcasts, (4) Working with HeyGen's /v1/audio endpoints, (5) Listing available TTS voices by language or gender.
