Search Results: audio-generation

Found 32 Skills

Tools & Utilities | vaibhav0806/trying-someth...

edge-tts

Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when: (1) the user requests audio/voice output with the "tts" trigger or keyword; (2) content needs to be spoken rather than read (multitasking, accessibility, driving, cooking); or (3) the user wants a specific voice, speed, pitch, or format for TTS output.

🇺🇸 | English | Translated
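
As a rough illustration of the voice, speed, and pitch options mentioned in the edge-tts entry, the sketch below only assembles a `uvx edge-tts` command line; it does not run it. The helper name `build_edge_tts_command` and the sample values are hypothetical and not taken from the skill itself. The flags (`--text`, `--write-media`, `--voice`, `--rate`, `--pitch`) are those of the edge-tts command-line tool.

```python
import shlex


def build_edge_tts_command(text, out_path, voice=None, rate=None, pitch=None):
    """Return the argv list for a `uvx edge-tts` call (hypothetical helper).

    Nothing is executed here; the caller decides how to run the command.
    """
    cmd = ["uvx", "edge-tts", "--text", text, "--write-media", out_path]
    if voice:
        cmd += ["--voice", voice]
    # The `--flag=value` form keeps negative values such as "-10%"
    # from being parsed as separate options.
    if rate:
        cmd.append(f"--rate={rate}")
    if pitch:
        cmd.append(f"--pitch={pitch}")
    return cmd


if __name__ == "__main__":
    # Sample values are assumptions chosen for illustration.
    cmd = build_edge_tts_command(
        "Hello from edge-tts", "hello.mp3", voice="en-US-AriaNeural", rate="+20%"
    )
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run` if `uvx` is installed.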

AI & Machine Learning | yangagent/minimax-tts-pip...

minimax-tts-pipeline

Generate Chinese broadcast audio from text files via the MiniMax TTS API, which automatically handles common pronunciation errors such as polyphonic characters, English abbreviations, mixed model names, and number pronunciations. Triggered when the user says "Generate broadcast audio using MiniMax".

🇨🇳 | Chinese | Translated

6 scripts/Attention

AI & Machine Learning | meowa-ai/meowa-skills

game-assets

Create, edit, and pipeline game assets using MeowArt, including pixel sprites, HD assets, backgrounds, UI mockups, seamless loops, texture tiles, dual-grid tilesets, background removal, pixel cleanup, simple animations, sound effects, and music/BGM generation. Use this when Codex needs to produce or refine game art or audio assets in the project, especially when selecting MeowArt commands, setting canvas sizes, choosing templates, generating music or SFX, or converting generated assets into game-ready files.

🇨🇳 | Chinese | Translated

1 scripts/Attention

AI & Machine Learning | nexu-io/open-design

audio-jingle

Audio generation skill for jingles, beds, voiceover, and sound effects. Routes music requests to Suno V5 / Udio / Lyria, speech to MiniMax TTS / FishAudio / ElevenLabs V3, and SFX to ElevenLabs SFX or AudioCraft. Output is one MP3/WAV file saved to the project folder.

🇺🇸 | English | Translated
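
The routing described in the audio-jingle entry can be pictured as a lookup from request kind to candidate providers. The sketch below is an illustration only: the `ROUTES` table copies the provider names from the listing, while the `route` function and its "first available provider wins" rule are assumptions, since the listing does not say how the skill chooses among providers.

```python
# Provider names come from the listing; their order here is just the listed order.
ROUTES = {
    "music": ["Suno V5", "Udio", "Lyria"],
    "speech": ["MiniMax TTS", "FishAudio", "ElevenLabs V3"],
    "sfx": ["ElevenLabs SFX", "AudioCraft"],
}


def route(kind, available=None):
    """Pick a provider for a request kind (hypothetical selection rule).

    If `available` is given, return the first listed provider found in it,
    or None when none of the candidates is available.
    """
    candidates = ROUTES.get(kind)
    if candidates is None:
        raise ValueError(f"unknown request kind: {kind!r}")
    if available is None:
        return candidates[0]
    for name in candidates:
        if name in available:
            return name
    return None


if __name__ == "__main__":
    print(route("music"))
    print(route("speech", available={"FishAudio", "ElevenLabs V3"}))
```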

Tools & Utilities | agricidaniel/claude-blog

blog-audio

Generate audio narration of blog posts using Google Gemini TTS. Supports summary narration, full article read-aloud, and two-speaker podcast/dialogue mode with 30 voice options. Outputs MP3 with HTML5 audio embed code. Works standalone via /blog audio or internally from blog-write. Falls back gracefully when API key is not configured. Use when user says "blog audio", "narrate blog", "audio version", "text to speech", "tts", "podcast mode", "read aloud", "audio narration", "voice", "narration", "generate audio".

🇺🇸 | English | Translated

4 scripts/Checked

AI & Machine Learning | samuraigpt/generative-med...

muapi-music-video

Build a short music video from a song theme: generate N keyframes, animate each one, and generate matching music.

🇺🇸 | English | Translated

AI & Machine Learning | jechearte/skills

elevenlabs-tts

Generate realistic audio from text using the ElevenLabs Text-to-Speech API. Use when the user needs to convert text to speech, create voiceovers, generate narration, or produce audio content. Triggers include "generate audio", "text to speech", "TTS", "voiceover", "narration", "ElevenLabs", "audio from text", and "read this text aloud".

🇺🇸 | English | Translated

1 scripts/Checked

AI & Machine Learning | runwayml/skills

rw-generate-audio

Generate audio using the Runway API via runnable scripts. Supports TTS, sound effects, voice isolation, dubbing, and voice conversion.

🇺🇸 | English | Translated