Search Results: tts

Found 123 Skills

AI & Machine Learning | aradotso/trending-skills

voicebox-voice-synthesis

Expert skill for Voicebox, the open-source local voice cloning and TTS studio built with Tauri, React, and FastAPI.

AI & Machine Learning | novitalabs/novita-skills

novita-ai

Novita AI: LLM, Image Generation & Editing, Video Generation, Audio (TTS/ASR), and GPU Cloud. Use this skill whenever the user wants to call Novita AI APIs — chat with LLMs (DeepSeek, Llama, Qwen), generate images (FLUX, Stable Diffusion, Seedream, Hunyuan Image), edit images (remove background, upscale, inpainting, img2img, outpainting, reimagine, merge face, replace background, remove text), generate videos (Kling, Wan, Hunyuan, Minimax Hailuo, Vidu, PixVerse, Seedance), do text-to-speech or speech-to-text (MiniMax TTS, GLM TTS, Fish Audio, ASR, voice cloning), run OpenAI-compatible batch jobs, manage GPU cloud instances and serverless endpoints, or check account balance and billing. Also trigger when the user mentions novita.ai, Novita AI, Novita API key, or wants to use any Novita platform service — even if they just say "generate an image" or "run an LLM" and Novita is available as a provider.

🇺🇸 | English | Translated

Documentation & Writing | dirnbauer/webconsulting-s...

webconsulting-create-documentation

Create product documentation with help pages, AI-generated screenshots, Remotion product videos with TTS narration and background music, and GitHub README visual documentation. Use when asked to create documentation, a help page, product tour video, generate screenshots, add user guides, or enrich a GitHub README.

🇺🇸 | English | Translated

DevOps & Cloud Services | terrylica/cc-skills

health

Health check for TTS and Telegram bot subsystems. TRIGGERS - tts health, kokoro status, telegram bot check, tts diagnostics.

🇺🇸 | English | Translated

AI & Machine Learning | krishamaze/skills

gemini-api-2026

Complete Google Gemini API reference for 2026. Use whenever writing code that calls Gemini models. Covers the google-genai SDK, Gemini 3/3.1 models, thought signatures, thinking config, Interactions API, File Search (managed RAG), Computer Use, URL Context, Nano Banana image gen, Live API, ephemeral tokens, TTS, Veo video gen, Lyria music gen, and all tools. ALWAYS prefer `from google import genai` over any legacy import. Use this skill for ANY Gemini API question, even simple ones.

🇺🇸 | English | Translated

AI & Machine Learning | cinience/alicloud-skills

alicloud-ai-audio-cosyvoice-voice-clone

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.

🇺🇸 | English | Translated

1 script | Checked

AI & Machine Learning | michaelboeding/skills

video-producer-agent

Use this skill to create complete videos with voiceover and music. Triggers: "create video", "product video", "explainer video", "promo video", "demo video", "training video", "ad video", "commercial", "marketing video", "video with voiceover", "video with music", "brand video", "testimonial video". Orchestrates: script, voiceover, background music, video clips/images, and final assembly.

🇺🇸 | English | Translated

2 scripts | Checked

AI & Machine Learning | outscal/video-generator

shorts-script-personality

Generates hyper-optimized YouTube Shorts/Instagram Reels scripts in personality-specific styles while enforcing strict anti-AI-slop writing rules.

🇺🇸 | English | Translated

Marketing & Growth | yonatangross/orchestkit

narration-scripting

Scene-by-scene narration scripts for videos. Use when writing voiceover scripts, adding timing markers, or creating CTA patterns for demos.

🇺🇸 | English | Translated

AI & Machine Learning | hmbown/minimax-cli

storybook-lesson

Create a kid-friendly learning card with an illustration and narrated audio.

🇺🇸 | English | Translated

AI & Machine Learning | bytedance/agentkit-sample...

byted-text-to-speech

Convert text to speech (TTS). Powered by the VolcEngine Doubao Text-to-Speech API, it supports streaming synthesis, multiple voice timbres, adjustable speech rate, pitch, and loudness, Markdown syntax filtering, and reading LaTeX formulas aloud. Use this skill when users need to convert text to speech, generate read-aloud audio, dubbing, narration, or broadcasts, or when they mention terms like 'text-to-speech', 'TTS', 'speech synthesis', 'reading aloud', or 'dubbing'.

🇨🇳 | Chinese | Translated

2 scripts | Checked

AI & Machine Learning | dokhacgiakhoa/antigravity...

voice-ai-engine-development

Architecting real-time Voice AI agents.

🇺🇸 | English | Translated