Search Results: dio

Found 1,613 Skills

AI & Machine Learningmartinholovsky/claude-ski...

speech-to-text

Expert skill for implementing speech-to-text with Faster Whisper. Covers audio processing, transcription optimization, privacy protection, and secure handling of voice data for JARVIS voice assistant.

🇺🇸|EnglishTranslated

AI & Machine Learningqodex-ai/ai-agent-skills

voice-ai-integration

Build voice-enabled AI applications with speech recognition, text-to-speech, and voice-based interactions. Supports multiple voice providers and real-time processing. Use when creating voice assistants, voice-controlled applications, audio interfaces, or hands-free AI systems.

🇺🇸|EnglishTranslated

4 scripts/Checked

Tools & Utilitiesalejandro-ao/video-tool-c...

video-tool

Video processing toolkit. Use when user wants to: - Download videos from YouTube or other sites - Remove silence from videos - Trim, cut, or extract segments from videos - Extract audio from video files - Enhance or denoise audio - Replace audio track in a video - Change video playback speed - Concatenate multiple videos - Generate transcripts/captions (VTT) - Generate video descriptions, timestamps, or context cards - Upload videos to YouTube or Bunny.net CDN - Post social updates to X (Twitter) or LinkedIn - Get video metadata (duration, resolution, codec)

🇺🇸|EnglishTranslated

AI & Machine Learningagentiveau/myagentive

deepgram-transcription

Transcribe audio and video files using the Deepgram API. This skill should be used when the user requests transcription of audio files (mp3, wav, m4a, aac) or video files (mp4, mov, avi, etc.). Handles large video files by extracting audio first to reduce upload size and processing time.

🇺🇸|EnglishTranslated

1 scripts/Checked

Tools & Utilitiessteipete/clawdis

songsee

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.

🇺🇸|EnglishTranslated

AI & Machine Learningmichaelboeding/skills

podcast-producer-agent

Use this skill to create podcast episodes, interviews, dialogues, and conversation-style audio. Triggers: "create podcast", "podcast episode", "interview audio", "dialogue", "conversation", "two hosts discussing", "audio show", "radio show", "audio drama" Orchestrates: script generation, multi-speaker TTS, intro/outro music, and audio assembly.

🇺🇸|EnglishTranslated

Security & Complianceramzxy/ctf

ctf-stego

Steganography techniques for CTF challenges. Use when data is hidden in images, audio, video, or other media files.

🇺🇸|EnglishTranslated

AI & Machine Learningglebis/claude-skills

elevenlabs-tts

This skill converts text to high-quality audio files using ElevenLabs API. Use this skill when users request text-to-speech generation, audio narration, or voice synthesis with customizable voice parameters (stability, similarity boost) and voice presets (rachel, adam, bella, elli, josh, arnold, ava).

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learninglubu-labs/langchain-agent...

langgraph-project-setup

Initialize and configure LangGraph projects with proper structure, langgraph.json configuration, environment variables, and dependency management. Use when users want to (1) create a new LangGraph project, (2) set up langgraph.json for deployment, (3) configure environment variables for LLM providers, (4) initialize project structure for agents, (5) set up local development with LangGraph Studio, (6) configure dependencies (pyproject.toml, requirements.txt, package.json), or (7) troubleshoot project configuration issues.

🇺🇸|EnglishTranslated

4 scripts/Attention

Marketing & Growthguia-matthieu/clawfu-skil...

sound-design-film

Apply Walter Murch's legendary film sound principles to marketing video, creating emotionally resonant audio that audiences feel without consciously noticing. Use when: Designing sound for brand films and documentaries; Creating emotional impact in video ads; Layering music, effects, and voice effectively; Building immersive soundscapes for content; Elevating production value through audio

🇺🇸|EnglishTranslated

AI & Machine Learningheygen-com/skills

text-to-speech

Generate speech audio from text using HeyGen's Starfish TTS model. Use when: (1) Generating standalone speech audio files from text, (2) Converting text to speech with voice selection, speed, and pitch control, (3) Creating audio for voiceovers, narration, or podcasts, (4) Working with HeyGen's /v1/audio endpoints, (5) Listing available TTS voices by language or gender.

🇺🇸|EnglishTranslated

Testing & QAcinience/alicloud-skills

alicloud-ai-video-wan-edit-test

Minimal video editing smoke test for Model Studio Wan edit models.

🇺🇸|EnglishTranslated