Search Results: audio-transcription

Found 30 Skills

AI & Machine Learningkouko/monkey-knowledge-yo...

mk-youtube-audio-transcribe

Transcribe audio to text using local whisper.cpp. Use when user wants to convert audio/video to text, get transcription, or speech-to-text.

🇺🇸|EnglishTranslated

13 scripts/Attention

AI & Machine Learningthinkfleetai/thinkfleet-e...

local-whisper

Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.

🇺🇸|EnglishTranslated

AI & Machine Learningsarvamai/skills

speech-to-text

Transcribe audio to text using Sarvam AI's Saaras model. Handles speech recognition, transcription, and voice interfaces for 23 Indian languages. Supports 5 output modes, auto language detection, WebSocket streaming, and batch diarization. Use when converting speech to text or building voice-enabled apps.

🇺🇸|EnglishTranslated

AI & Machine Learningsteipete/clawdis

openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

🇺🇸|EnglishTranslated

1 scripts/Checked

Document Processingcoroboros/agent-skills

markitdown

Convert any document to Markdown with Microsoft's `markitdown` CLI — PDF, Word, Excel, PowerPoint, HTML, CSV, JSON, XML, ZIP, EPub, images (OCR/EXIF), audio (transcription), and YouTube URLs. Use whenever the user wants to extract text from a binary document, transcribe audio, OCR an image, scrape a YouTube transcript, or pre-process a file for an LLM context window — even when they just say "convert this pdf", "what's in this docx", "transcribe this mp3", or "get the text out of this".

🇺🇸|EnglishTranslated

1 scripts/Attention

Tools & Utilitieshyperpuncher/dotagents

chough

Fast ASR CLI tool for transcribing audio/video files. Use when user wants to transcribe audio/video, generate subtitles (VTT), convert speech to text with timestamps (JSON), or optimize transcription for low memory.

🇺🇸|EnglishTranslated

AI & Machine Learningnoizai/skills

speech-to-text

Use this skill whenever the user wants to transcribe audio to text, convert speech to text, or get a transcript from an audio or video file. Triggers include: any mention of 'transcribe', 'transcription', 'speech to text', 'STT', 'convert audio to text', 'what does this audio say', 'get transcript', 'subtitle generation', or requests to extract spoken words from a file. Also use when the user wants speaker identification from audio, timestamps for captions, or multilingual transcription.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningaia-11-hn-mib/mib-mockint...

gemini-video-understanding

Analyze videos using Google's Gemini API - describe content, answer questions, transcribe audio with visual descriptions, reference timestamps, clip videos, and process YouTube URLs. Supports 9 video formats, multiple models (Gemini 2.5/2.0), and context windows up to 2M tokens (6 hours of video).

🇺🇸|EnglishTranslated

AI & Machine Learningalphaonedev/openclaw-grap...

openai-whisper-api

OpenAI Whisper API: audio transcription, translation, structured output, large file handling

🇺🇸|EnglishTranslated

AI & Machine Learning958877748/skills

groq-stt

Transcribe audio files using Groq API (Whisper models). Use when user needs to transcribe audio to text.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningmarswaveai/skills

asr

Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".

🇺🇸|EnglishTranslated

Tools & Utilitiesnateherkai/hyperframes-st...

hyperframes-cli

HyperFrames CLI tool — hyperframes init, lint, preview, render, transcribe, tts, doctor, browser, info, upgrade, compositions, docs, benchmark. Use when scaffolding a project, linting or validating compositions, previewing in the studio, rendering to video, transcribing audio, generating TTS, or troubleshooting the HyperFrames environment.

🇺🇸|EnglishTranslated