Loading...
Loading...
Found 124 Skills
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Read, watch, and listen to video/audio files. Use Gemini for native video understanding, or extract key frames + Whisper transcription as fallback. Use when a user sends a video/audio and asks about its content, what's in it, what someone said, etc.
Fast ASR CLI tool for transcribing audio/video files. Use when user wants to transcribe audio/video, generate subtitles (VTT), convert speech to text with timestamps (JSON), or optimize transcription for low memory.
Subtitle generation and burning. Transcription → dictionary correction → review → burning. Trigger words: add subtitles, generate subtitles, subtitles
Transcribe speech to text using the Speech framework. Use when implementing live microphone transcription with AVAudioEngine, recognizing pre-recorded audio files, configuring on-device vs server-based recognition, handling authorization flows, or adopting the new SpeechAnalyzer API (iOS 26+) for modern async/await speech-to-text.
Connect to PAXS AI platform to create meetings, upload recordings, and generate transcriptions and meeting notes. Use this skill when a user wants to transcribe audio, create meeting notes, or interact with the PAXS platform.
Transcribe audio/video using trx CLI and post-process results with agent corrections. Use when: (1) user wants to transcribe a video or audio file, (2) user shares a YouTube/Twitter/Instagram URL for transcription, (3) user says "transcribe", "subtitles", "srt", "transcript", (4) user wants to fix/clean up a whisper transcription, (5) user asks to extract text from a video.
Remove background noise and isolate vocals/speech from audio using ElevenLabs Voice Isolator (audio isolation) API. Use when cleaning up noisy recordings, removing music or background ambience from dialogue, isolating speech from field recordings, preparing audio for transcription, extracting vocals, or any "denoise / clean up / isolate voice" task.
Jiminny platform help — conversation intelligence, revenue intelligence, AI notetaker, sales coaching, and automatic CRM logging. Use when setting up Jiminny call recording or transcription, configuring Jiminny CRM sync to Salesforce or HubSpot, connecting Jiminny to a dialer like Aircall or Dialpad, troubleshooting calls not appearing in Jiminny or tagging delays, pulling activity data from the Jiminny API, comparing Jiminny vs Gong pricing or features, or evaluating Jiminny for pipeline visibility. Do NOT use for general coaching program design (use /sales-coaching) or comparing standalone AI note-takers (use /sales-note-taker).
Tactiq platform help — Chrome extension AI meeting note-taker with bot-free live transcription for Google Meet, Zoom, and MS Teams. Use when Tactiq transcription accuracy is poor with accents, AI credits running out too fast, setting up Tactiq workflows to push notes to Slack or Notion, connecting Tactiq to HubSpot or Salesforce via Zapier, figuring out which Tactiq plan to pick, or comparing Tactiq to Fathom or Fireflies. Do NOT use for choosing between note-takers generally (use /sales-note-taker) or reviewing a specific call for coaching (use /sales-call-review).
Video generation and transcription workflows via the Venice.ai API.
Subtitle Generation and Burning. Volcengine Transcription → Dictionary Error Correction → Review → Burning. Trigger words: add subtitles, generate subtitles, subtitles