Chunked sliding-window streaming speech-to-text via OpenAI Whisper HTTP API — compatible with local Faster-Whisper, Groq, and OpenRouter endpoints.
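As a rough illustration of what "chunked sliding-window" can mean, the sketch below splits a sample buffer into overlapping windows. The function name `slidingWindows` and the parameters `windowSamples` and `overlapSamples` are hypothetical and chosen for illustration; the skill's actual chunk size and overlap are not stated here.

```typescript
// Hypothetical helper, not part of the skill's API: compute overlapping
// [start, end) sample ranges that cover a buffer of totalSamples samples.
interface ChunkRange {
  start: number; // inclusive sample index
  end: number;   // exclusive sample index
}

function slidingWindows(
  totalSamples: number,
  windowSamples: number,
  overlapSamples: number,
): ChunkRange[] {
  if (windowSamples <= 0) {
    throw new RangeError("windowSamples must be positive");
  }
  if (overlapSamples < 0 || overlapSamples >= windowSamples) {
    throw new RangeError("overlapSamples must be in [0, windowSamples)");
  }
  // Each window starts this many samples after the previous one.
  const step = windowSamples - overlapSamples;
  const ranges: ChunkRange[] = [];
  for (let start = 0; start < totalSamples; start += step) {
    const end = Math.min(start + windowSamples, totalSamples);
    ranges.push({ start, end });
    // Stop once a window reaches the end of the buffer.
    if (end === totalSamples) break;
  }
  return ranges;
}
```

With 16 kHz audio, for example, `slidingWindows(total, 80000, 16000)` would describe 5-second windows that overlap by 1 second; the overlap gives each chunk some context from the previous one at the cost of transcribing that audio twice.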
npx skill4agent add framersai/agentos-skills streaming-stt-whisper

`OPENAI_API_KEY`, `baseUrl`, `providerOptions`

{
"voice": {
"stt": "whisper"
}
}

{
"voice": {
"stt": "whisper",
"providerOptions": {
"model": "whisper-1",
"language": "en",
"baseUrl": "http://localhost:8000"
}
}
}

`prompt`, `error`, `language`, `/v1/audio/transcriptions`

| Event | Description |
|---|---|
| | Emitted after each chunk is transcribed |
| | Emitted after flush() completes |
| | RMS energy crossed threshold |
| | RMS energy dropped below threshold |
| | Fetch failure (session continues) |
| | Session fully terminated |
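The two RMS rows above describe an energy gate. A minimal sketch of that idea follows, assuming mono PCM frames as `Float32Array` values in the range [-1, 1]. The names `rmsEnergy` and `EnergyGate`, and the `"above"`/`"below"` labels, are hypothetical; they are not the skill's event names, and the skill's threshold value is not given here.

```typescript
// Root-mean-square energy of one frame of PCM samples.
function rmsEnergy(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sumSquares = 0;
  for (let i = 0; i < frame.length; i++) {
    sumSquares += frame[i] * frame[i];
  }
  return Math.sqrt(sumSquares / frame.length);
}

// Hypothetical transition labels; null means "no change since last frame".
type EnergyTransition = "above" | "below" | null;

// Tracks whether the signal is currently above the threshold and reports
// only the frames on which that state changes.
class EnergyGate {
  private active = false;
  private readonly threshold: number;

  constructor(threshold: number) {
    this.threshold = threshold;
  }

  update(frame: Float32Array): EnergyTransition {
    const loud = rmsEnergy(frame) >= this.threshold;
    if (loud && !this.active) {
      this.active = true;
      return "above";
    }
    if (!loud && this.active) {
      this.active = false;
      return "below";
    }
    return null;
  }
}
```

A caller would feed consecutive frames to `update()` and react only to non-null results. A production gate would likely add hysteresis or a hold time so that brief dips do not toggle the state.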
`OPENAI_API_KEY`, `providerOptions.baseUrl`, `diarization`
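To make the role of `baseUrl` concrete, here is a sketch of how one audio chunk could be posted to an OpenAI-compatible `/v1/audio/transcriptions` endpoint. The function `buildTranscriptionRequest` and the `WhisperOptions` type are hypothetical, and joining `baseUrl` directly with the path is an assumption based on the configuration example above, not a description of the skill's internals. The form fields `file`, `model`, `language`, and `prompt` follow the OpenAI transcription API.

```typescript
// Hypothetical option bag mirroring the providerOptions shown earlier.
interface WhisperOptions {
  model?: string;
  language?: string;
  prompt?: string;
  baseUrl?: string;
}

interface TranscriptionRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: FormData;
}

// Builds (but does not send) a multipart request for one audio chunk.
function buildTranscriptionRequest(
  audio: Blob,
  apiKey: string,
  opts: WhisperOptions = {},
): TranscriptionRequest {
  // Assumption: fall back to the public OpenAI host when baseUrl is unset,
  // and strip trailing slashes so the joined path has exactly one.
  const base = (opts.baseUrl ?? "https://api.openai.com").replace(/\/+$/, "");

  const form = new FormData();
  form.append("file", audio, "chunk.wav");
  form.append("model", opts.model ?? "whisper-1");
  if (opts.language) form.append("language", opts.language);
  if (opts.prompt) form.append("prompt", opts.prompt);

  return {
    url: `${base}/v1/audio/transcriptions`,
    method: "POST",
    // Content-Type is left unset so fetch can add the multipart boundary.
    headers: { Authorization: `Bearer ${apiKey}` },
    body: form,
  };
}
```

The result can be sent with `fetch(req.url, { method: req.method, headers: req.headers, body: req.body })`. Keeping construction separate from sending makes it easy to point the same code at a local server such as `http://localhost:8000`.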