eachlabs-voice-audio
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseEachLabs Voice & Audio
EachLabs 语音与音频处理
Text-to-speech, speech-to-text transcription, voice conversion, and audio utilities via the EachLabs Predictions API.
通过EachLabs Predictions API实现文本转语音、语音转文本、声音转换及音频工具功能。
Authentication
身份验证
Header: X-API-Key: <your-api-key>Set the environment variable. Get your key at eachlabs.ai.
EACHLABS_API_KEYHeader: X-API-Key: <your-api-key>设置环境变量。可前往eachlabs.ai获取您的密钥。
EACHLABS_API_KEYAvailable Models
可用模型
Text-to-Speech
文本转语音
| Model | Slug | Best For |
|---|---|---|
| ElevenLabs TTS | | High quality TTS |
| ElevenLabs TTS w/ Timestamps | | TTS with word timing |
| ElevenLabs Text to Dialogue | | Multi-speaker dialogue |
| ElevenLabs Sound Effects | | Sound effect generation |
| ElevenLabs Voice Design v2 | | Custom voice design |
| Kling V1 TTS | | Kling text-to-speech |
| Kokoro 82M | | Lightweight TTS |
| Play AI Dialog | | Dialog TTS |
| Stable Audio 2.5 | | Text to audio |
| 模型 | Slug | 适用场景 |
|---|---|---|
| ElevenLabs TTS | | 高质量文本转语音 |
| ElevenLabs TTS(带时间戳) | | 带单词时间戳的文本转语音 |
| ElevenLabs 文本转对话 | | 多说话人对话生成 |
| ElevenLabs 音效生成 | | 音效生成 |
| ElevenLabs 语音设计v2 | | 自定义语音设计 |
| Kling V1 TTS | | Kling语文本转语音 |
| Kokoro 82M | | 轻量级文本转语音 |
| Play AI 对话TTS | | 对话式文本转语音 |
| Stable Audio 2.5 | | 文本转音频 |
Speech-to-Text
语音转文本
| Model | Slug | Best For |
|---|---|---|
| ElevenLabs Scribe v2 | | Best quality transcription |
| ElevenLabs STT | | Standard transcription |
| Wizper with Timestamp | | Timestamped transcription |
| Wizper | | Basic transcription |
| Whisper | | Open-source transcription |
| Whisper Diarization | | Speaker identification |
| Incredibly Fast Whisper | | Fastest transcription |
| 模型 | Slug | 适用场景 |
|---|---|---|
| ElevenLabs Scribe v2 | | 高质量语音转写 |
| ElevenLabs STT | | 标准语音转写 |
| Wizper(带时间戳) | | 带时间戳的语音转写 |
| Wizper | | 基础语音转写 |
| Whisper | | 开源语音转写 |
| Whisper 说话人分离 | | 说话人识别 |
| 极速Whisper | | 最快速度语音转写 |
Voice Conversion & Cloning
声音转换与克隆
| Model | Slug | Best For |
|---|---|---|
| RVC v2 | | Voice conversion |
| Train RVC | | Train custom voice model |
| ElevenLabs Voice Clone | | Voice cloning |
| ElevenLabs Voice Changer | | Voice transformation |
| ElevenLabs Voice Design v3 | | Advanced voice design |
| ElevenLabs Dubbing | | Video dubbing |
| Chatterbox S2S | | Speech to speech |
| Open Voice | | Open-source voice clone |
| XTTS v2 | | Multi-language voice clone |
| Stable Audio 2.5 Inpaint | | Audio inpainting |
| Stable Audio 2.5 A2A | | Audio transformation |
| Audio Trimmer | | Audio trimming with fade |
| 模型 | Slug | 适用场景 |
|---|---|---|
| RVC v2 | | 声音转换 |
| 训练RVC模型 | | 训练自定义语音模型 |
| ElevenLabs 语音克隆 | | 语音克隆 |
| ElevenLabs 声音变换 | | 声音变换 |
| ElevenLabs 语音设计v3 | | 进阶语音设计 |
| ElevenLabs 视频配音 | | 视频配音 |
| Chatterbox 语音转语音 | | 语音转语音 |
| Open Voice | | 开源语音克隆 |
| XTTS v2 | | 多语言语音克隆 |
| Stable Audio 2.5 音频修复 | | 音频修复 |
| Stable Audio 2.5 音频转音频 | | 音频变换 |
| 音频剪辑(带淡入淡出) | | 带淡入淡出的音频剪辑 |
Audio Utilities
音频工具
| Model | Slug | Best For |
|---|---|---|
| FFmpeg Merge Audio Video | | Merge audio with video |
| Toolkit Video Convert | | Video/audio conversion |
| 模型 | Slug | 适用场景 |
|---|---|---|
| FFmpeg 音视频合并 | | 音频与视频合并 |
| 音视频转换工具包 | | 音视频格式转换 |
Prediction Flow
预测流程
- Check model — validates the model exists and returns the
GET https://api.eachlabs.ai/v1/model?slug=<slug>with exact input parameters. Always do this before creating a prediction to ensure correct inputs.request_schema - POST with model slug, version
https://api.eachlabs.ai/v1/prediction, and input matching the schema"0.0.1" - Poll until status is
GET https://api.eachlabs.ai/v1/prediction/{id}or"success""failed" - Extract the output from the response
- 检查模型 — 验证模型是否存在,并返回包含准确输入参数的
GET https://api.eachlabs.ai/v1/model?slug=<slug>。在创建预测前务必执行此步骤,以确保输入参数正确。request_schema - 发送POST请求 至,携带模型标识(slug)、版本
https://api.eachlabs.ai/v1/prediction及符合schema的输入参数"0.0.1" - 轮询请求 ,直到状态变为
GET https://api.eachlabs.ai/v1/prediction/{id}或"success""failed" - 提取结果 从响应中获取输出内容
Examples
示例
Text-to-Speech with ElevenLabs
基于ElevenLabs的文本转语音
bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "elevenlabs-text-to-speech",
"version": "0.0.1",
"input": {
"text": "Welcome to our product demo. Today we will walk through the key features.",
"voice_id": "EXAVITQu4vr4xnSDxMaL",
"model_id": "eleven_v3",
"stability": 0.5,
"similarity_boost": 0.7
}
}'bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "elevenlabs-text-to-speech",
"version": "0.0.1",
"input": {
"text": "Welcome to our product demo. Today we will walk through the key features.",
"voice_id": "EXAVITQu4vr4xnSDxMaL",
"model_id": "eleven_v3",
"stability": 0.5,
"similarity_boost": 0.7
}
}'Transcription with ElevenLabs Scribe
基于ElevenLabs Scribe的语音转写
bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "elevenlabs-speech-to-text-scribe-v2",
"version": "0.0.1",
"input": {
"media_url": "https://example.com/recording.mp3",
"diarize": true,
"timestamps_granularity": "word"
}
}'bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "elevenlabs-speech-to-text-scribe-v2",
"version": "0.0.1",
"input": {
"media_url": "https://example.com/recording.mp3",
"diarize": true,
"timestamps_granularity": "word"
}
}'Transcription with Wizper (Whisper)
基于Wizper(Whisper)的语音转写
bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "wizper-with-timestamp",
"version": "0.0.1",
"input": {
"audio_url": "https://example.com/audio.mp3",
"language": "en",
"task": "transcribe",
"chunk_level": "segment"
}
}'bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "wizper-with-timestamp",
"version": "0.0.1",
"input": {
"audio_url": "https://example.com/audio.mp3",
"language": "en",
"task": "transcribe",
"chunk_level": "segment"
}
}'Speaker Diarization with Whisper
基于Whisper的说话人分离
bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "whisper-diarization",
"version": "0.0.1",
"input": {
"file_url": "https://example.com/meeting.mp3",
"num_speakers": 3,
"language": "en",
"group_segments": true
}
}'bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "whisper-diarization",
"version": "0.0.1",
"input": {
"file_url": "https://example.com/meeting.mp3",
"num_speakers": 3,
"language": "en",
"group_segments": true
}
}'Voice Conversion with RVC v2
基于RVC v2的声音转换
bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "rvc-v2",
"version": "0.0.1",
"input": {
"input_audio": "https://example.com/vocals.wav",
"rvc_model": "CUSTOM",
"custom_rvc_model_download_url": "https://example.com/my-voice-model.zip",
"pitch_change": 0,
"output_format": "wav"
}
}'bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "rvc-v2",
"version": "0.0.1",
"input": {
"input_audio": "https://example.com/vocals.wav",
"rvc_model": "CUSTOM",
"custom_rvc_model_download_url": "https://example.com/my-voice-model.zip",
"pitch_change": 0,
"output_format": "wav"
}
}'Merge Audio with Video
音视频合并
bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "ffmpeg-api-merge-audio-video",
"version": "0.0.1",
"input": {
"video_url": "https://example.com/video.mp4",
"audio_url": "https://example.com/narration.mp3",
"start_offset": 0
}
}'bash
curl -X POST https://api.eachlabs.ai/v1/prediction \
-H "Content-Type: application/json" \
-H "X-API-Key: $EACHLABS_API_KEY" \
-d '{
"model": "ffmpeg-api-merge-audio-video",
"version": "0.0.1",
"input": {
"video_url": "https://example.com/video.mp4",
"audio_url": "https://example.com/narration.mp3",
"start_offset": 0
}
}'ElevenLabs Voice IDs
ElevenLabs 语音ID
The model supports these voice IDs. Pass the raw ID string:
elevenlabs-text-to-speech| Voice ID | Notes |
|---|---|
| Default voice |
| — |
| — |
| — |
| — |
| — |
| — |
| — |
| — |
| — |
elevenlabs-text-to-speech| 语音ID | 说明 |
|---|---|
| 默认语音 |
| — |
| — |
| — |
| — |
| — |
| — |
| — |
| — |
| — |
Parameter Reference
参数参考
See references/MODELS.md for complete parameter details for each model.
如需查看各模型的完整参数详情,请参阅references/MODELS.md。