fish-audio
Original:🇺🇸 English
Translated
Generate AI text-to-speech audio with Fish Audio and browse public reference voices via AceDataCloud API. Use when creating voiceover/narration audio (TTS), synthesizing multilingual speech, or selecting a Fish reference voice from the model catalog.
12installs
Sourceacedatacloud/skills
Added on
NPX Install
npx skill4agent add acedatacloud/skills fish-audioTags
Translated version includes tags in frontmatterSKILL.md Content
View Translation Comparison →Fish Audio — Text-to-Speech
Generate narration / voiceover through AceDataCloud's Fish Audio API.
Setup: See authentication for token setup.
Quick Start
bash
curl -X POST https://api.acedata.cloud/fish/tts \
-H "Authorization: ******ACEDATACLOUD_API_TOKEN" \
-H "Content-Type: application/json" \
-H "model: s2-pro" \
-d '{"text":"你好,欢迎使用 AceData Cloud。","reference_id":"d7900c21663f485ab63ebdb7e5905036","format":"mp3"}'Synchronous responses return a direct audio URL:
json
{"audio_url":"https://platform.r2.fish.audio/task/8a72ff9840234006a9f74cb2fa04f978.mp3"}Endpoints
| Endpoint | Purpose |
|---|---|
| Text-to-speech generation |
| Browse/search public Fish reference voices |
| Poll async TTS jobs when |
Workflows
1. Find a reference voice
bash
curl "https://api.acedata.cloud/fish/model?page_size=10&page_number=1&title=Marcus" \
-H "Authorization: ******ACEDATACLOUD_API_TOKEN"The response includes with public voice metadata such as , ,
, , , and . Use an item as
in TTS requests.
items[]_idtitlelanguagestagsvisibilitystate_idreference_id2. Text-to-Speech
json
POST /fish/tts
Headers:
model: s2-pro
{
"text": "Your narration text.",
"reference_id": "d7900c21663f485ab63ebdb7e5905036",
"format": "mp3"
}3. Async TTS
json
POST /fish/tts
Headers:
model: s1
{
"text": "Longer narration for background processing.",
"async": true,
"callback_url": "https://api.acedata.cloud/health"
}Async: See async task polling. Poll viawithPOST /fish/tasks.{"id":"..."}
Parameters — /fish/tts
/fish/ttsHeader
| Parameter | Values | Description |
|---|---|---|
| | Fish TTS engine selection |
JSON body
| Parameter | Type / Values | Description |
|---|---|---|
| string | Text to synthesize (required) |
| string | Public/reference voice ID from |
| | Output format |
| integer | Optional output sample rate |
| | MP3 bitrate |
| integer | Opus bitrate |
| | TTS latency mode |
| integer | Chunking controls |
| number | Sampling controls |
| integer | Maximum generated tokens |
| boolean | Normalize generated audio |
| object | Prosody tuning |
| array | Additional reference objects |
| string | Async callback URL |
| boolean | Run asynchronously and poll |
Gotchas
- The documented TTS endpoint is — not
POST /fish/tts./fish/audios - Choose the Fish engine with the request header, not a JSON
modelfield.model - Use from
reference_id— notGET /fish/model.voice_id - Synchronous requests return directly; async jobs should be polled via
audio_url./fish/tasks - The current OpenAPI spec documents voice browsing via ; it does not document a voice-cloning write endpoint.
GET /fish/model