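| Parameter | Description | Example |
|---|---|---|
| (positional) | Text to convert (required) | `"Hello, world! Have a wonderful day."` |
| `--voice` | Voice name | `Kore` |
| `--output` | Base name for output file | `welcome` |
| `--output-dir` | Output directory for audio | `./my-projects/podcasts/` |
| `--no-timestamp` | Disable auto timestamp | Flag |
| | TTS model | |
| `--stream` | Enable streaming | Flag |
| `--speakers` | Multi-speaker mapping | `"Joe:Kore,Jane:Puck"` |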
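
The output-related options combine into a file path such as `audio/welcome_YYYYMMDD_HHMMSS.wav`. The following is a minimal sketch of that naming scheme, not the actual code in `scripts/tts.py`; the function name `build_output_path` and its keyword arguments are hypothetical, and the defaults (`tts_output`, `audio`) are inferred from the example outputs in this document.

```python
from datetime import datetime
from pathlib import Path


def build_output_path(base_name="tts_output", output_dir="audio",
                      timestamp=True, now=None):
    """Return the .wav path for a base name and directory (hypothetical helper)."""
    stem = base_name
    if timestamp:
        # Append _YYYYMMDD_HHMMSS, matching the pattern in the example outputs.
        moment = now or datetime.now()
        stem = f"{base_name}_{moment.strftime('%Y%m%d_%H%M%S')}"
    return Path(output_dir) / f"{stem}.wav"


if __name__ == "__main__":
    print(build_output_path("welcome"))                    # timestamped name
    print(build_output_path("my-audio", timestamp=False))  # fixed name
```

Passing `timestamp=False` mirrors `--no-timestamp`, which yields a fixed name that a later run would overwrite.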

```bash
python scripts/tts.py "Hello, world! Have a wonderful day."
```

Voice: `Kore`. Output: `audio/tts_output_YYYYMMDD_HHMMSS.wav`

```bash
python scripts/tts.py "Welcome to our podcast about technology trends" --voice Puck --output welcome
```

Output: `audio/welcome_YYYYMMDD_HHMMSS.wav`

```bash
python scripts/tts.py "TTS the following conversation:
Joe: How's it going today?
Jane: Not too bad, how about you?
Joe: I'm working on a new project.
Jane: Sounds exciting, tell me more!" --speakers "Joe:Kore,Jane:Puck" --output conversation
```

Output: `audio/conversation_YYYYMMDD_HHMMSS.wav`

```bash
python scripts/tts.py "This is a very long text that would benefit from streaming..." --stream --output long-form
```

Output: `audio/long-form_YYYYMMDD_HHMMSS.wav`

```bash
python scripts/tts.py "Welcome to our quarterly earnings presentation. Today we'll discuss our growth metrics and future plans." --voice Charon --output voiceover
```

Voice: `Charon`

```bash
python scripts/tts.py "Save to specific folder." --output-dir ./my-projects/podcasts/ --output episode1
```

Output: `./my-projects/podcasts/episode1_YYYYMMDD_HHMMSS.wav`

- Best for: Podcasts, audiobooks, video narration
- Combines with: gemini-text for script generation

```bash
python scripts/tts.py "Welcome to our accessible website. This audio describes our main navigation options." --voice Aoede --output accessibility
```

Voice: `Aoede`

```bash
python scripts/tts.py "Chapter 1: Introduction to Quantum Computing. Let's explore the fundamental principles..." --voice Zephyr --output chapter1
```

Voice: `Zephyr`

```bash
python scripts/tts.py "Fixed filename." --output my-audio --no-timestamp
```

Output: `audio/my-audio.wav`

| Model | Quality | Speed | Best For |
|---|---|---|---|
| | Good | Fast | General use, high volume |
| | Higher | Slower | Premium content, voiceovers |

| Voice | Characteristics | Best For |
|---|---|---|
| Kore | Clear, professional | Announcements, general purpose (default) |
| Puck | Friendly, conversational | Casual content, interviews |
| Charon | Deep, authoritative | Corporate, serious content |
| Fenrir | Warm, expressive | Storytelling, narratives |
| Aoede | Melodic, pleasant | Educational, accessibility |
| Zephyr | Light, airy | Gentle content, tutorials |
| Sulafat | Neutral, balanced | Documentaries, factual content |
| Specification | Value |
|---|---|
| Format | WAV (PCM) |
| Sample rate | 24000 Hz |
| Channels | 1 (mono) |
| Bit depth | 16-bit |
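
To make the audio specification concrete, here is a minimal sketch that wraps raw PCM samples in a WAV container with these exact parameters, using only Python's standard `wave` module. The helper names `write_wav` and `duration_seconds` are hypothetical and are not taken from `scripts/tts.py`; the sketch assumes the audio arrives as raw 16-bit mono PCM bytes.

```python
import io
import wave

SAMPLE_RATE = 24000   # Hz, from the specification table
CHANNELS = 1          # mono
SAMPLE_WIDTH = 2      # bytes per sample (16-bit)


def write_wav(target, pcm_bytes):
    """Write raw 16-bit mono PCM to a WAV file path or seekable file object."""
    with wave.open(target, "wb") as wav_file:
        wav_file.setnchannels(CHANNELS)
        wav_file.setsampwidth(SAMPLE_WIDTH)
        wav_file.setframerate(SAMPLE_RATE)
        wav_file.writeframes(pcm_bytes)


def duration_seconds(pcm_bytes):
    """Playback length implied by the byte count at this format."""
    return len(pcm_bytes) / (SAMPLE_RATE * CHANNELS * SAMPLE_WIDTH)


if __name__ == "__main__":
    silence = b"\x00\x00" * SAMPLE_RATE   # one second of silence
    buffer = io.BytesIO()
    write_wav(buffer, silence)
    print(duration_seconds(silence))
```

At this format each second of audio occupies 48,000 bytes of PCM data (24,000 samples of 2 bytes each), which is a quick way to estimate file sizes.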
| Limit | Type | Description |
|---|---|---|
| 8,192 | Input | Maximum input text tokens |
| 16,384 | Output | Maximum output audio tokens |

- Install the dependency: `pip install google-genai`
- `--speakers` takes a mapping in the form `SpeakerName:VoiceName,Speaker2:Voice2`, for example `"Joe:Kore,Jane:Puck,Host:Charon"`
- `--output` sets the base name of the output file

| Voice | Ideal Use Cases |
|---|---|
| Kore | Announcements, navigation, general info |
| Puck | Podcasts, interviews, casual content |
| Charon | Corporate, news, formal presentations |
| Fenrir | Audiobooks, stories, emotional content |
| Aoede | Accessibility, educational, gentle content |
| Zephyr | Tutorials, explanations, guides |
| Sulafat | Documentaries, factual presentations |
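
The `--speakers` value pairs each speaker with one of these voices in the form `SpeakerName:VoiceName,Speaker2:Voice2`. The sketch below shows one way such a string could be parsed into a dictionary; `parse_speakers` is a hypothetical helper, and the real parsing in `scripts/tts.py` may differ (for instance in how it treats whitespace or duplicate names).

```python
def parse_speakers(mapping):
    """Parse 'Joe:Kore,Jane:Puck' into {'Joe': 'Kore', 'Jane': 'Puck'}."""
    speakers = {}
    for pair in mapping.split(","):
        # partition() splits on the first colon only.
        name, sep, voice = pair.partition(":")
        name, voice = name.strip(), voice.strip()
        if not sep or not name or not voice:
            raise ValueError(f"Expected SpeakerName:VoiceName, got {pair!r}")
        speakers[name] = voice
    return speakers


if __name__ == "__main__":
    print(parse_speakers("Joe:Kore,Jane:Puck,Host:Charon"))
```

Rejecting malformed pairs early gives a clearer error than passing an incomplete mapping on to the TTS request.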

Reference: `references/voices.md`