image-generation
Image Generation
Generate images using `generate_media` with `mode="image"`. The system auto-selects the best backend based on available API keys.

Quick Start
```python
# Simple text-to-image (auto-selects backend)
generate_media(prompt="A cat in space", mode="image")

# Specify backend and quality
generate_media(prompt="A logo for a coffee shop", mode="image",
               backend_type="openai", quality="high")

# Batch generation (parallel)
generate_media(prompts=["sunset over ocean", "mountain landscape", "city at night"],
               mode="image", max_concurrent=3)
```
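
The batch call above runs prompts in parallel, capped by `max_concurrent`. How the tool enforces that cap is not documented here; the following is only a minimal sketch of one way to do it, assuming a thread pool. `fake_generate` and `generate_batch` are hypothetical names used for illustration, not part of the tool.

```python
from concurrent.futures import ThreadPoolExecutor


def fake_generate(prompt):
    """Hypothetical stand-in for one single-image backend call."""
    return {"prompt": prompt, "status": "ok"}


def generate_batch(prompts, max_concurrent=3, generate=fake_generate):
    """Run one generation per prompt, at most max_concurrent at a time.

    Results are returned in the same order as the prompts.
    """
    if max_concurrent < 1:
        raise ValueError("max_concurrent must be at least 1")
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        return list(pool.map(generate, prompts))


results = generate_batch(
    ["sunset over ocean", "mountain landscape", "city at night"],
    max_concurrent=3,
)
```

A thread pool suits this kind of workload because each call mostly waits on a network response; `pool.map` also keeps results aligned with the input order.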

Backend Comparison
| Backend | Default Model | Strengths | API Key |
|---|---|---|---|
| Google (priority 1) | | Fast, flexible sizes, image editing, multi-turn | |
| OpenAI (priority 2) | | High quality, transparent backgrounds, continuation via response ID | |
| Grok (priority 3) | | 1k resolution, continuation via stored data URI | |
| OpenRouter (priority 4) | | Access to multiple models via single API | |
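
The priorities in the table suggest a simple rule: pick the highest-priority backend whose API key is present, unless `backend_type` forces one. The sketch below illustrates that rule only. The environment variable names and the `select_backend` helper are assumptions made for this example, since the actual key names are not shown in this section.

```python
# Priority order taken from the table above. The environment variable
# names are hypothetical placeholders, not the tool's real key names.
BACKEND_PRIORITY = [
    ("google", "GOOGLE_API_KEY"),
    ("openai", "OPENAI_API_KEY"),
    ("grok", "XAI_API_KEY"),
    ("openrouter", "OPENROUTER_API_KEY"),
]


def select_backend(env, backend_type=None):
    """Return the forced backend, or the highest-priority one with a key set."""
    available = [name for name, key in BACKEND_PRIORITY if env.get(key)]
    if backend_type is not None:
        if backend_type not in available:
            raise ValueError(f"No API key configured for backend {backend_type!r}")
        return backend_type
    if not available:
        raise RuntimeError("No image backend API key is configured")
    return available[0]


# With only OpenAI and OpenRouter keys set, OpenAI wins on priority.
chosen = select_backend({"OPENAI_API_KEY": "sk-test", "OPENROUTER_API_KEY": "or-test"})
```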
Key Parameters
| Parameter | Description | Example |
|---|---|---|
| `prompt` | Text description of the image | |
| `backend_type` | Force a specific backend | |
| `model` | Override default model | `"gemini-3-pro-image-preview"` for studio quality |
| `quality` | Image quality (OpenAI) | |
| | Image dimensions | See backends reference |
| | Aspect ratio | |
| `input_images` | Source images for image-to-image editing | |
| `continue_from` | Continuation ID for multi-turn editing | |
Image-to-Image Editing
Transform existing images by providing `input_images`:

```python
generate_media(
    prompt="Make it look like a watercolor painting",
    mode="image",
    input_images=["photo.jpg"]
)
```

Supported backends for image-to-image: Google (Gemini), OpenAI, Grok. If your current backend doesn't support it, the system automatically selects one that does.
Multi-Turn Editing (Continuation)
Iteratively refine images using `continue_from`:

```python
# First generation
result = generate_media(prompt="A logo for a coffee shop", mode="image")

# Refine using the continuation ID
result2 = generate_media(
    prompt="Make the text larger and add a cup icon",
    mode="image",
    continue_from=result["continuation_id"]
)
```

Each backend uses a different continuation mechanism:
- **OpenAI**: Passes `previous_response_id` (stateless)
- **Google Gemini**: In-memory chat store (LRU, 50 items)
- **Grok**: In-memory data URI store (LRU, 50 items)

Continuation only works for single image generation (not batch).
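
The Gemini and Grok mechanisms are both described as in-memory LRU stores capped at 50 items. The tool's actual store is not shown here; as a rough illustration of what that behaviour implies, here is a minimal LRU sketch built on `collections.OrderedDict`. The `ContinuationStore` class and its method names are hypothetical.

```python
from collections import OrderedDict


class ContinuationStore:
    """Small LRU map from continuation ID to backend-specific state."""

    def __init__(self, max_items=50):
        self.max_items = max_items
        self._items = OrderedDict()

    def put(self, continuation_id, state):
        self._items[continuation_id] = state
        self._items.move_to_end(continuation_id)  # mark as most recently used
        while len(self._items) > self.max_items:
            self._items.popitem(last=False)       # evict least recently used

    def get(self, continuation_id):
        if continuation_id not in self._items:
            return None                           # evicted or never stored
        self._items.move_to_end(continuation_id)
        return self._items[continuation_id]


# Tiny capacity so the eviction is easy to see.
store = ContinuationStore(max_items=2)
store.put("a", "chat-a")
store.put("b", "chat-b")
store.get("a")            # touching "a" makes "b" the oldest entry
store.put("c", "chat-c")  # exceeds capacity, so "b" is evicted
```

Under this kind of design, a continuation ID can stop resolving once enough newer generations have been stored, or after the process restarts, since nothing is persisted.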

Google: Gemini vs Imagen
Google supports two API paths. Gemini (Nano Banana 2) is the default and is recommended for most use cases. Imagen is only needed for advanced reference-image editing features.
- Gemini models (`gemini-*`), `generate_content()`: text-to-image, image editing via `input_images`, multi-turn continuation
- Imagen models (`imagen-*`), `generate_images()`/`edit_image()`: text-to-image with `negative_prompt`/`seed`/`guidance_scale`, plus style transfer, control editing, and subject consistency via reference images

For studio-quality precision and text rendering, use `model="gemini-3-pro-image-preview"` (Pro-tier).
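
Since the two Google paths are distinguished by model name prefix, the routing can be pictured as a prefix check. This is a sketch under that assumption only; `google_api_path` and its return labels are hypothetical and are not identifiers used by the tool.

```python
def google_api_path(model):
    """Map a Google model name to one of the two API paths described above.

    The returned labels are illustrative only.
    """
    if model.startswith("gemini-"):
        return "gemini"  # generate_content(): text-to-image, editing, continuation
    if model.startswith("imagen-"):
        return "imagen"  # generate_images() / edit_image(): reference-image editing
    raise ValueError(f"Not a recognized Google image model: {model!r}")


path = google_api_path("gemini-3-pro-image-preview")
```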

Need More Control?
- Per-backend sizes, quality options, and quirks: See references/backends.md
- Complete `extra_params` reference: See references/extra_params.md
- Advanced editing (inpainting, style transfer, control, subject): See references/editing.md