gemini-3-image-generation

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Gemini 3 Pro Image Generation (Nano Banana Pro)

Gemini 3 Pro Image 图像生成指南(Nano Banana Pro)

Comprehensive guide for generating images with Gemini 3 Pro Image (
gemini-3-pro-image-preview
), also known as Nano Banana Pro. This skill focuses on IMAGE OUTPUT (generating images) - see
gemini-3-multimodal
for INPUT (analyzing images).
本指南详细介绍如何使用Gemini 3 Pro Image(
gemini-3-pro-image-preview
,又称Nano Banana Pro)生成图像。本技能聚焦于图像输出(生成图像)——如需了解图像输入(分析图像),请查看
gemini-3-multimodal
技能。

Overview

概述

Gemini 3 Pro Image (Nano Banana Pro 🍌) is Google's image generation model featuring native 4K support, text rendering within images, grounded generation with Google Search, and conversational editing capabilities.
Gemini 3 Pro Image(Nano Banana Pro 🍌)是谷歌推出的图像生成模型,支持原生4K分辨率、图像内文本渲染、基于Google Search的事实锚定生成,以及对话式编辑功能。

Key Capabilities

核心功能

  • 4K Resolution: Native 4K generation with upscaling to 2K/4K
  • Text Rendering: High-quality text within images
  • Grounded Generation: Fact-verified images using Google Search
  • Conversational Editing: Multi-turn image modification preserving context
  • Aspect Ratios: Supports 16:9 and custom ratios at 4K
  • Quality Control: Fine-tuned generation parameters
  • 4K分辨率: 原生4K生成,支持升级至2K/4K分辨率
  • 文本渲染: 在图像中生成高质量文本
  • 事实锚定生成: 利用Google Search生成符合事实的图像
  • 对话式编辑: 多轮迭代修改图像,保留上下文
  • 宽高比: 支持4K分辨率下的16:9及自定义宽高比
  • 质量控制: 可微调生成参数

When to Use This Skill

适用场景

  • Generating images from text prompts
  • Creating 4K resolution images
  • Rendering text within images
  • Fact-verified image generation (grounded)
  • Conversational image editing
  • Multi-turn image refinement
  • Custom aspect ratio images

  • 从文本提示词生成图像
  • 创建4K分辨率图像
  • 在图像中渲染文本
  • 生成符合事实的锚定图像
  • 对话式图像编辑
  • 多轮图像优化
  • 生成自定义宽高比的图像

Quick Start

快速入门

Prerequisites

前置条件

  • Gemini API setup (see
    gemini-3-pro-api
    skill)
  • Model:
    gemini-3-pro-image-preview
  • Gemini API配置完成(详见
    gemini-3-pro-api
    技能)
  • 模型:
    gemini-3-pro-image-preview

Python Quick Start

Python快速入门

python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")

Use the image generation model

使用图像生成模型

model = genai.GenerativeModel("gemini-3-pro-image-preview")
model = genai.GenerativeModel("gemini-3-pro-image-preview")

Generate image

生成图像

response = model.generate_content("A serene mountain landscape at sunset")
response = model.generate_content("A serene mountain landscape at sunset")

Save image

保存图像

if response.parts: with open("generated_image.png", "wb") as f: f.write(response.parts[0].inline_data.data) print("Image saved!")
undefined
if response.parts: with open("generated_image.png", "wb") as f: f.write(response.parts[0].inline_data.data) print("Image saved!")
undefined

Node.js Quick Start

Node.js快速入门

typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
import fs from "fs";

const genAI = new GoogleGenerativeAI("YOUR_API_KEY");
const model = genAI.getGenerativeModel({ model: "gemini-3-pro-image-preview" });

const result = await model.generateContent("A serene mountain landscape at sunset");
const imageData = result.response.parts[0].inlineData.data;

fs.writeFileSync("generated_image.png", Buffer.from(imageData, "base64"));
console.log("Image saved!");

typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
import fs from "fs";

const genAI = new GoogleGenerativeAI("YOUR_API_KEY");
const model = genAI.getGenerativeModel({ model: "gemini-3-pro-image-preview" });

const result = await model.generateContent("A serene mountain landscape at sunset");
const imageData = result.response.parts[0].inlineData.data;

fs.writeFileSync("generated_image.png", Buffer.from(imageData, "base64"));
console.log("Image saved!");

Core Tasks

核心任务

Task 1: Generate Image from Text Prompt

任务1:从文本提示词生成图像

Goal: Create high-quality images from text descriptions.
Python Example:
python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")

model = genai.GenerativeModel(
    "gemini-3-pro-image-preview",
    generation_config={
        "thinking_level": "high",  # Best quality
        "temperature": 1.0
    }
)
目标: 根据文本描述生成高质量图像。
Python示例:
python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")

model = genai.GenerativeModel(
    "gemini-3-pro-image-preview",
    generation_config={
        "thinking_level": "high",  # 最佳质量
        "temperature": 1.0
    }
)

Generate image

生成图像

prompt = """A futuristic cityscape at night with:
  • Neon lights and holographic advertisements
  • Flying vehicles
  • Tall skyscrapers with unique architecture
  • Rain-slicked streets reflecting the lights
  • Cinematic, detailed, 4K quality"""
response = model.generate_content(prompt)
prompt = """A futuristic cityscape at night with:
  • Neon lights and holographic advertisements
  • Flying vehicles
  • Tall skyscrapers with unique architecture
  • Rain-slicked streets reflecting the lights
  • Cinematic, detailed, 4K quality"""
response = model.generate_content(prompt)

Save image

保存图像

if response.parts and hasattr(response.parts[0], 'inline_data'): image_data = response.parts[0].inline_data.data with open("futuristic_city.png", "wb") as f: f.write(image_data) print("Image generated successfully!") else: print("No image generated")

**Tips for Better Prompts:**
- Be specific and detailed
- Specify art style (realistic, cartoon, oil painting, etc.)
- Include lighting, mood, and atmosphere
- Mention quality level (4K, detailed, high-quality)
- Describe colors, textures, composition

**See:** `references/generation-guide.md` for comprehensive prompting techniques

---
if response.parts and hasattr(response.parts[0], 'inline_data'): image_data = response.parts[0].inline_data.data with open("futuristic_city.png", "wb") as f: f.write(image_data) print("Image generated successfully!") else: print("No image generated")

**提示词优化技巧:**
- 描述要具体、详细
- 指定艺术风格(写实、卡通、油画等)
- 包含光线、氛围和情绪描述
- 提及质量等级(4K、细节丰富、高质量)
- 描述颜色、纹理和构图

**参考:** `references/generation-guide.md` 中的完整提示词技巧

---

Task 2: Generate 4K Images

任务2:生成4K图像

Goal: Create high-resolution 4K images with upscaling.
Python Example:
python
undefined
目标: 创建高分辨率4K图像,支持分辨率升级。
Python示例:
python
undefined

Generate with 4K quality specification

生成4K质量的图像

prompt = """A photorealistic portrait of a scientist in a modern lab:
  • 4K ultra-high definition
  • Sharp focus on subject
  • Soft bokeh background
  • Professional studio lighting
  • Fine detail in textures
  • Cinema-grade quality"""
response = model.generate_content(prompt)
prompt = """A photorealistic portrait of a scientist in a modern lab:
  • 4K ultra-high definition
  • Sharp focus on subject
  • Soft bokeh background
  • Professional studio lighting
  • Fine detail in textures
  • Cinema-grade quality"""
response = model.generate_content(prompt)

4K image will be generated

将生成4K图像

if response.parts: with open("scientist_4k.png", "wb") as f: f.write(response.parts[0].inline_data.data)

**4K Features:**
- Native 4K resolution support
- Upscaling to 2K/4K
- 16:9 aspect ratio at 4K
- Enhanced detail and clarity

**See:** `references/resolution-guide.md` for resolution control

---
if response.parts: with open("scientist_4k.png", "wb") as f: f.write(response.parts[0].inline_data.data)

**4K特性:**
- 原生4K分辨率支持
- 可升级至2K/4K
- 4K分辨率下支持16:9宽高比
- 细节更丰富、清晰度更高

**参考:** `references/resolution-guide.md` 中的分辨率控制指南

---

Task 3: Render Text in Images

任务3:在图像中渲染文本

Goal: Generate images with readable, high-quality text.
Python Example:
python
prompt = """Create a professional business card design with:
- Company name: "TechVision AI"
- Text: "Dr. Sarah Chen"
- Text: "Chief AI Officer"
- Text: "sarah.chen@techvision.ai"
- Text: "+1 (555) 123-4567"
- Modern, clean design
- Professional fonts
- Blue and white color scheme
- All text clearly readable"""

response = model.generate_content(prompt)

if response.parts:
    with open("business_card.png", "wb") as f:
        f.write(response.parts[0].inline_data.data)
Text Rendering Best Practices:
  • Explicitly specify text content in quotes
  • Request "readable" or "clearly visible" text
  • Keep text short and simple
  • Specify font style if desired
  • Use high contrast backgrounds
See:
references/generation-guide.md
for text rendering techniques

目标: 生成包含清晰可读高质量文本的图像。
Python示例:
python
prompt = """Create a professional business card design with:
- Company name: "TechVision AI"
- Text: "Dr. Sarah Chen"
- Text: "Chief AI Officer"
- Text: "sarah.chen@techvision.ai"
- Text: "+1 (555) 123-4567"
- Modern, clean design
- Professional fonts
- Blue and white color scheme
- All text clearly readable"""

response = model.generate_content(prompt)

if response.parts:
    with open("business_card.png", "wb") as f:
        f.write(response.parts[0].inline_data.data)
文本渲染最佳实践:
  • 用引号明确指定文本内容
  • 要求“可读”或“清晰可见”的文本
  • 文本内容简洁简短
  • 如需指定字体风格请明确说明
  • 使用高对比度背景
参考:
references/generation-guide.md
中的文本渲染技巧

Task 4: Grounded Generation (Fact-Verified Images)

任务4:事实锚定生成(事实校验图像)

Goal: Generate factually accurate images using Google Search grounding.
Python Example:
python
undefined
目标: 利用Google Search锚定功能生成符合事实的准确图像。
Python示例:
python
undefined

Enable Google Search grounding for factual accuracy

启用Google Search锚定以确保事实准确性

model_grounded = genai.GenerativeModel( "gemini-3-pro-image-preview", tools=[{"google_search_retrieval": {}}] # Enable grounding )
prompt = """Generate an accurate image of the International Space Station with Earth in the background. Use current ISS configuration."""
response = model_grounded.generate_content(prompt)
if response.parts: with open("iss_grounded.png", "wb") as f: f.write(response.parts[0].inline_data.data)
# Check if grounding was used
if hasattr(response, 'grounding_metadata'):
    print(f"Grounding sources used: {len(response.grounding_metadata.grounding_chunks)}")

**Grounded Generation Use Cases:**
- Historical scenes (accurate to period)
- Scientific visualizations
- Current events
- Famous landmarks
- Product representations

**Benefits:**
- Factual accuracy
- Real-world grounding
- Reduced hallucination
- Up-to-date information

**Note:** Uses free Google Search quota (1,500 queries/day)

**See:** `references/grounded-generation.md` for comprehensive guide

---
model_grounded = genai.GenerativeModel( "gemini-3-pro-image-preview", tools=[{"google_search_retrieval": {}}] # 启用锚定功能 )
prompt = """Generate an accurate image of the International Space Station with Earth in the background. Use current ISS configuration."""
response = model_grounded.generate_content(prompt)
if response.parts: with open("iss_grounded.png", "wb") as f: f.write(response.parts[0].inline_data.data)
# 检查是否使用了锚定功能
if hasattr(response, 'grounding_metadata'):
    print(f"Grounding sources used: {len(response.grounding_metadata.grounding_chunks)}")

**事实锚定生成适用场景:**
- 历史场景(符合时代特征)
- 科学可视化
- 当前事件
- 著名地标
- 产品展示

**优势:**
- 事实准确性
- 真实世界锚定
- 减少幻觉
- 信息实时更新

**注意:** 免费Google Search配额(每日1500次查询)

**参考:** `references/grounded-generation.md` 中的完整指南

---

Task 5: Conversational Image Editing

任务5:对话式图像编辑

Goal: Iteratively refine images through multi-turn conversation.
Python Example:
python
model = genai.GenerativeModel("gemini-3-pro-image-preview")
目标: 通过多轮对话迭代优化图像。
Python示例:
python
model = genai.GenerativeModel("gemini-3-pro-image-preview")

Start a chat session for conversational editing

开启对话会话进行对话式编辑

chat = model.start_chat()
chat = model.start_chat()

First generation

第一次生成

response1 = chat.send_message("Create a cozy coffee shop interior")
if response1.parts: with open("coffee_shop_v1.png", "wb") as f: f.write(response1.parts[0].inline_data.data)
response1 = chat.send_message("Create a cozy coffee shop interior")
if response1.parts: with open("coffee_shop_v1.png", "wb") as f: f.write(response1.parts[0].inline_data.data)

Refine the image

优化图像

response2 = chat.send_message("Add more plants and warm lighting")
if response2.parts: with open("coffee_shop_v2.png", "wb") as f: f.write(response2.parts[0].inline_data.data)
response2 = chat.send_message("Add more plants and warm lighting")
if response2.parts: with open("coffee_shop_v2.png", "wb") as f: f.write(response2.parts[0].inline_data.data)

Further refinement

进一步优化

response3 = chat.send_message("Make it more minimalist, remove some decorations")
if response3.parts: with open("coffee_shop_v3.png", "wb") as f: f.write(response3.parts[0].inline_data.data)

**Conversational Editing Features:**
- Preserves visual context across turns
- Incremental modifications
- Natural language instructions
- Multi-turn refinement
- Context-aware changes

**Example Editing Commands:**
- "Make it darker/lighter"
- "Add more [element]"
- "Change the color scheme to [colors]"
- "Make it more realistic/artistic"
- "Remove [element]"

**See:** `references/conversational-editing.md` for advanced patterns

---
response3 = chat.send_message("Make it more minimalist, remove some decorations")
if response3.parts: with open("coffee_shop_v3.png", "wb") as f: f.write(response3.parts[0].inline_data.data)

**对话式编辑特性:**
- 多轮对话中保留视觉上下文
- 增量式修改
- 自然语言指令
- 多轮优化
- 上下文感知的变更

**示例编辑指令:**
- "调暗/调亮画面"
- "添加更多[元素]"
- "将配色改为[颜色]"
- "让画面更写实/更具艺术感"
- "移除[元素]"

**参考:** `references/conversational-editing.md` 中的高级模式

---

Task 6: Custom Aspect Ratios

任务6:自定义宽高比

Goal: Generate images in specific aspect ratios.
Python Example:
python
undefined
目标: 生成特定宽高比的图像。
Python示例:
python
undefined

16:9 aspect ratio (4K supported)

16:9宽高比(支持4K)

prompt_169 = "A cinematic landscape in 16:9 aspect ratio, 4K quality"
prompt_169 = "A cinematic landscape in 16:9 aspect ratio, 4K quality"

Square aspect ratio

正方形宽高比

prompt_square = "A square logo design for a tech company"
prompt_square = "A square logo design for a tech company"

Portrait orientation

竖版方向

prompt_portrait = "A portrait-oriented movie poster"
response = model.generate_content(prompt_169)
prompt_portrait = "A portrait-oriented movie poster"
response = model.generate_content(prompt_169)

Image will be generated in specified ratio

图像将按指定宽高比生成


**Supported Ratios:**
- **16:9** - Wide, cinematic (4K supported)
- **1:1** - Square
- **4:3** - Standard
- **9:16** - Vertical/portrait

---

**支持的宽高比:**
- **16:9** - 宽屏、电影级(支持4K)
- **1:1** - 正方形
- **4:3** - 标准比例
- **9:16** - 竖版/肖像模式

---

Task 7: Optimize Image Generation Costs

任务7:优化图像生成成本

Goal: Balance quality and cost for image generation.
Pricing:
  • Text Input: $1-2 per 1M tokens
  • Text Output: $6-9 per 1M tokens
  • Image Output: $0.134 per image (varies by resolution)
Python Cost Optimization:
python
def generate_with_cost_tracking(prompt):
    """Generate image and track costs"""

    response = model.generate_content(prompt)

    # Calculate cost
    usage = response.usage_metadata
    input_cost = (usage.prompt_token_count / 1_000_000) * 2.00
    output_cost = (usage.candidates_token_count / 1_000_000) * 9.00
    image_cost = 0.134  # Per image

    total_cost = input_cost + output_cost + image_cost

    print(f"Input tokens: {usage.prompt_token_count} (${input_cost:.6f})")
    print(f"Output tokens: {usage.candidates_token_count} (${output_cost:.6f})")
    print(f"Image cost: ${image_cost:.6f}")
    print(f"Total: ${total_cost:.6f}")

    return response

response = generate_with_cost_tracking("A beautiful sunset over mountains")
Cost Optimization Strategies:
  1. Batch Requests: Generate multiple images in one session
  2. Reuse Chat Sessions: Conversational editing is more efficient
  3. Specific Prompts: Clear prompts reduce regeneration needs
  4. Monitor Usage: Track costs per project
  5. Use Appropriate Quality: Not all images need 4K
See:
references/pricing-optimization.md
for detailed strategies

目标: 在图像生成的质量与成本间取得平衡。
定价:
  • 文本输入: 每100万令牌1-2美元
  • 文本输出: 每100万令牌6-9美元
  • 图像输出: 每张图像0.134美元(根据分辨率有所不同)
Python成本优化示例:
python
def generate_with_cost_tracking(prompt):
    """生成图像并跟踪成本"""

    response = model.generate_content(prompt)

    # 计算成本
    usage = response.usage_metadata
    input_cost = (usage.prompt_token_count / 1_000_000) * 2.00
    output_cost = (usage.candidates_token_count / 1_000_000) * 9.00
    image_cost = 0.134  # 每张图像成本

    total_cost = input_cost + output_cost + image_cost

    print(f"Input tokens: {usage.prompt_token_count} (${input_cost:.6f})")
    print(f"Output tokens: {usage.candidates_token_count} (${output_cost:.6f})")
    print(f"Image cost: ${image_cost:.6f}")
    print(f"Total: ${total_cost:.6f}")

    return response

response = generate_with_cost_tracking("A beautiful sunset over mountains")
成本优化策略:
  1. 批量请求: 在一个会话中生成多张图像
  2. 复用对话会话: 对话式编辑更高效
  3. 明确的提示词: 清晰的提示词减少重复生成需求
  4. 监控使用情况: 跟踪每个项目的成本
  5. 选择合适的质量: 并非所有图像都需要4K分辨率
参考:
references/pricing-optimization.md
中的详细策略

Batch Image Generation

批量图像生成

python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3-pro-image-preview")

prompts = [
    "A serene mountain lake at dawn",
    "A bustling market in Morocco",
    "A futuristic robot assistant",
    "An abstract geometric pattern"
]

for i, prompt in enumerate(prompts):
    print(f"Generating image {i+1}/{len(prompts)}: {prompt}")

    response = model.generate_content(prompt)

    if response.parts:
        with open(f"generated_{i+1}.png", "wb") as f:
            f.write(response.parts[0].inline_data.data)
        print(f"  Saved: generated_{i+1}.png")

python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3-pro-image-preview")

prompts = [
    "A serene mountain lake at dawn",
    "A bustling market in Morocco",
    "A futuristic robot assistant",
    "An abstract geometric pattern"
]

for i, prompt in enumerate(prompts):
    print(f"Generating image {i+1}/{len(prompts)}: {prompt}")

    response = model.generate_content(prompt)

    if response.parts:
        with open(f"generated_{i+1}.png", "wb") as f:
            f.write(response.parts[0].inline_data.data)
        print(f"  Saved: generated_{i+1}.png")

Error Handling

错误处理

python
from google.api_core import exceptions

def safe_image_generation(prompt):
    """Generate image with error handling"""

    try:
        response = model.generate_content(prompt)

        if not response.parts:
            return {"success": False, "error": "No image generated"}

        if not hasattr(response.parts[0], 'inline_data'):
            return {"success": False, "error": "Invalid response format"}

        return {
            "success": True,
            "image_data": response.parts[0].inline_data.data,
            "mime_type": response.parts[0].inline_data.mime_type
        }

    except exceptions.InvalidArgument as e:
        return {"success": False, "error": f"Invalid prompt: {e}"}
    except exceptions.ResourceExhausted as e:
        return {"success": False, "error": f"Rate limit exceeded: {e}"}
    except Exception as e:
        return {"success": False, "error": f"Error: {e}"}

python
from google.api_core import exceptions

def safe_image_generation(prompt):
    """带有错误处理的图像生成"""

    try:
        response = model.generate_content(prompt)

        if not response.parts:
            return {"success": False, "error": "No image generated"}

        if not hasattr(response.parts[0], 'inline_data'):
            return {"success": False, "error": "Invalid response format"}

        return {
            "success": True,
            "image_data": response.parts[0].inline_data.data,
            "mime_type": response.parts[0].inline_data.mime_type
        }

    except exceptions.InvalidArgument as e:
        return {"success": False, "error": f"Invalid prompt: {e}"}
    except exceptions.ResourceExhausted as e:
        return {"success": False, "error": f"Rate limit exceeded: {e}"}
    except Exception as e:
        return {"success": False, "error": f"Error: {e}"}

References

参考资料

Core Guides
  • Model Setup - Nano Banana Pro configuration
  • Generation Guide - Comprehensive prompting techniques
  • Grounded Generation - Fact-verified image creation
  • Conversational Editing - Multi-turn refinement
Optimization
  • Resolution Guide - 4K and quality control
  • Pricing Optimization - Cost management
Scripts
  • Generate Image Script - Production-ready generation
  • Grounded Generation Script - Fact-verified images
  • Edit Image Script - Conversational editing
Official Resources

核心指南
  • Model Setup - Nano Banana Pro配置
  • Generation Guide - 完整提示词技巧
  • Grounded Generation - 事实校验图像创建
  • Conversational Editing - 多轮优化
优化指南
  • Resolution Guide - 4K与质量控制
  • Pricing Optimization - 成本管理
脚本
  • Generate Image Script - 生产环境可用的生成脚本
  • Grounded Generation Script - 事实校验图像脚本
  • Edit Image Script - 对话式编辑脚本
官方资源

Related Skills

相关技能

  • gemini-3-pro-api - Basic setup, authentication, text generation
  • gemini-3-multimodal - Image INPUT (analyzing images)
  • gemini-3-advanced - Advanced features (caching, batch, tools)

  • gemini-3-pro-api - 基础配置、身份验证、文本生成
  • gemini-3-multimodal - 图像输入(分析图像)
  • gemini-3-advanced - 高级功能(缓存、批量、工具)

Best Practices

最佳实践

  1. Be Specific: Detailed prompts produce better results
  2. Specify Quality: Request 4K or high quality explicitly
  3. Use Grounding: Enable for factual accuracy
  4. Iterate Conversationally: Use chat for refinements
  5. Monitor Costs: Track usage, especially for 4K
  6. Handle Errors: Implement retry logic
  7. Save Images Properly: Use binary mode for writing

  1. 描述具体: 详细的提示词产出更好的结果
  2. 指定质量: 明确要求4K或高质量
  3. 使用锚定功能: 启用以确保事实准确性
  4. 对话式迭代: 使用会话进行优化
  5. 监控成本: 跟踪使用情况,尤其是4K图像
  6. 处理错误: 实现重试逻辑
  7. 正确保存图像: 使用二进制模式写入

Troubleshooting

故障排除

Issue: No image generated

问题:未生成图像

Solution: Check
response.parts
exists and has
inline_data
attribute
解决方案: 检查
response.parts
是否存在,且包含
inline_data
属性

Issue: Low quality images

问题:图像质量低

Solution: Add "4K", "high quality", "detailed" to prompt
解决方案: 在提示词中添加“4K”、“高质量”、“细节丰富”

Issue: Text in images unreadable

问题:图像中的文本不可读

Solution: Specify text explicitly in quotes, request "readable text"
解决方案: 用引号明确指定文本,要求“可读文本”

Issue: Images not factually accurate

问题:图像不符合事实

Solution: Enable grounded generation with Google Search
解决方案: 启用基于Google Search的事实锚定生成

Issue: High costs

问题:成本过高

Solution: Optimize prompts, batch requests, monitor usage

解决方案: 优化提示词、批量请求、监控使用情况

Summary

总结

This skill provides complete image generation capabilities:
✅ Text-to-image generation ✅ Native 4K support ✅ Text rendering in images ✅ Grounded generation (fact-verified) ✅ Conversational editing ✅ Custom aspect ratios ✅ Cost optimization ✅ Production-ready examples
Ready to generate images? Start with Task 1: Generate Image from Text Prompt above!
本技能提供完整的图像生成功能:
✅ 文本转图像生成 ✅ 原生4K支持 ✅ 图像内文本渲染 ✅ 事实锚定生成(事实校验) ✅ 对话式编辑 ✅ 自定义宽高比 ✅ 成本优化 ✅ 生产环境可用示例
准备好生成图像了吗? 从上方的任务1:从文本提示词生成图像开始吧!