Search Results: image-generation

Found 429 Skills

image-gen

Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate image", "配图", "create picture", "draw", "visualize", "generate an image".

🇺🇸|EnglishTranslated

AI & Machine Learningmckruz/comfyui-expert

comfyui-prompt-engineer

Craft model-specific prompts optimized for the target checkpoint and identity method. Handles FLUX, SDXL, SD1.5, and Wan video models with proper syntax, quality tags, and negative prompts. Use when generating or refining prompts for ComfyUI workflows.

🇺🇸|EnglishTranslated

AI & Machine Learningwxul/openrouter-generate-...

generate-image

Use when the user needs to generate images, UI assets, icons, backgrounds, placeholders, or any visual content. Triggers on requests like "generate an image", "create a picture", "make an icon", "I need a visual for...".

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningjamditis/claude-skills-jo...

nano-banana-image-gen

Use when generating images with Gemini models, choosing between Nano Banana 1/2/Pro, optimizing image generation costs, writing image prompts, or needing visual grounding with real-world reference images

🇺🇸|EnglishTranslated

AI & Machine Learningjezweb/claude-skills

ai-image-generator

Generate AI images using Gemini or GPT APIs directly. Covers model selection (Gemini for scenes, GPT for transparent icons), the 5-part prompting framework, API calling patterns, multi-turn editing, and quality assurance. Produces photorealistic scenes, icons, illustrations, OG images, and product shots. Use when building websites that need images, creating marketing assets, or generating visual content. Triggers: 'generate image', 'ai image', 'create hero image', 'make an icon', 'generate illustration', 'create og image', 'ai art', 'image generation'.

🇺🇸|EnglishTranslated

AI & Machine Learningminimax-ai/skills

minimax-multimodal-toolkit

MiniMax multimodal model skill — use MiniMax Multi-Modal models for speech, music, video, and image. Create voice, music, video, and images with MiniMax AI: TTS (text-to-speech, voice cloning, voice design, multi-segment), music (songs, instrumentals), video (text-to-video, image-to-video, start-end frame, subject reference, templates, long-form multi-scene), image (text-to-image, image-to-image with character reference), and media processing (convert, concat, trim, extract). Use when the user mentions MiniMax, multimodal generation, or wants speech/music/video/image AI, MiniMax APIs, or FFmpeg workflows alongside MiniMax outputs.

🇺🇸|EnglishTranslated

9 scripts/Attention

Product & Designchunpu/agent-skills

character-design

Generate character design drawings that serve the narrative and core essence of characters, adopting a 16:9 horizontal three-view mode. It is anti-template and anti-cliché, enabling characters to have memorable points and high recognition.

🇨🇳|ChineseTranslated

AI & Machine Learningzai-org/glm-skills

glm-image-gen

Official skill for generating high-quality images from text prompts using ZhiPu GLM-Image API. Excellent at scientific illustrations, high-quality portraits, social media graphics, and commercial posters. Supports multiple aspect ratios, HD quality, and watermark control. Use this skill when the user wants to generate images, create AI art, text-to-image, or convert text descriptions into visual content.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningagricidaniel/banana-claud...

banana

AI image generation Creative Director powered by Google Gemini Nano Banana models. Use this skill for ANY request involving image creation, editing, visual asset production, or creative direction. Triggers on: generate an image, create a photo, edit this picture, design a logo, make a banner, visual for my anything, and all /banana commands. Handles text-to-image, image editing, multi-turn creative sessions, batch workflows, and brand presets.

🇺🇸|EnglishTranslated

7 scripts/Checked

AI & Machine Learningglebis/claude-skills

nano-banana

Generate and edit images using Google's Gemini image generation models (Nano Banana family). Supports style presets, platform-specific sizing (YouTube/slides/blog), variants, image editing via inlineData, reference images for style transfer, and organized output with metadata. Default model is Nano Banana 2 (gemini-3.1-flash-image-preview). Key is auto-decrypted via SOPS.

🇺🇸|EnglishTranslated

2 scripts/Checked

Product & Designfactory-ai/factory-plugin...

visual-design

Image generation and presentations. Use when: - User asks for images: logos, icons, app assets, diagrams, flowcharts, architecture diagrams, patterns, textures, photo edits, restorations - User needs a presentation or slide deck Covers nanobanana CLI for image generation and Slidev for presentations.

🇺🇸|EnglishTranslated

AI & Machine Learningwanshuiyin/auto-claude-co...

paper-illustration-image2

Generate publication-quality academic illustrations through a local Codex app-server bridge that uses Codex native image generation. This is a separate experimental alternative to `paper-illustration`, intended for Claude Code users who want a GPT-image-style renderer without modifying the original skill.

🇺🇸|EnglishTranslated