Search Results: image-generation

Found 428 Skills

AI & Machine Learning | mrgoonie/claudekit-skills

ai-multimodal

Process and generate multimedia content using the Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis, up to 9.5 hours), understanding images (captioning, object detection, OCR, visual Q&A, segmentation), processing videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extracting content from documents (PDF tables, forms, charts, diagrams, multi-page), and generating images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.

6 scripts/Attention

AI & Machine Learning | cnemri/google-genai-skill...

nano-banana-use

Generate, edit, and compose images using Gemini Nano Banana models via portable Python scripts. Handles authentication via API Key or Vertex AI environment variables. Available parameters: prompt, model, aspect-ratio, safety-filter-level. Always confirm parameters with the user or explicitly state defaults before running.

3 scripts/Checked
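
The parameter handling described for nano-banana-use can be sketched roughly as follows. This is only an illustration, not the skill's actual script: the function names, the default values, and the choice of environment variables (`GEMINI_API_KEY`, `GOOGLE_API_KEY`, `GOOGLE_GENAI_USE_VERTEXAI`) are assumptions, since the listing names the four parameters but gives no further detail.

```python
import argparse

# Hypothetical defaults for illustration; the listing does not state the real ones.
DEFAULTS = {
    "model": "example-image-model",
    "aspect_ratio": "1:1",
    "safety_filter_level": "default",
}


def build_parser():
    """Build a CLI parser exposing the four parameters named in the listing."""
    parser = argparse.ArgumentParser(description="Illustrative image-generation CLI")
    parser.add_argument("--prompt", required=True, help="Text prompt for the image")
    parser.add_argument("--model", default=DEFAULTS["model"])
    parser.add_argument("--aspect-ratio", default=DEFAULTS["aspect_ratio"])
    parser.add_argument("--safety-filter-level", default=DEFAULTS["safety_filter_level"])
    return parser


def describe_parameters(args):
    """Return a summary that marks which values are defaults, for user confirmation."""
    lines = []
    for name in ("prompt", "model", "aspect_ratio", "safety_filter_level"):
        value = getattr(args, name)
        is_default = name in DEFAULTS and value == DEFAULTS[name]
        suffix = " (default)" if is_default else ""
        lines.append(f"{name.replace('_', '-')}: {value}{suffix}")
    return "\n".join(lines)


def detect_auth(env):
    """Guess the auth mode from environment variables (variable names are assumed)."""
    if env.get("GOOGLE_GENAI_USE_VERTEXAI", "").lower() in ("1", "true"):
        return "vertex-ai"
    if env.get("GEMINI_API_KEY") or env.get("GOOGLE_API_KEY"):
        return "api-key"
    return "none"
```

Calling `describe_parameters(build_parser().parse_args(["--prompt", "a red fox"]))` yields a summary in which every value except the prompt is tagged `(default)`, which is one way to state defaults explicitly before running anything.
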

AI & Machine Learning | hmbown/minimax-cli

storybook-lesson

Create a kid-friendly learning card with an illustration and narrated audio.

AI & Machine Learning | binhmuc/autobot-review

ai-artist

Write and optimize prompts for AI-generated outcomes across text and image models. Use when crafting prompts for LLMs (Claude, GPT, Gemini), image generators (Midjourney, DALL-E, Stable Diffusion, Imagen, Flux), or video generators (Veo, Runway). Covers prompt structure, style keywords, negative prompts, chain-of-thought, few-shot examples, iterative refinement, and domain-specific patterns for marketing, code, and creative writing.

AI & Machine Learning | thepexcel/agent-skills

art-director

Creates professional AI image/video prompts with a photographer's and cinematographer's eye. Specializes in composition, lighting, color grading, and storytelling. Use when generating AI images/videos with artistic vision or working with models like Nano Banana Pro, Qwen, Sora2, Wan 2.2. For graphic design work (thumbnails, banners, layouts), use /graphic-designer instead.

AI & Machine Learning | eachlabs/skills

eachlabs-product-visuals

Generate professional e-commerce product photography and videos using EachLabs AI models. Covers product shots, background replacement, lifestyle scenes, and 360-degree views. Use when the user needs product images for e-commerce or marketing.

AI & Machine Learning | xsir0/xsir-skills

google-gemini-media

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech, and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".

19 scripts/Checked

AI & Machine Learning | shinchven/nano-banana-ski...

character-reference-sheet

Generates a 1:1 split-screen (front/back) character reference sheet, mirroring facial, physical, and costume details from an uploaded image.

AI & Machine Learning | eachlabs/skills

pinterest-pin-generation

Generate Pinterest pin images using each::sense AI. Create standard pins, idea pins, product pins, recipe pins, infographics, and more, all optimized for Pinterest's formats and best practices.

AI & Machine Learning | bluewaves-creations/bluew...

photographer-lindbergh

Generate images in Peter Lindbergh's iconic black and white style. Use when users ask for Lindbergh style, raw authentic beauty, emotional B&W portraits, supermodel aesthetic, or unretouched natural photography.

1 scripts/Checked

Tools & Utilities | freestylefly/canghe-skill...

canghe-xhs-images

Generates Xiaohongshu (Little Red Book) infographic series with 10 visual styles and 8 layouts. Breaks content into 1-10 cartoon-style images optimized for XHS engagement. Use when the user mentions "小红书图片", "XHS images", "RedNote infographics", "小红书种草", or wants social media infographics for Chinese platforms.

Tools & Utilities | vercel-labs/json-render

json-render-image

Image renderer for json-render that turns JSON specs into SVG and PNG images via Satori. Use when working with @json-render/image, generating OG images from JSON, creating social cards, or rendering AI-generated image specs.
