Total 50,365 skills; AI & Machine Learning has 8,464 skills
Showing 12 of 8,464 skills
Generate and edit high-quality images using Google's Nano Banana Pro (Gemini 3 Pro Image) AI model. Use this skill when working with AI image generation or editing tasks including (1) Creating images from text prompts with professional quality, (2) Editing existing images with natural language instructions, (3) Multi-turn iterative image refinement, (4) Product photography and marketing visuals, (5) Creating infographics and diagrams with text, (6) Photorealistic and artistic image generation, (7) Batch image generation workflows, (8) Testing and troubleshooting the Gemini image generation API. Supports up to 4K resolution, multiple aspect ratios, reference images, and advanced prompting techniques for cinema-quality results.
Create and orchestrate multi-agent clusters to complete complex tasks. Use this skill when users need to split complex tasks across multiple specialized agents for parallel or serial execution. Applicable scenarios: (1) complex projects requiring multi-role collaboration (planning, research, coding, writing, design, analysis, review); (2) multiple independent sub-tasks that can run in parallel to improve efficiency; (3) work that benefits from a professional division of labor to optimize cost and quality. Keywords: multi-agent, agent cluster, task orchestration, parallel execution, agent team.
Generate text content using Google Gemini models via scripts/. Use for text generation, multimodal prompts with images, thinking mode for complex reasoning, JSON-formatted outputs, and Google Search grounding for real-time information. Triggers on "generate with gemini", "use gemini for text", "AI text generation", "multimodal prompt", "gemini thinking mode", "grounded response".
Use this skill when the user wants any MCP-capable agent or IDE assistant to interact with Google ADK agents through the adk-agent-extension MCP server. Trigger for requests like wiring ADK tools into Codex/Claude Code/Cursor/Cline/Gemini, registering a stdio MCP server, listing ADK servers/agents, creating sessions, and chatting with ADK agents.
Build agents specialized in conducting thorough research, gathering information from multiple sources, and synthesizing findings. Covers research planning, source evaluation, and report generation. Use when automating market research, competitive analysis, literature reviews, or intelligence gathering.
Generates custom Claude Code subagents with specialized expertise. Activates when user wants to create a subagent, specialized agent, or task-specific AI assistant. Creates properly formatted .md files with YAML frontmatter, suggests tool restrictions and model selection, generates effective system prompts. Use when user mentions "create subagent", "new agent", "specialized agent", "task-specific agent", or wants isolated context for domain-specific work.
Local speech-to-text via the Handy app (push-to-talk) and NeMo CLI scripts. Parakeet V3: 25 languages, auto-detection, ~30x realtime on M4 Max, 6% WER. Use this skill when transcribing audio files or dictating voice input.
No Polling for Background Agents
Write reliable prompts for Agentica/REPL agents that avoid LLM instruction ambiguity
Spawn Agentica multi-agent patterns
Interactive workspace discovery: learn what tools, workflows, agents, and hooks are available
Designs album concepts, tracklist architecture, and thematic planning through 7 structured phases. Use when planning a new album or reworking an existing album concept.