Search Results: gemini

Found 486 Skills

Tools & Utilitiesagricidaniel/claude-blog

blog-audio

Generate audio narration of blog posts using Google Gemini TTS. Supports summary narration, full article read-aloud, and two-speaker podcast/dialogue mode with 30 voice options. Outputs MP3 with HTML5 audio embed code. Works standalone via /blog audio or internally from blog-write. Falls back gracefully when API key is not configured. Use when user says "blog audio", "narrate blog", "audio version", "text to speech", "tts", "podcast mode", "read aloud", "audio narration", "voice", "narration", "generate audio".

🇺🇸|EnglishTranslated

4 scripts/Checked

AI & Machine Learningivanleomk/aie-workshop-20...

prompt_to_production

A complete workshop curriculum for building an agentic application using the Gemini Interactions API. Guides the user from basic API calls to a full production coding agent.

🇺🇸|EnglishTranslated

13 scripts/Attention

AI & Machine Learninggoogle/skills

gemini-managed-agents-api

Manages custom Agent resources on Gemini Enterprise Agent Platform. Use when the user wants to programmatically create, configure, list, update, or delete stateful, server-managed Agent resources (including mounting files, skills, and tools) before executing conversations.

🇺🇸|EnglishTranslated

AI & Machine Learningaradotso/devtools-skills

gemini-antigravity-cli

Terminal AI agent CLI for Google Gemini and Antigravity models with slash commands, MCP server support, and coding assistance

🇺🇸|EnglishTranslated

AI & Machine Learningmrgoonie/claudekit-skills

ai-multimodal

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.

🇺🇸|EnglishTranslated

6 scripts/Attention

AI & Machine Learningjulianobarbosa/claude-cod...

consulting-design

Consult Gemini AI for architecture alternatives, design trade-offs, and brainstorming. Use when seeking different perspectives on design, evaluating architectural approaches, comparing solutions, or generating creative ideas.

🇺🇸|EnglishTranslated

AI & Machine Learningdnyoussef/context-cascade

multi-model-discovery

Use Gemini to find existing solutions before building from scratch. Leverages Google Search grounding to discover code examples, libraries, and best practices to avoid reinventing the wheel.

🇺🇸|EnglishTranslated

AI & Machine Learningadaptationio/skrillz

gemini-3-pro-api

Gemini 3 Pro API/SDK integration for text generation, reasoning, and chat. Covers setup, authentication, thinking levels, streaming, and production deployment. Use when working with Gemini 3 Pro API, Python SDK, Node.js SDK, text generation, chat applications, or advanced reasoning tasks.

🇺🇸|EnglishTranslated

AI & Machine Learningbuildatscale-tv/gemini-sk...

nano-banana-pro

Nano Banana Pro (nano-banana-pro) image generation skill. Use this skill when the user asks to "generate an image", "generate images", "create an image", "make an image", uses "nano banana", or requests multiple images like "generate 5 images". Generates images using Google's Gemini 2.5 Flash for any purpose - frontend designs, web projects, illustrations, graphics, hero images, icons, backgrounds, or standalone artwork. Invoke this skill for ANY image generation request.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningxsir0/xsir-skills

google-gemini-media

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".

🇺🇸|EnglishTranslated

19 scripts/Checked

AI & Machine Learningcluesmith/codev

generate-image

AI image generation CLI using Gemini. Use when generating images, checking syntax for resolution, aspect ratio, and reference image options.

🇺🇸|EnglishTranslated

AI & Machine Learningadaptationio/skrillz

gemini-3-advanced

Advanced Gemini 3 Pro features including function calling, built-in tools (Google Search, Code Execution, File Search, URL Context), structured outputs, thought signatures, context caching, batch processing, and framework integration. Use when implementing tools, function calling, structured JSON output, context caching, batch API, LangChain, Vercel AI, or production features.

🇺🇸|EnglishTranslated

5 scripts/Checked