Search Results: model-deployment

Found 53 Skills

DevOps & Cloud Servicespromptingcompany/nv-skill...

dynamo-recipe-runner

Select, validate, patch, and deploy existing NVIDIA Dynamo Kubernetes recipes. Use for model/backend/GPU/deployment-mode recipe bring-up; use router-starter for router-only mode work and troubleshoot for broken deployments.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningdavila7/claude-code-templ...

modal-serverless-gpu

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

🇺🇸|EnglishTranslated

AI & Machine Learningmembranedev/application-s...

mosaicml

MosaicML integration. Manage data, records, and automate workflows. Use when the user wants to interact with MosaicML data.

🇺🇸|EnglishTranslated

AI & Machine Learningreplicate/skills

build-models

Package and build custom AI models with Cog for deployment on Replicate. Use when creating a cog.yaml or predict.py, defining model inputs and outputs, loading model weights at setup time, building Docker images for ML models, serving locally with cog serve or cog predict, or porting a HuggingFace, GitHub, or ComfyUI model to run on Replicate. Trigger on phrases like "build a model", "package a model", "create a Cog model", "wrap a model", "containerize an AI model", "predict.py", "cog.yaml", "BasePredictor", or "Cog container", and when referencing cog.run, github.com/replicate/cog, or github.com/replicate/cog-examples. Covers GPU and CUDA setup, pget for fast weight downloads, async predictors with continuous batching, streaming outputs, and cold-boot optimization for image, video, audio, and LLM models. For pushing built models to Replicate, see publish-models. For running existing models, see run-models.

🇺🇸|EnglishTranslated

AI & Machine Learningaffaan-m/everything-claud...

mle-workflow

Production machine-learning engineering workflow for data contracts, reproducible training, model evaluation, deployment, monitoring, and rollback. Use when building, reviewing, or hardening ML systems beyond one-off notebooks.

🇺🇸|EnglishTranslated