搜索：object-detection - AI Agent Skills

AI & Machine Learningmicrosoft/agent-skills

azure-ai-vision-imageanalysis-py

Azure AI Vision Image Analysis SDK for captions, tags, objects, OCR, people detection, and smart cropping. Use for computer vision and image understanding tasks. Triggers: "image analysis", "computer vision", "OCR", "object detection", "ImageAnalysisClient", "image caption".

🇺🇸|EnglishTranslated

51

AI & Machine Learningtondevrel/scientific-agen...

opencv

Open Source Computer Vision Library (OpenCV) for real-time image processing, video analysis, object detection, face recognition, and camera calibration. Use when working with images, videos, cameras, edge detection, contours, feature detection, image transformations, object tracking, optical flow, or any computer vision task.

🇺🇸|EnglishTranslated

21

Tools & Utilitiesnodnarbnitram/claude-code...

frigate-configurator

Configure Frigate NVR with optimized YAML, object detection, recording, zones, and hardware acceleration. Use when setting up Frigate cameras, troubleshooting detection issues, configuring Coral TPU/OpenVINO, or integrating with Home Assistant.

🇺🇸|EnglishTranslated

20

1 scripts/Attention

AI & Machine Learninghuggingface/skills

huggingface-vision-trainer

Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers classifier), and SAM/SAM2 segmentation using Hugging Face Transformers on Hugging Face Jobs cloud GPUs. Covers COCO-format dataset preparation, Albumentations augmentation, mAP/mAR evaluation, accuracy metrics, SAM segmentation with bbox/point prompts, DiceCE loss, hardware selection, cost estimation, Trackio monitoring, and Hub persistence. Use when users mention training object detection, image classification, SAM, SAM2, segmentation, image matting, DETR, D-FINE, RT-DETR, ViT, timm, MobileNet, ResNet, bounding box models, or fine-tuning vision models on Hugging Face Jobs.

🇺🇸|EnglishTranslated

18

5 scripts/Checked

AI & Machine Learningvoxel51/fiftyone-skills

fiftyone-dataset-inference

Run ML model inference (YOLO, YOLOv8, CLIP, SAM, Detectron2, etc.) on FiftyOne datasets. Use when running models, applying detection, classification, segmentation, embeddings, or any model prediction task. Also use for end-to-end workflows that include importing data then running inference.

🇺🇸|EnglishTranslated

16

AI & Machine Learninghuggingface/skills

hugging-face-vision-trainer

Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers classifier), and SAM/SAM2 segmentation using Hugging Face Transformers on Hugging Face Jobs cloud GPUs. Covers COCO-format dataset preparation, Albumentations augmentation, mAP/mAR evaluation, accuracy metrics, SAM segmentation with bbox/point prompts, DiceCE loss, hardware selection, cost estimation, Trackio monitoring, and Hub persistence. Use when users mention training object detection, image classification, SAM, SAM2, segmentation, image matting, DETR, D-FINE, RT-DETR, ViT, timm, MobileNet, ResNet, bounding box models, or fine-tuning vision models on Hugging Face Jobs.

🇺🇸|EnglishTranslated

15

5 scripts/Checked

AI & Machine Learningnvidia/skills

tao-train-pointpillars

PointPillars for 3D object detection from LiDAR point clouds. Encodes point clouds into a pseudo-image via a pillar-based representation, then applies 2D detection — used in autonomous driving and robotics. Use when training, evaluating, exporting, pruning, retraining, or running inference for a TAO PointPillars model. Trigger phrases include "train PointPillars", "LiDAR 3D detection", "point-cloud object detection", "pillar-based 3D detector".

🇺🇸|EnglishTranslated

15

AI & Machine Learningerichowens/some_claude_sk...

computer-vision-pipeline

Build production computer vision pipelines for object detection, tracking, and video analysis. Handles drone footage, wildlife monitoring, and real-time detection. Supports YOLO, Detectron2, TensorFlow, PyTorch. Use for archaeological surveys, conservation, security. Activate on "object detection", "video analysis", "YOLO", "tracking", "drone footage". NOT for simple image filters, photo editing, or face recognition APIs.

🇺🇸|EnglishTranslated

14

2 scripts/Checked

AI & Machine Learningjeremylongshore/claude-co...

processing-computer-vision-tasks

Process images using object detection, classification, and segmentation. Use when requesting "analyze image", "object detection", "image classification", or "computer vision". Trigger with relevant phrases based on skill purpose.

🇺🇸|EnglishTranslated

13

3 scripts/Checked

AI & Machine Learningpromptingcompany/nv-skill...

tao-train-sparse4d

Sparse4D for multi-camera temporal 3D object detection and tracking. Uses sparse queries with deformable attention across camera views and time for end-to-end 3D perception, with an instance bank for temporal tracking. Use when training, evaluating, exporting, quantizing, or running inference for a TAO Sparse4D model. Trigger phrases include "train Sparse4D", "multi-camera 3D detection", "temporal 3D tracker", "sparse query 3D perception".

🇺🇸|EnglishTranslated

13

AI & Machine Learningalirezarezvani/claude-ski...

senior-computer-vision

Computer vision engineering skill for object detection, image segmentation, and visual AI systems. Covers CNN and Vision Transformer architectures, YOLO/Faster R-CNN/DETR detection, Mask R-CNN/SAM segmentation, and production deployment with ONNX/TensorRT. Includes PyTorch, torchvision, Ultralytics, Detectron2, and MMDetection frameworks. Use when building detection pipelines, training custom models, optimizing inference, or deploying vision systems.

🇺🇸|EnglishTranslated

12

3 scripts/Attention

AI & Machine Learningpluginagentmarketplace/cu...

computer-vision

Image processing, object detection, segmentation, and vision models. Use for image classification, object detection, or visual analysis tasks.

🇺🇸|EnglishTranslated

12

1 scripts/Checked

Search Results: object-detection

azure-ai-vision-imageanalysis-py

opencv

frigate-configurator

huggingface-vision-trainer

fiftyone-dataset-inference

hugging-face-vision-trainer

tao-train-pointpillars

computer-vision-pipeline

processing-computer-vision-tasks

tao-train-sparse4d

senior-computer-vision

computer-vision

Search Results: object-detection

azure-ai-vision-imageanalysis-py

opencv

frigate-configurator

huggingface-vision-trainer

fiftyone-dataset-inference

hugging-face-vision-trainer

tao-train-pointpillars

computer-vision-pipeline

processing-computer-vision-tasks

tao-train-sparse4d

senior-computer-vision

computer-vision