Search Results: llm

Found 1,564 Skills

AI & Machine Learningolorolor/fundamentals-wit...

module1-llm-basics

LLM의 본질(확률적 토큰 예측), hallucination의 구조적 원인, temperature의 의미를 학습시키는 모듈.

AI & Machine Learningnickcrew/claude-ctx-plugi...

multi-llm-consult

Consult external LLMs (Gemini, OpenAI/Codex, Qwen) for second opinions, alternative plans, independent reviews, or delegated tasks. Use when a user asks for another model's perspective, wants to compare answers, or requests delegating a subtask to Gemini/Codex/Qwen.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningparcadei/continuous-claud...

llm-tuning-patterns

LLM Tuning Patterns

🇺🇸|EnglishTranslated

Testing & QAyonatangross/orchestkit

testing-llm

LLM and AI testing patterns — mock responses, evaluation with DeepEval/RAGAS, structured output validation, and agentic test patterns (generator, healer, planner). Use when testing AI features, validating LLM outputs, or building evaluation pipelines.

🇺🇸|EnglishTranslated

AI & Machine Learningaradotso/trending-skills

llmfit-hardware-model-matcher

Terminal tool that detects your hardware and recommends which LLM models will actually run well on your system

🇺🇸|EnglishTranslated

AI & Machine Learningabsolutelyskilled/absolut...

llm-app-development

Use this skill when building production LLM applications, implementing guardrails, evaluating model outputs, or deciding between prompting and fine-tuning. Triggers on LLM app architecture, AI guardrails, output evaluation, model selection, embedding pipelines, vector databases, fine-tuning, function calling, tool use, and any task requiring production AI application design.

🇺🇸|EnglishTranslated

AI & Machine Learningposthog/skills

exploring-llm-evaluations

Investigate LLM analytics evaluations of both types — `hog` (deterministic code-based) and `llm_judge` (LLM-prompt-based). Find existing evaluations, inspect their configuration, run them against specific generations, query individual pass/fail results, and generate AI-powered summaries of patterns across many runs. Use when the user asks to debug why an evaluation is failing, surface common failure modes, compare results across filters, dry-run a Hog evaluator, prototype a new LLM-judge prompt, or manage the evaluation lifecycle (create, update, enable/disable, delete).

🇺🇸|EnglishTranslated

AI & Machine Learningdatadog-labs/agent-skills

llm-obs-experiment-analyzer

Analyze LLM experiment results. Handles single or comparative experiments, exploratory or Q&A modes. Use when user says "analyze experiment", "compare experiments", "analyze against baseline", or provides one or two experiment IDs for analysis.

🇺🇸|EnglishTranslated

3 scripts/Checked

AI & Machine Learningllmquant/skills

llmquant-credit

Router skill for LLMQuant credit workflows. Use when the user needs issuer credit review, spread regime analysis, high-yield stress monitoring, default risk, debt maturity, or covenant context.

🇺🇸|EnglishTranslated

Data Processingllmquant/skills

llmquant-etfs

Router skill for LLMQuant ETFs workflows. Use when the user needs ETF holdings, overlap, concentration, issuer snapshot, or theme exposure analysis.

🇺🇸|EnglishTranslated

AI & Machine Learningllmquant/skills

llmquant-events

Router skill for LLMQuant event workflows. Use when the user needs earnings event briefs, M&A tracking, regulatory risk, catalysts, event calendars, or cross-asset event impact.

🇺🇸|EnglishTranslated

AI & Machine Learningllmquant/skills

llmquant-equity-derivatives

Router skill for LLMQuant equity derivatives workflows. Use when the user needs single-stock derivative, convertible, warrant, structured payoff, or hybrid security analysis.

🇺🇸|EnglishTranslated