Search Results: llm-observability

Found 32 Skills

AI & Machine Learning · phrazzld/claude-config

llm-infrastructure

Comprehensive LLM audit. Model currency, prompt quality, evals, observability, CI/CD. Ensures all LLM-powered features follow best practices and are properly instrumented. Auto-invoke when: model names/versions mentioned, AI provider config, prompt changes, .env with AI keys, aiProviders.ts or prompts.ts modified, AI-related PRs. CRITICAL: Training data lags months. ALWAYS web search before LLM decisions.

2 scripts/Attention

AI & Machine Learning · neuradex/langfuse-skills

langfuse-session-list

List Langfuse sessions. Use when checking user sessions, analyzing conversation flows, or monitoring session activity.

1 scripts/Attention

AI & Machine Learning · elastic/agent-skills

observability-llm-obs

Monitor LLMs and agentic apps: performance, token/cost, response quality, and workflow orchestration. Use when the user asks about LLM monitoring, GenAI observability, or AI cost/quality.

AI & Machine Learning · datadog-labs/agent-skills

llm-obs-eval-bootstrap

Bootstrap evaluators from production traces — emit SDK code, a framework-agnostic JSON spec, or publish online LLM-judge evaluators directly to Datadog. Use when user says "bootstrap evaluators", "generate evaluators", "create evals from traces", "eval bootstrap", "write evaluators", "build eval suite", "publish evaluators", or wants to generate BaseEvaluator/LLMJudge code or online judge configs from production LLM trace data. Works with ml_app and optional RCA report or failure hypothesis.

AI & Machine Learning · olorolor/fundamentals-wit...

module6-strategy

A module covering AI adoption strategy, Build vs Buy, prioritization, governance/security, and a six-month scaling roadmap.

AI & Machine Learning · lubu-labs/langchain-agent...

langsmith-trace-analyzer

Fetch, organize, and analyze LangSmith traces for debugging and evaluation. Use when you need to: query traces/runs by project, metadata, status, or time window; download traces to JSON; organize outcomes into passed/failed/error buckets; analyze token/message/tool-call patterns; compare passed vs failed behavior; or investigate benchmark and production failures.

3 scripts/Attention

AI & Machine Learning · litefuse/skills

litefuse

Interact with Litefuse and access its documentation. Use when needing to (1) query or modify Litefuse data programmatically via the CLI — traces, prompts, datasets, scores, sessions, and any other API resource, (2) look up Litefuse documentation, concepts, integration guides, or SDK usage, or (3) understand how any Litefuse feature works. This skill covers CLI-based API access (via npx) and multiple documentation retrieval methods.

AI & Machine Learning · langfuse/skills

langfuse-observability

Instrument LLM applications with Langfuse tracing. Use when setting up Langfuse, adding observability to LLM calls, or auditing existing instrumentation.

AI & Machine Learning · mlflow/skills

instrumenting-with-mlflow-tracing

Instruments Python and TypeScript code with MLflow Tracing for observability. Triggers on questions about adding tracing, instrumenting agents/LLM apps, getting started with MLflow tracing, or tracing specific frameworks (LangGraph, LangChain, OpenAI, DSPy, CrewAI, AutoGen). Examples: "How do I add tracing?", "How to instrument my agent?", "How to trace my LangChain app?", "Getting started with MLflow tracing", "Trace my TypeScript app".

AI & Machine Learning · scientiacapital/skills

langgraph-agents

Multi-agent systems with LangGraph: supervisor/swarm/handoff/router patterns, state coordination, Deep Agents, guardrails, testing, observability, deployment. Use when building multi-agent workflows, coordinating agents, or needing cost-optimized orchestration. Uses Claude, DeepSeek, Gemini (no OpenAI).

AI & Machine Learning · davila7/claude-code-templ...

langfuse

Expert in Langfuse, the open-source LLM observability platform. Covers tracing, prompt management, evaluation, datasets, and integration with LangChain, LlamaIndex, and OpenAI. Essential for debugging, monitoring, and improving LLM applications in production. Use when the topic is Langfuse, LLM observability, LLM tracing, prompt management, or LLM evaluation.

AI & Machine Learning · flowlines-ai/skills

integrate-flowlines-sdk-python

Integrates Flowlines observability SDK into Python LLM applications. Use when adding Flowlines telemetry, instrumenting LLM providers, or setting up OpenTelemetry-based LLM monitoring.
