Search Results: llm-optimization

Found 39 Skills

ad-add-fusion-transformation

Claude Code skill (trtllm-agent-toolkit): implement or extend TensorRT-LLM AutoDeploy fusion transforms under transform/library/ in a TensorRT-LLM checkout. Prefer existing kernels and custom ops; use Triton only when no viable existing-kernel path exists. Use ad-graph-dump for AD_DUMP_GRAPHS_DIR workflows. Covers TRT-LLM paths, registry, default.yaml registration, graph validation, tests, and a review checklist — without prescribing profiling tools or throughput targets.

🇺🇸|EnglishTranslated

AI & Machine Learningpromptingcompany/nv-skill...

tilegym-monkey-patch-kernels-to-transformers

Integrate TileGym kernels into Hugging Face `transformers` models by replacing the library's submodule(s) and certain class(es)' implementations, and patching certain class(es)' init/forward/load weight methods prior to instantiating models. Used when the user requires integrating TileGym kernels into `transformers` models.

🇺🇸|EnglishTranslated

AI & Machine Learningwyattowalsh/agents

prompt-engineer

Comprehensive prompt and context engineering for any AI system. Four modes: (1) Craft new prompts from scratch, (2) Analyze existing prompts with diagnostic scoring and optional improvement, (3) Convert prompts between model families (Claude/GPT/Gemini/Llama), (4) Evaluate prompts with test suites and rubrics. Adapts all recommendations to model class (instruction-following vs reasoning). Validates findings against current documentation. Use for system prompts, agent prompts, RAG pipelines, tool definitions, or any LLM context design. NOT for running prompts, generating content, or building agents.

🇺🇸|EnglishTranslated

AI & Machine Learningmorphiqlabs/morphiq-labs-...

morphiq-scan

This skill should be used when the user asks to "audit a website for AI visibility", "scan a domain", "check AI readiness", "evaluate content quality", "run a Morphiq Scan", "check if a site is optimized for LLMs", or mentions scanning a website for LLM citation readiness. Performs a full AI visibility audit across 5 categories (agentic readiness, content quality, chunking & retrieval, query fanout, policy files) and scores the domain on a 100-point rubric.

🇺🇸|EnglishTranslated

2 scripts/Checked

AI & Machine Learninginterfacex-co-jp/genshiji...

genshijin

Ultra-compressed communication mode. Talk like a caveman to reduce token usage by about 75%. Full technical accuracy is maintained. Intensity levels: 3 tiers - Polite, Normal (default), Extreme. Activate by saying "Caveman Mode", "Shorten", "Be Concise", "Save Tokens", or using /genshijin.

🇨🇳|ChineseTranslated

AI & Machine Learningaradotso/trending-skills

tilekernels-gpu-kernels

Expert skill for using TileKernels, a library of optimized GPU kernels for LLM operations (MoE routing, quantization, transpose, engram gating, Manifold HyperConnection) built with TileLang.

🇺🇸|EnglishTranslated

AI & Machine Learninglwlee2608/agent-skills

writing-system-prompts

Use when writing or editing a system prompt for any LLM API or SDK (any code passing a `system=` / `system` role parameter, or a `.txt`/`.md` file holding such a prompt). Applies prompt-engineering and prompt-caching best practices.

🇺🇸|EnglishTranslated

AI & Machine Learningshipshitdev/library

context-degradation

Recognize, diagnose, and mitigate patterns of context degradation in agent systems. Use when context grows large, agent performance degrades unexpectedly, or debugging agent failures.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningjwiegley/claude-prompts

caveman

Compress and simplify prompts to preserve meaning while reducing use of context

🇺🇸|EnglishTranslated

AI & Machine Learningglebis/claude-skills

rag-eval

Iterate on RAG systems with structured evals instead of eyeballing. This skill should be used when the user is tuning a RAG pipeline — changing retrieval prompts, swapping models, adjusting chunking, or debugging poor answers — and wants a cheap, ranked set of experiments with cost tracking and structured feedback on the stack. Also use when the user asks "how do I know if my RAG is working?", "this RAG eval is burning money", or "what should I try next on retrieval?".

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learninggarrytan/gbrain

functional-area-resolver

Compress an agent's routing file (RESOLVER.md or AGENTS.md) by converting granular skill-per-row tables into functional-area dispatchers. Each area lists sub-skills in a "(dispatcher for: ...)" clause. The LLM reads one area entry and routes to the correct sub-skill. Proven via held-out A/B eval: dispatcher pattern outperforms naive pipe-table compression.

🇺🇸|EnglishTranslated

AI & Machine Learningfarmage/opencode-skills

prompt-engineer

Writes, refactors, and evaluates prompts for LLMs — generating optimized prompt templates, structured output schemas, evaluation rubrics, and test suites. Use when designing prompts for new LLM applications, refactoring existing prompts for better accuracy or token efficiency, implementing chain-of-thought or few-shot learning, creating system prompts with personas and guardrails, building JSON/function-calling schemas, or developing prompt evaluation frameworks to measure and improve model performance.

🇺🇸|EnglishTranslated