Search Results: llm-cost-optimization

Found 5 Skills

AI & Machine Learning | aradotso/codex-skills

codexsaver-cost-router

Route low-risk coding tasks to cheaper LLMs while keeping Codex for high-risk decisions, using MCP tools for cost-aware delegation
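
As a rough illustration of cost-aware delegation, the sketch below sends a task to a cheaper model unless it looks risky. The keyword list, the file-count threshold, and the `"primary"`/`"cheap"` labels are all hypothetical; they are not taken from the skill itself, which may classify risk differently.

```python
from dataclasses import dataclass

# Hypothetical markers of a high-risk task (assumption, not from the skill).
HIGH_RISK_KEYWORDS = ("auth", "migration", "security", "payment", "delete")
# Hypothetical cutoff: changes touching more files than this stay on the primary model.
MAX_FILES_FOR_CHEAP_MODEL = 5


@dataclass
class Route:
    model: str   # "primary" (e.g. Codex) or "cheap" (a lower-cost LLM)
    reason: str  # short explanation, useful for logging the decision


def route_task(description: str, files_touched: int) -> Route:
    """Pick a model tier for a coding task using simple risk heuristics."""
    text = description.lower()
    hits = [kw for kw in HIGH_RISK_KEYWORDS if kw in text]
    if hits:
        return Route("primary", f"high-risk keyword: {hits[0]}")
    if files_touched > MAX_FILES_FOR_CHEAP_MODEL:
        return Route("primary", "touches many files")
    return Route("cheap", "low-risk task")
```

For example, `route_task("rename a local variable", 1)` is routed to the cheap tier, while `route_task("rewrite auth middleware", 1)` stays on the primary model.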

AI & Machine Learning | vesely/skills

context-audit

Audit your Claude Code setup for token waste and context bloat. Use when the user says "audit my context", "check my settings", "why is Claude so slow", "token optimization", "context audit", or runs /context-audit. Starts by running /context to see real overhead, then audits MCP servers, CLAUDE.md rules, skills, settings, and file permissions. Returns a health score with specific fixes.

AI & Machine Learning | sagargupta16/claude-cost-...

cost-mode

Cost-conscious Claude Code mode. Reduces output tokens 40-70% and overall costs 30-60% by enforcing concise responses, smart model routing, and efficient workflow patterns. Keeps full technical accuracy. Activate with /cost-mode or "enable cost mode". Auto-triggers on mentions of budget, cost, tokens, or spending.

AI & Machine Learning | yonatangross/orchestkit

semantic-caching

Redis semantic caching for LLM applications. Use when implementing vector similarity caching, optimizing LLM costs through cached responses, or building multi-level cache hierarchies.

2 scripts/Checked
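
To make the idea of vector similarity caching concrete, here is a minimal in-memory sketch. It deliberately swaps Redis for a plain Python list and uses a toy bag-of-words embedding in place of a real embedding model, so the `embed` function, the `SemanticCache` class, and the 0.8 threshold are illustrative assumptions rather than the skill's implementation.

```python
import math
from collections import Counter


def embed(text):
    # Toy embedding (assumption): word counts. A real cache would call an embedding model.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Return a cached response when a new prompt is similar enough to an old one."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # hypothetical similarity cutoff
        self.entries = []           # list of (embedding, response); stands in for Redis

    def get(self, prompt):
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self.entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        # Only a sufficiently similar prompt counts as a hit; otherwise call the LLM.
        return best_response if best_score >= self.threshold else None

    def put(self, prompt, response):
        self.entries.append((embed(prompt), response))
```

A caller would check `cache.get(prompt)` first and only pay for an LLM call, followed by `cache.put(prompt, response)`, when the lookup returns `None`.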

AI & Machine Learning | cosmicstack-labs/mercury-...

token-budget-tracking

Track, optimize, and control token consumption across multi-agent systems. Covers budget allocation, real-time monitoring, cost attribution, per-agent limits, and proactive cost optimization for production LLM deployments.
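
Per-agent limits and cost attribution can be sketched in a few lines. The `TokenBudget` class, the agent names, and the flat price per 1,000 tokens below are hypothetical and only illustrate the bookkeeping; the skill's own approach may differ.

```python
from collections import defaultdict


class BudgetExceeded(Exception):
    """Raised when recording usage would push an agent past its token limit."""


class TokenBudget:
    def __init__(self, limits, price_per_1k_tokens):
        self.limits = dict(limits)        # agent name -> max tokens (absent = unlimited)
        self.price = price_per_1k_tokens  # hypothetical flat price, same for all agents
        self.used = defaultdict(int)      # agent name -> tokens consumed so far

    def record(self, agent, tokens):
        """Add usage for an agent, refusing it if the agent's limit would be exceeded."""
        limit = self.limits.get(agent)
        if limit is not None and self.used[agent] + tokens > limit:
            raise BudgetExceeded(f"{agent} would exceed its limit of {limit} tokens")
        self.used[agent] += tokens

    def remaining(self, agent):
        """Tokens left for an agent, or None if the agent has no limit."""
        limit = self.limits.get(agent)
        return None if limit is None else limit - self.used[agent]

    def cost_report(self):
        """Attribute cost to each agent based on the tokens it has used."""
        return {agent: used / 1000 * self.price for agent, used in self.used.items()}
```

Calling `record` before each LLM request turns the limit into a hard stop, while `cost_report` gives the per-agent attribution needed for monitoring.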
