Loading...
Loading...
Found 3 Skills
Horizontal session personality overlay — auto-detects conversation mode from density signals, defaults casual, upgrades to structured only on sustained signal. Includes CommitMono aesthetic preference and MoE/thinking-chain runtime awareness.
Mistral AI efficient open models. Use for efficient AI.
Review, design, and refactor TensorRT-LLM PyTorch MoE code for architecture fit, clean code, maintainability, and testability. Always use for any modification, review, refactor, or design planning that touches MoE modules, including tensorrt_llm/_torch/modules/fused_moe, ConfigurableMoE, MoE backends, MoEScheduler/moe_scheduler.py, forward execution/chunking, communication strategies, EPLB, quantization/weight handling, routing, factories, MoE docs, or MoE tests. Also use when the user asks whether a MoE design follows the current architecture or whether a MoE refactor is reasonable.