Search Results: memory-efficiency

Found 5 Skills

nemo-mbridge-perf-moe-dispatcher-selection

Choose the right MoE token dispatcher (`alltoall`, DeepEP, or HybridEP) for the hardware, EP degree, and optimization stage. Summarizes patterns from DSV3, Qwen3, Qwen3-Next, and VLM bring-up work.

🇺🇸|EnglishTranslated

AI & Machine Learningwshobson/agents

vector-index-tuning

Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.

🇺🇸|EnglishTranslated

Backend Developmentpluginagentmarketplace/cu...

streams

Master Node.js streams for memory-efficient processing of large datasets, real-time data handling, and building data pipelines

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningactionbook/rust-skills

domain-ml

Use when building ML/AI apps in Rust. Keywords: machine learning, ML, AI, tensor, model, inference, neural network, deep learning, training, prediction, ndarray, tch-rs, burn, candle, 机器学习, 人工智能, 模型推理

🇺🇸|EnglishTranslated

AI & Machine Learningsnakeo/claude-debug-and-r...

refactor:pytorch

Refactor PyTorch code to improve maintainability, readability, and adherence to best practices. Identifies and fixes DRY violations, long functions, deep nesting, SRP violations, and opportunities for modular components. Applies PyTorch 2.x patterns including torch.compile optimization, Automatic Mixed Precision (AMP), optimized DataLoader configuration, modular nn.Module design, gradient checkpointing, CUDA memory management, PyTorch Lightning integration, custom Dataset classes, model factory patterns, weight initialization, and reproducibility patterns.

🇺🇸|EnglishTranslated