Loading...
Loading...
Found 20 Skills
Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices.
Best practices for scikit-learn machine learning, model development, evaluation, and deployment in Python
The industry standard library for machine learning in Python. Provides simple and efficient tools for predictive data analysis, covering classification, regression, clustering, dimensionality reduction, model selection, and preprocessing.
Use when "scikit-learn", "sklearn", "machine learning", "classification", "regression", "clustering", or asking about "train test split", "cross validation", "hyperparameter tuning", "ML pipeline", "random forest", "SVM", "preprocessing"
Scikit-learn machine learning library. Use for classical ML.
Debug Scikit-learn issues systematically. Use when encountering model errors like NotFittedError, shape mismatches between train and test data, NaN/infinity value errors, pipeline configuration issues, convergence warnings from optimizers, cross-validation failures due to class imbalance, data leakage causing suspiciously high scores, or preprocessing errors with ColumnTransformer and feature alignment.
Refactor Scikit-learn and machine learning code to improve maintainability, reproducibility, and adherence to best practices. This skill transforms working ML code into production-ready pipelines that prevent data leakage and ensure reproducible results. It addresses preprocessing outside pipelines, missing random_state parameters, improper cross-validation, and custom transformers not following sklearn API conventions. Implements proper Pipeline and ColumnTransformer patterns, systematic hyperparameter tuning, and appropriate evaluation metrics.
This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.
Advanced sub-skill for scikit-learn focused on model interpretability, feature importance, and diagnostic tools. Covers global and local explanations using built-in inspection tools and SHAP/LIME integrations.
Train ML models with scikit-learn, PyTorch, TensorFlow. Use for classification/regression, neural networks, hyperparameter tuning, or encountering overfitting, underfitting, convergence issues.
Machine-learning prediction strategy framework via Longbridge Securities — walk-forward rolling training with feature engineering (MACD, RSI, Bollinger Band width, volume change rate) and a scikit-learn classifier (Random Forest / Gradient Boosting); retrains every 60 days, predicts 5-day direction; buy signal when probability > 0.6, sell when < 0.4; evaluates win rate, profit factor, and Sharpe ratio. Triggers: "机器学习", "ML策略", "预测模型", "随机森林", "梯度提升", "深度学习", "AI选股", "walk-forward", "機器學習", "ML策略", "預測模型", "隨機森林", "梯度提升", "machine learning", "ML strategy", "predictive model", "random forest", "gradient boosting", "AI stock selection", "walk-forward", "rolling training", "feature engineering", "scikit-learn", "XGBoost".
Supervised & unsupervised learning, scikit-learn, XGBoost, model evaluation, feature engineering for production ML