Search Results: data-pipeline

Found 61 Skills

Data Processingmicrosoft/skills-for-fabr...

e2e-medallion-architecture

Implement end-to-end Medallion Architecture (Bronze/Silver/Gold) lakehouse patterns in Microsoft Fabric using PySpark, Delta Lake, and Fabric Pipelines. Use when the user wants to: (1) design a Bronze/Silver/Gold data lakehouse, (2) set up multi-layer workspace with lakehouses for each tier, (3) build ingestion-to-analytics pipelines with data quality enforcement, (4) optimize Spark configurations per medallion layer, (5) orchestrate Bronze-to-Silver-to-Gold flows via notebooks. Triggers: "medallion architecture", "bronze silver gold", "lakehouse layers", "e2e data pipeline", "end-to-end lakehouse", "data lakehouse pattern", "multi-layer lakehouse", "build medallion", "setup medallion".

🇺🇸|EnglishTranslated

Data Processingfatfingererr/macro-skills

nickel-concentration-risk-analyzer

以全球鎳供給結構為核心，量化各國的主導程度（例如印尼）、主要礦區供給量、以及政策配額/減產情境對全球供需平衡與價格非對稱的影響。

🇺🇸|EnglishTranslated

6 scripts/Checked

Data Processingfatfingererr/macro-skills

lithium-supply-demand-gap-radar

Integrate the lithium industry chain (mining → refined chemicals → batteries and end demand) into a set of computable proxy indicators; then map these indicators to the component exposure and long-term price trends of lithium-themed ETFs (such as LIT) to form a basis for decision-making.

🇨🇳|ChineseTranslated

7 scripts/Checked

Data Processingadaptyvbio/protein-design...

protein-qc

Quality control metrics and filtering thresholds for protein design. Use this skill when: (1) Evaluating design quality for binding, expression, or structure, (2) Setting filtering thresholds for pLDDT, ipTM, PAE, (3) Checking sequence liabilities (cysteines, deamidation, polybasic clusters), (4) Creating multi-stage filtering pipelines, (5) Computing PyRosetta interface metrics (dG, SC, dSASA), (6) Checking biophysical properties (instability, GRAVY, pI), (7) Ranking designs with composite scoring. This skill provides research-backed thresholds from binder design competitions and published benchmarks.

🇺🇸|EnglishTranslated

Data Processingjeremylongshore/claude-co...

flink-job-creator

Flink Job Creator - Auto-activating skill for Data Pipelines. Triggers on: flink job creator, flink job creator Part of the Data Pipelines skill category.

🇺🇸|EnglishTranslated

Data Processingg1joshi/agent-skills

airflow

Apache Airflow workflow orchestration. Use for data pipelines.

🇺🇸|EnglishTranslated

Data Processinggemini-cli-extensions/dat...

dbt-bigquery

Expert guidance for creating, modifying, and optimizing dbt pipelines for BigQuery. Use this skill whenever user asks for generating or modifying a dbt model or project. Activate this skill when the user - Creates, modifies, or troubleshoots **dbt models or pipelines** - Needs to **optimize SQL** within a dbt project - Is **setting up a new dbt project** or configuring existing one

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesgemini-cli-extensions/dat...

gcp-pipeline-orchestration

This skill helps the agent generate or update orchestration pipeline definitions for Google Cloud Composer to initialize orchestration pipeline or update the orchestration definition for orchestration of various data pipelines, like dbt pipelines, notebooks, Spark jobs, Dataform, Python scripts or inline BigQuery SQL queries. This skill also helps deploy and trigger orchestration pipelines.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingastronomer/agents

authoring-dags

Workflow and best practices for writing Apache Airflow DAGs. Use when the user wants to create a new DAG, write pipeline code, or asks about DAG patterns and conventions. For testing and debugging DAGs, see the testing-dags skill.

🇺🇸|EnglishTranslated

Data Processingmarkdown-viewer/skills

data-analytics

Create data analytics and data pipeline diagrams using PlantUML syntax with analytics/database stencil icons. Best for ETL pipelines, data lakes, real-time streaming, data warehousing, and BI dashboards. NOT for simple flowcharts (use mermaid) or general cloud infra (use cloud skill).

🇺🇸|EnglishTranslated

Data Processinggemini-cli-extensions/dat...

gcp-dataflow

Provides guidance for writing, packaging and executing Apache Beam pipelines on GCP using Cloud Dataflow. Use when: - Creating an Apache Beam Dataflow pipeline. - Creating a Google Flex Template.

🇺🇸|EnglishTranslated

Data Processingnvidia-nemo/datadesigner

data-designer

Use when the user wants to create a dataset, generate synthetic data, or build a data generation pipeline.

🇺🇸|EnglishTranslated

1 scripts/Checked