Search Results: data-pipelines

Found 44 Skills

Data Processingjeremylongshore/claude-co...

pyspark-transformer

Pyspark Transformer - Auto-activating skill for Data Pipelines. Triggers on: pyspark transformer, pyspark transformer Part of the Data Pipelines skill category.

🇺🇸|EnglishTranslated

Data Processingadaptyvbio/protein-design...

protein-qc

Quality control metrics and filtering thresholds for protein design. Use this skill when: (1) Evaluating design quality for binding, expression, or structure, (2) Setting filtering thresholds for pLDDT, ipTM, PAE, (3) Checking sequence liabilities (cysteines, deamidation, polybasic clusters), (4) Creating multi-stage filtering pipelines, (5) Computing PyRosetta interface metrics (dG, SC, dSASA), (6) Checking biophysical properties (instability, GRAVY, pI), (7) Ranking designs with composite scoring. This skill provides research-backed thresholds from binder design competitions and published benchmarks.

🇺🇸|EnglishTranslated

Data Processingjeremylongshore/claude-co...

flink-job-creator

Flink Job Creator - Auto-activating skill for Data Pipelines. Triggers on: flink job creator, flink job creator Part of the Data Pipelines skill category.

🇺🇸|EnglishTranslated

Data Processingg1joshi/agent-skills

airflow

Apache Airflow workflow orchestration. Use for data pipelines.

🇺🇸|EnglishTranslated

Data Processinggemini-cli-extensions/dat...

dbt-bigquery

Expert guidance for creating, modifying, and optimizing dbt pipelines for BigQuery. Use this skill whenever user asks for generating or modifying a dbt model or project. Activate this skill when the user - Creates, modifies, or troubleshoots **dbt models or pipelines** - Needs to **optimize SQL** within a dbt project - Is **setting up a new dbt project** or configuring existing one

🇺🇸|EnglishTranslated

Data Processingsickn33/antigravity-aweso...

data-engineer

Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms. Use PROACTIVELY for data pipeline design, analytics infrastructure, or modern data stack implementation.

🇺🇸|EnglishTranslated

Data Processinggemini-cli-extensions/dat...

gcp-dataflow

Provides guidance for writing, packaging and executing Apache Beam pipelines on GCP using Cloud Dataflow. Use when: - Creating an Apache Beam Dataflow pipeline. - Creating a Google Flex Template.

🇺🇸|EnglishTranslated

Data Processingalirezarezvani/claude-ski...

senior-data-engineer

Data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack. Includes data modeling, pipeline orchestration, data quality, and DataOps. Use when designing data architectures, building data pipelines, optimizing data workflows, implementing data governance, or troubleshooting data issues.

🇺🇸|EnglishTranslated

3 scripts/Attention

Data Processingsecondsky/sap-skills

sap-hana-cloud-data-intelligence

Develops data processing pipelines, integrations, and machine learning scenarios in SAP Data Intelligence Cloud. Use when building graphs/pipelines with operators, integrating ABAP/S4HANA systems, creating replication flows, developing ML scenarios with JupyterLab, or using Data Transformation Language functions. Covers Gen1/Gen2 operators, subengines (Python, Node.js, C++), structured data operators, and repository objects.

🇺🇸|EnglishTranslated

Data Processingjeremylongshore/claude-co...

prefect-flow-builder

Prefect Flow Builder - Auto-activating skill for Data Pipelines. Triggers on: prefect flow builder, prefect flow builder Part of the Data Pipelines skill category.

🇺🇸|EnglishTranslated

Data Processingjeremylongshore/claude-co...

data-quality-checker

Data Quality Checker - Auto-activating skill for Data Pipelines. Triggers on: data quality checker, data quality checker Part of the Data Pipelines skill category.

🇺🇸|EnglishTranslated

Data Processingmembranedev/application-s...

google-cloud-dataflow

Google Cloud Dataflow integration. Manage data, records, and automate workflows. Use when the user wants to interact with Google Cloud Dataflow data.

🇺🇸|EnglishTranslated