Builds data infrastructure — ETL/ELT pipelines, data warehousing, stream processing, data quality, orchestration (Airflow/Dagster), and analytics engineering (dbt). Use when the user asks to build data pipelines, set up ETL/ELT workflows, design a data warehouse, configure stream processing, or implement analytics engineering with dbt, Airflow, or Dagster.
```bash
npx skill4agent add buiphucminhtam/forgewright data-engineer
```

```bash
cat skills/_shared/protocols/ux-protocol.md 2>/dev/null || true
```

```bash
cat .production-grade.yaml 2>/dev/null || echo "No config — using defaults"
```

```
Source → Ingestion → Raw Layer → Transform → Clean Layer → Marts → Consumers
↑ validate   ↑ schema check   ↑ quality tests   ↑ freshness SLA
```

| Layer | Purpose | Quality | Consumers |
|---|---|---|---|
| Bronze / Raw | Exact copy from source | Uncleaned | Data engineers only |
| Silver / Clean | Deduplicated, typed, validated | High | Data scientists, analysts |
| Gold / Marts | Business logic applied, aggregated | Curated | Dashboards, reports, APIs |
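
To make the layer progression in the table concrete, here is a minimal sketch in plain Python rather than in a warehouse. The sample rows and the names `BRONZE`, `to_silver`, and `to_gold` are hypothetical, chosen only for illustration; a real pipeline would typically express these steps as SQL or dbt models.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical bronze rows: an exact copy of the source, duplicates and bad values included.
BRONZE = [
    {"id": "1", "created_at": "2024-01-01T10:00:00", "status": "paid", "total_cents": "1250"},
    {"id": "1", "created_at": "2024-01-01T10:00:00", "status": "paid", "total_cents": "1250"},
    {"id": "2", "created_at": "2024-01-02T09:30:00", "status": "paid", "total_cents": "800"},
    {"id": "3", "created_at": "not-a-date", "status": "refunded", "total_cents": "500"},
    {"id": "4", "created_at": "2024-01-03T12:00:00", "status": "refunded", "total_cents": "300"},
]


def to_silver(bronze_rows):
    """Silver: deduplicate on id, cast to real types, drop rows that fail validation."""
    seen, silver = set(), []
    for row in bronze_rows:
        if row["id"] in seen:
            continue  # duplicate of a row already accepted
        try:
            typed = {
                "order_id": int(row["id"]),
                "ordered_at": datetime.fromisoformat(row["created_at"]),
                "status": row["status"],
                "total_amount": int(row["total_cents"]) / 100.0,
            }
        except (TypeError, ValueError):
            continue  # assumption: invalid rows are skipped; a real pipeline might quarantine them
        seen.add(row["id"])
        silver.append(typed)
    return silver


def to_gold(silver_rows):
    """Gold: apply business logic, here total revenue per order status."""
    totals = defaultdict(float)
    for row in silver_rows:
        totals[row["status"]] += row["total_amount"]
    return dict(totals)


silver = to_silver(BRONZE)
gold = to_gold(silver)
print(len(BRONZE), len(silver), gold)
```

Bronze keeps all five rows untouched, silver keeps only the three that are unique and valid, and gold reduces them to one figure per status for dashboards and reports.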
```sql
-- models/staging/stg_orders.sql
with source as (
    select * from {{ source('raw', 'orders') }}
)

select
    id as order_id,
    cast(created_at as timestamp) as ordered_at,
    status,
    total_cents / 100.0 as total_amount
from source
where id is not null
```

```bash
dbt test
```
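
The `dbt test` step runs assertions against built models, and a test passes when it finds no failing rows. As a rough illustration of what dbt's built-in `not_null` and `unique` tests would check on `order_id`, here is a sketch in plain Python. The sample rows and the helper names `not_null_failures` and `unique_failures` are hypothetical; the source does not show which tests are configured for `stg_orders`.

```python
from collections import Counter

# Hypothetical rows shaped like the output of stg_orders.
# The model filters out null ids but does not deduplicate, so a repeated key can survive.
STG_ORDERS = [
    {"order_id": 1, "status": "paid", "total_amount": 12.5},
    {"order_id": 2, "status": "paid", "total_amount": 8.0},
    {"order_id": 2, "status": "refunded", "total_amount": 8.0},
]


def not_null_failures(rows, column):
    """Rows where the column is missing; an empty list means the check passes."""
    return [r for r in rows if r.get(column) is None]


def unique_failures(rows, column):
    """Non-null values that occur more than once; an empty list means the check passes."""
    counts = Counter(r[column] for r in rows if r.get(column) is not None)
    return sorted(value for value, n in counts.items() if n > 1)


failures = {
    "not_null_order_id": not_null_failures(STG_ORDERS, "order_id"),
    "unique_order_id": unique_failures(STG_ORDERS, "order_id"),
}
for name, failing in failures.items():
    print(name, "PASS" if not failing else f"FAIL ({len(failing)})")
```

In this sample the not-null check passes because the model already filters null ids, while the uniqueness check flags `order_id` 2, which suggests deduplication belongs in the clean layer.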