
dbt-migration-snowflake skill

/.claude/skills/dbt-migration-snowflake

This skill converts Snowflake DDL to dbt models, preserving logic while generating tests and documentation to modernize your data warehouse.

npx playbooks add skill sfc-gh-dflippo/snowflake-dbt-demo --skill dbt-migration-snowflake

Review the files below or copy the command above to add this skill to your agents.

Files (1): SKILL.md (8.7 KB)
---
name: dbt-migration-snowflake
description:
  Convert Snowflake DDL to dbt models. This skill should be used when converting views, tables, or
  stored procedures from Snowflake to dbt code, generating schema.yml files with tests and
  documentation, or migrating existing Snowflake SQL to follow dbt best practices.
---

# Snowflake to dbt Model Conversion

## Purpose

Transform Snowflake DDL (views, tables, stored procedures) into production-quality dbt models,
maintaining the same business logic and data transformation steps while following dbt best
practices.

## When to Use This Skill

Activate this skill when users ask about:

- Converting Snowflake views or tables to dbt models
- Migrating Snowflake stored procedures to dbt
- Generating schema.yml files with tests and documentation
- Modernizing existing Snowflake SQL to follow dbt best practices

---

## Task Description

You are a database engineer working for a hospital system. You need to convert Snowflake DDL to
equivalent dbt code, maintaining the same business logic and data transformation steps while
following dbt best practices.

## Input Requirements

I will provide the Snowflake DDL to convert.

## Audience

The code will be executed by data engineers who are learning Snowflake and dbt.

## Output Requirements

Generate the following:

1. One or more dbt models with complete SQL for every column
2. A corresponding schema.yml file with appropriate tests and documentation
3. A config block with materialization strategy
4. Explanation of key changes and architectural decisions
5. Inline comments highlighting any syntax that was converted

## Conversion Guidelines

### General Principles

- Replace procedural logic with declarative SQL where possible
- Break down complex procedures into multiple modular dbt models
- Implement appropriate incremental processing strategies
- Maintain data quality checks through dbt tests
- Use Snowflake SQL functions rather than macros whenever possible
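
For example (a minimal illustration; model and column names are hypothetical), prefer native
Snowflake functions over dbt's cross-database dispatch macros when the target is Snowflake only:

```sql
-- Preferred: native Snowflake function
SELECT DATE_TRUNC('month', created_date) AS created_month
FROM {{ ref('upstream_model') }}

-- Works, but unnecessary unless cross-database portability is required
-- (shown commented out for comparison):
-- SELECT {{ dbt.date_trunc('month', 'created_date') }} AS created_month
```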

### Sample Response Format

```sql
-- dbt model: models/[domain]/[target_schema_name]/model_name.sql
{{ config(materialized='view') }}

/* Original Object: [database].[schema].[object_name]
   Source Platform: Snowflake
   Purpose: [brief description]
   Conversion Notes: [key changes]
   Description: [SQL logic description] */

WITH source_data AS (
    SELECT
        customer_id::INTEGER AS customer_id,
        customer_name::VARCHAR(100) AS customer_name,
        account_balance::NUMBER(18,2) AS account_balance,
        created_date::DATE AS created_date
    FROM {{ ref('upstream_model') }}
),

transformed_data AS (
    SELECT
        customer_id,
        UPPER(customer_name)::VARCHAR(100) AS customer_name_upper,
        account_balance,
        created_date,
        CURRENT_TIMESTAMP()::TIMESTAMP_NTZ AS loaded_at
    FROM source_data
)

SELECT
    customer_id,
    customer_name_upper,
    account_balance,
    created_date,
    loaded_at
FROM transformed_data
```

```yaml
## models/[domain]/[target_schema_name]/_models.yml
version: 2

models:
  - name: model_name
    description: "Table description; converted from Snowflake [Original object name]"
    columns:
      - name: customer_id
        description: "Primary key - unique customer identifier"
        tests:
          - unique
          - not_null
      - name: customer_name_upper
        description: "Customer name in uppercase"
      - name: account_balance
        description: "Current account balance; Foreign key to OTHER_TABLE"
        tests:
          - relationships:
              to: ref('OTHER_TABLE')
              field: OTHER_TABLE_KEY
      - name: created_date
        description: "Date the customer record was created"
      - name: loaded_at
        description: "Timestamp when the record was loaded by dbt"
```

```yaml
## dbt_project.yml (excerpt)
models:
  my_project:
    +materialized: view
    domain_name:
      +schema: target_schema_name
```
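
Note: with dbt's default schema resolution, a custom `+schema` is appended to the target schema
(e.g., a target schema of `analytics` yields `analytics_target_schema_name`). If migrated models
must land in exactly the schema named in the original DDL, overriding `generate_schema_name` is a
common pattern; a minimal sketch:

```sql
-- macros/generate_schema_name.sql
{% macro generate_schema_name(custom_schema_name, node) -%}
    {%- if custom_schema_name is none -%}
        {{ target.schema }}
    {%- else -%}
        {{ custom_schema_name | trim }}
    {%- endif -%}
{%- endmacro %}
```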

### Specific Translation Rules

#### dbt Specific Requirements

- If the source is a view, use a view materialization in dbt
- Include appropriate dbt model configuration (materialization type)
- Add documentation blocks for a schema.yml
- Add descriptions for tables and columns
- Include relevant tests
- Define primary keys and relationships
- Assume that upstream objects are models
- Comprehensively provide all the columns in the output
- Break complex procedures into multiple models if needed (see the sketch after this list)
- Implement appropriate incremental strategies for large tables
- Use Snowflake SQL functions rather than macros whenever possible
- **Always cast columns with explicit precision/scale** using `::TYPE` syntax (e.g.,
  `column_name::VARCHAR(100)`, `amount::NUMBER(18,2)`) to ensure output matches expected data types
- **Always provide explicit column aliases** for clarity and documentation
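
When a stored procedure is split into multiple models (per the list above), each step becomes its
own file chained with `ref()`. A minimal sketch with hypothetical model and column names:

```sql
-- models/intermediate/int_proc_step1.sql (first step of the original procedure)
{{ config(materialized='view') }}
SELECT
    order_id::INTEGER AS order_id,
    order_status::VARCHAR(20) AS order_status
FROM {{ ref('stg_orders') }}

-- models/marts/fct_orders.sql (a separate file that consumes the step above)
{{ config(materialized='table') }}
SELECT
    order_id,
    order_status
FROM {{ ref('int_proc_step1') }}
```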

#### Performance Optimization

- Suggest clustering keys if needed
- Recommend materialization strategy (view vs table)
- Identify potential performance improvements

#### Snowflake to dbt Conversion Patterns

Since the source is Snowflake, focus on converting to dbt best practices:

| Snowflake Object  | dbt Equivalent | Materialization                     |
| ----------------- | -------------- | ----------------------------------- |
| VIEW              | dbt model      | `view`                              |
| TABLE (static)    | dbt model      | `table`                             |
| TABLE (append)    | dbt model      | `incremental` (append)              |
| TABLE (merge)     | dbt model      | `incremental` (merge)               |
| DYNAMIC TABLE     | dbt model      | `incremental` or `table`            |
| MATERIALIZED VIEW | dbt model      | `table` with scheduling             |
| STORED PROCEDURE  | dbt model(s)   | Break into CTEs/models              |
| STREAM + TASK     | dbt model      | `incremental` with is_incremental() |

#### Key Conversion Examples

```sql
-- Snowflake VIEW → dbt view model
CREATE VIEW schema.my_view AS SELECT ... →
{{ config(materialized='view') }}
SELECT ...

-- Snowflake TABLE with CTAS → dbt table model
CREATE TABLE schema.my_table AS SELECT ... →
{{ config(materialized='table') }}
SELECT ...

-- Snowflake MERGE pattern → dbt incremental
MERGE INTO target USING source ON ... →
{{ config(
    materialized='incremental',
    unique_key='id',
    merge_update_columns=['col1', 'col2']
) }}
SELECT ... FROM {{ ref('source_model') }}
{% if is_incremental() %}
WHERE updated_at > (SELECT MAX(updated_at) FROM {{ this }})
{% endif %}

-- Snowflake STREAM/TASK → dbt incremental
CREATE STREAM my_stream ON TABLE source;
CREATE TASK my_task ... INSERT INTO target SELECT * FROM my_stream →
{{ config(materialized='incremental', unique_key='id') }}
SELECT * FROM {{ ref('source') }}
{% if is_incremental() %}
WHERE _metadata_timestamp > (SELECT MAX(_metadata_timestamp) FROM {{ this }})
{% endif %}

-- Stored procedure logic → CTE pattern
BEGIN ... multiple statements ... END →
WITH step1 AS (...), step2 AS (...), step3 AS (...)
SELECT * FROM step3
```
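
The pattern table above also maps DYNAMIC TABLE and MATERIALIZED VIEW objects; a minimal sketch of
those two conversions (object and column names are hypothetical). On recent dbt-snowflake versions,
the dedicated `dynamic_table` materialization is another option for dynamic tables.

```sql
-- Snowflake DYNAMIC TABLE → dbt incremental model
CREATE DYNAMIC TABLE my_dt TARGET_LAG = '1 hour' ... AS SELECT ... →
{{ config(materialized='incremental', unique_key='id') }}
SELECT ... FROM {{ ref('source_model') }}
{% if is_incremental() %}
WHERE updated_at > (SELECT MAX(updated_at) FROM {{ this }})
{% endif %}

-- Snowflake MATERIALIZED VIEW → dbt table model
-- (refresh cadence comes from the dbt job schedule, not the model)
CREATE MATERIALIZED VIEW my_mv AS SELECT ... →
{{ config(materialized='table') }}
SELECT ...
```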

#### Snowflake-Specific Features in dbt

```sql
-- Clustering keys
{{ config(
    materialized='table',
    cluster_by=['date_col', 'category']
) }}

-- Transient tables (no Time Travel/Fail-safe)
{{ config(
    materialized='table',
    transient=true
) }}

-- Copy grants
{{ config(copy_grants=true) }}

-- Query tags
{{ config(query_tag='dbt_model_name') }}
```
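
One more per-model setting worth knowing (the warehouse name below is hypothetical and must exist
in the account):

```sql
-- Per-model warehouse override
{{ config(snowflake_warehouse='TRANSFORM_WH') }}
```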

#### Data Type Handling

Since both the source and target are Snowflake, data types carry over unchanged; no conversion is needed.

#### Dependencies

- List any upstream dependencies
- Suggest model organization in the dbt project (see the sketch below)
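
As one common organization (folder and schema names are hypothetical): staging models mirror the
source objects one-to-one, intermediate models hold stored-procedure steps, and marts hold the
final converted logic. A dbt_project.yml sketch:

```yaml
## dbt_project.yml (excerpt, hypothetical layout)
models:
  my_project:
    staging:
      +schema: staging
      +materialized: view
    intermediate:
      +schema: intermediate
      +materialized: view
    marts:
      +schema: marts
      +materialized: table
```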

---

## Validation Checklist

- [ ] Every DDL statement has been accounted for in the dbt models
- [ ] SQL in models is compatible with Snowflake (already native)
- [ ] All business logic preserved
- [ ] All columns included in output
- [ ] Data types correctly mapped
- [ ] Functions translated to Snowflake equivalents
- [ ] Materialization strategy selected
- [ ] Tests added
- [ ] SQL logic description complete
- [ ] Table descriptions added
- [ ] Column descriptions added
- [ ] Dependencies correctly mapped
- [ ] Incremental logic (if applicable) verified
- [ ] Inline comments added for converted syntax

---

## Related Skills

- $dbt-migration - For the complete migration workflow (discovery, planning, placeholder models,
  testing, deployment)
- $dbt-modeling - For CTE patterns and SQL structure guidance
- $dbt-testing - For implementing comprehensive dbt tests
- $dbt-architecture - For project organization and folder structure
- $dbt-materializations - For choosing materialization strategies (view, table, incremental,
  snapshots)
- $dbt-performance - For clustering keys, warehouse sizing, and query optimization
- $dbt-commands - For running dbt commands and model selection syntax
- $dbt-core - For dbt installation, configuration, and package management
- $snowflake-cli - For executing SQL and managing Snowflake objects

Overview

This skill converts Snowflake DDL (views, tables, stored procedures) into production-ready dbt models, schema.yml files, and configuration blocks. It preserves business logic while applying dbt best practices like modular models, explicit typing, and tests. The output includes clear materialization recommendations and migration notes for Snowflake-to-dbt patterns.

How this skill works

Provide Snowflake DDL or object definitions, and the skill analyzes the object type, SQL logic, and dependencies. It generates one or more dbt model SQL files with explicit casts and aliases, a schema.yml with column documentation and tests, and a config block specifying materialization and performance settings. The skill replaces procedural stored-procedure steps with CTEs or modular models and adds incremental logic where appropriate.

When to use it

  • Converting Snowflake VIEW or TABLE DDL into dbt models
  • Migrating Snowflake stored procedures into declarative dbt SQL/CTE patterns
  • Generating schema.yml with tests, descriptions, and relationships
  • Modernizing Snowflake SQL to follow dbt best practices and incremental patterns
  • Planning materialization strategy and performance settings for Snowflake targets

Best practices

  • Use view materialization for direct view conversions and table/incremental for large/appendable tables
  • Break procedural stored procedure logic into modular CTEs or separate models for testability
  • Always cast columns explicitly with ::TYPE and include explicit column aliases
  • Add schema.yml with descriptions, unique/not_null tests and relationship tests for foreign keys
  • Use incremental materialization with is_incremental() checks and a stable unique_key for merge/append patterns
  • Recommend clustering keys and transient/table settings in config when appropriate

Example use cases

  • Convert CREATE VIEW ... AS SELECT ... into a dbt view model with {{ config(materialized='view') }} and full column casting
  • Transform a MERGE-based Snowflake load into an incremental dbt model with unique_key, merge_update_columns, and is_incremental() guard
  • Rewrite a Snowflake stored procedure that performs multiple steps into chained dbt models or a single model using CTEs for step1, step2, step3
  • Generate schema.yml with tests (unique, not_null, relationships) and column descriptions to support data quality checks
  • Migrate a CTAS table to a dbt table model and recommend clustering keys and transient settings for performance and cost control

FAQ

Do you preserve Snowflake-specific SQL functions?

Yes. Snowflake functions are preserved and used where appropriate; macros are avoided in favor of native Snowflake SQL.

How are stored procedures converted?

Procedural logic is refactored into declarative CTEs and/or multiple modular models so each transformation step is testable and documented.

What materialization will you choose?

The choice is driven by the source object: views → view, static tables → table, append/merge patterns → incremental. Clustering keys, transient tables, or copy_grants are recommended where beneficial.