Snowflake Cortex Unifies dbt and Airflow

Snowflake Cortex is natively integrating dbt and Apache Airflow through an expanded CLI, a major move to unify the modern data stack for AI. The update allows teams to combine data transformation (dbt) and pipeline orchestration (Airflow) directly within Snowflake's platform. This is a big deal for analytics engineering, aiming to streamline everything from ELT and feature engineering to deploying LLM-powered analytics in regulated environments like biotech.

This integration addresses a long-standing fragmented workflow for data teams. Previously, developers generated code with AI assistants like VS Code extensions that were disconnected from their actual data pipelines, requiring manual rework to fit the specific logic of their dbt models or Airflow DAGs. The key differentiator is Cortex Code's use of your enterprise's metadata. Unlike general-purpose tools, the AI agent reads your Snowflake information schema—table definitions, column types, and key relationships—to produce code that is contextually aware of your specific data landscape. This is a strategic move by Snowflake to become the central intelligence layer of the data stack. By embedding AI assistance directly into the control planes for transformation (dbt) and orchestration (Airflow), Snowflake aims to increase "platform gravity," making its ecosystem the default for building and managing data logic and AI enablement. The pricing model for Cortex shifts from Snowflake's traditional compute credits to a token-based consumption model. Costs are incurred per token processed (both input and output) and vary based on the underlying large language model selected from providers like Anthropic, Meta, and Google. For regulated industries like biotech, this architecture provides a significant security advantage by keeping sensitive data within the Snowflake perimeter. It enables advanced AI use cases, such as analyzing complex biological data or accelerating research, without exposing data to external APIs. Looking ahead, Snowflake is also introducing a managed Model Context Protocol (MCP) Server. This provides a standardized, secure method for connecting AI agents from platforms like Anthropic's CrewAI and Salesforce's Agentforce to proprietary enterprise data, enabling more complex, multi-agent AI systems.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.