dbt Labs Cuts Compute Costs by 64% with New Orchestrator

dbt Labs reported a 64% reduction in its own dbt-related compute costs after adopting its next-generation runner, Fusion. The new orchestrator uses state-aware processing to intelligently skip unnecessary model runs and avoid redundant computations. The case study signals a maturation of dbt best practices from transformation logic to include operational efficiency and cost governance.

- The move to a Rust-based architecture with the Fusion engine provides significant performance gains, with parsing and compilation of dbt projects running up to 30 times faster than the previous Python-based engine. This accelerates development cycles by providing near-instant feedback in the IDE. - State-aware orchestration represents a shift from traditional DAG-based execution, where all models in a selection are rebuilt, to a more intelligent process that only runs models affected by code or data changes. This can immediately reduce compute costs by around 10% just by enabling the feature, with the potential for further savings through fine-tuned freshness configurations. - The acquisition of SDF Labs was a key enabler for Fusion, providing the engine with a deep understanding of SQL. This allows for advanced features like real-time syntax validation and column-level lineage directly within the development environment, reducing errors before they reach the data warehouse. - For organizations in regulated industries like healthcare, dbt provides a framework for building auditable and compliant data pipelines necessary for standards like HIPAA. Features such as automated documentation and testing support robust data governance. - The introduction of dbt Copilot and other AI-powered agents aims to streamline the analytics workflow by automating repetitive tasks. These tools can auto-generate documentation, suggest data quality tests, and assist in creating semantic models, freeing up engineers to focus on higher-level architectural challenges. - The new architecture is designed for the modern data stack, with support for open table formats like Apache Iceberg. This provides greater flexibility and cross-platform portability for data teams building on lakehouse architectures. - For developers, the enhanced VS Code extension, powered by Fusion, creates a more integrated development environment. It offers features like real-time error detection and the ability to preview CTEs, which tightens the development loop and improves individual productivity.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.