Building Modular and Testable Data Pipelines with dbt on Databricks

Master version-controlled data transformation workflows by building modular, tested, and documented pipelines using dbt on Databricks.

4.6 (612) ⏱ 1h 8m 📚 6 lessons 🎧 Audio version

About this course

Modern data teams need reliable, version-controlled, and thoroughly tested pipelines to transform raw data into actionable insights. Managing these workflows at scale requires combining the processing power of Databricks with the structured transformation framework of dbt. This course teaches you how to design and maintain clean data architectures that scale seamlessly. In this course, you will learn how to build production-grade data transformation pipelines from scratch. You will transition from writing raw, unorganized SQL to developing modular, reusable, and fully tested data models using both dbt Core and dbt Cloud on Databricks. What you'll learn: - Understand the foundational concepts of dbt, including project structure, configuration with YAML, and the multi-layer Bronze-Silver-Gold data architecture. - Configure dbt to connect seamlessly with Databricks using both dbt Cloud and dbt Core environments. - Build modular data models using SQL, Jinja templating, and custom macros to write dry, reusable transformation logic. - Implement robust data quality checks using modern dbt testing configurations and third-party utility packages. - Apply advanced materialization strategies, including incremental loads and snapshots, to optimize pipeline performance and track historical changes. - Integrate version control best practices to collaborate safely and maintain a clean history of your data pipeline changes. You will start with core data engineering concepts and dbt setup before moving step-by-step through modeling, testing, and advanced performance tuning. The written explanations and practical code snippets guide you from initial project initialization to deploying a resilient, production-ready pipeline. This course is designed for aspiring data engineers, analytics engineers, and data analysts who want to build structured data pipelines. No prior experience with dbt or Databricks is required, though a basic understanding of SQL is helpful. Start building cleaner, more reliable data pipelines today.

What you'll get

  • 📜 Certificate of completion
    Add it to your LinkedIn profile
  • 💬 Personal AI tutor
    Stuck on a lesson? Ask your built-in tutor anything, any time.
  • 🎧 Audio version included
    Learn on the go — no screen needed
  • ♾️ Lifetime access
    Come back anytime, no expiry
  • 📱 Phone or computer
    Works anywhere, any device
  • 💸 30-day refund
    No questions asked
  • Short & focused
    1h 8m of practical content

Reviews (2)

Anna Müller AT Verified learner
★ 5 · 2026-04-27T08:32:54+00:00

Pretty solid overall. Some parts moved a little fast for me, but the practical examples were super helpful. Glad I took it.

Hannah Meyer AT
★ 3 · 2025-04-12T18:14:54+00:00

It's a decent introduction. Could benefit from more diverse examples and a slightly better flow between modules.

Write a review

You'll be asked to sign in after sending — your draft is saved.

Learners also took

Frequently asked

What do I need to take this course? +

Just a phone or computer with internet. No installs, no special hardware.

How do I pay? +

By card via Stripe, or with cryptocurrency. We do not store card details — Stripe handles them securely.

Can I get a refund? +

Yes — full refund within 30 days, no questions asked.

How long will I have access? +

Forever. Once you purchase, the course is yours to revisit anytime.

Will I get a certificate? +

Yes. On completion you'll receive a certificate you can add to your LinkedIn profile.

Built for learners in
Tech Design Finance Marketing Healthcare Education Hospitality Manufacturing