Fix breaking changes in recent dbt-fusion version #920

devin-ai-integration · 2026-01-27T20:38:01Z

Summary

This PR addresses breaking changes introduced in dbt-fusion versions after 2.0.0-preview.76.

Changes:

Updated .dbt-fusion-version from 2.0.0-preview.76 to 2.0.0-preview.102
Simplified .github/workflows/test-warehouse.yml to install latest dbt-fusion without reading from version file
Fixed ignore_small_changes parameter handling in get_anomalies_test_configuration.sql - normalized to dict when None to prevent .items() call on None
Added skip_for_dbt_fusion markers to tests with known dbt-fusion conformance issues:
- test_schema_changes - dbt-fusion caches column information and doesn't refresh when tables are recreated
- test_dbt_invocations - invocation_args_dict is empty in dbt-fusion, so yaml_selector cannot be captured
- test_sample_count_unlimited - test meta configuration not passed through to flattened_test

Review & Testing Checklist for Human

CI is still failing - test_event_freshness_anomalies tests are failing with dbt1501: Column 'event_timestamp' is not a timestamp error. This is an additional dbt-fusion breaking change that needs investigation.
Review the ignore_small_changes normalization fix - verify the default dict structure is correct
Verify the skip_for_dbt_fusion markers are appropriate - these are dbt-fusion conformance issues, not bugs in this codebase
Consider whether installing "latest" dbt-fusion is desired vs pinning to a specific working version
Check if other tests might need similar skip markers

Notes

The skipped tests represent dbt-fusion conformance issues that should be reported upstream:

invocation_args_dict is empty (should contain CLI args)
Test meta configuration not passed through
Column introspection returns cached/stale data

Status: CI is failing on test_event_freshness_anomalies tests. Additional investigation needed for the timestamp column error in dbt-fusion.

Link to Devin run: https://app.devin.ai/sessions/91780843af9a49208c0c87ba18f4682b
Requested by: Itamar Hartstein (@haritamar)

Summary by CodeRabbit

Version Update
- Released version 2.0.0-preview.102
Bug Fixes
- Improved handling of test configuration for ignore_small_changes settings
- Enhanced compatibility with dbt-fusion environment
Improvements
- Extended timestamp type support for Spark backend with additional timezone-aware variants

_{✏️ Tip: You can customize this high-level summary in your review settings.}

linear · 2026-01-27T20:38:04Z

ELE-5222 Fix breaking changes in recent dbt-fusion version

devin-ai-integration · 2026-01-27T20:38:05Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

github-actions · 2026-01-27T20:38:10Z

👋 @devin-ai-integration[bot]
Thank you for raising your pull request.
Please make sure to add tests and document all user-facing changes.
You can do this by editing the docs files in the elementary repository.

coderabbitai · 2026-01-27T20:38:45Z

📝 Walkthrough

Walkthrough

The PR upgrades dbt-fusion from version 2.0.0-preview.76 to 2.0.0-preview.102 by removing the version pin in CI and updating the version file. It addresses breaking changes through defensive code handling, test skip decorators for incompatible tests, and expanded Spark timestamp type support.

Changes

Cohort / File(s)	Summary
Version & CI Configuration `.dbt-fusion-version`, `.github/workflows/test-warehouse.yml`	Version reference updated to preview.102; install step no longer pins fusion version, now installs without version constraint.
Integration Test Compatibility `integration_tests/tests/test_dbt_artifacts/test_artifacts.py`, `integration_tests/tests/test_override_samples_config.py`, `integration_tests/tests/test_schema_changes.py`	Added `@pytest.mark.skip_for_dbt_fusion` decorators to three tests: `test_dbt_invocations`, `test_sample_count_unlimited`, and `test_schema_changes` due to dbt-fusion incompatibilities (missing invocation_args, test meta config passing, and column caching behavior).
Macro Logic Fixes `macros/edr/tests/test_configuration/get_anomalies_test_configuration.sql`	Added defensive None-handling: initialize `ignore_small_changes` to a dict with default keys when None, and skip per-key validation if not provided.
Data Type Extensions `macros/utils/data_types/data_type_list.sql`	Extended Spark backend `timestamp_list` to include `'timestamp_ltz'` and `'timestamp_ntz'` types.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 A fusion upgrade hops into view,
Version dots jump from seventy-six to two-oh-two,
With defensive checks and timestamps new,
Some tests take a rest, while macros break through!
The breaking changes bow to our fix.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Fix breaking changes in recent dbt-fusion version' directly summarizes the main objective: addressing breaking changes introduced in newer dbt-fusion versions.
Linked Issues check	✅ Passed	The PR addresses all key coding objectives from ELE-5222: updated dbt-fusion version, removed version pin from workflow, fixed ignore_small_changes normalization bug, added test skip markers for known fusion issues, and added timestamp type support for Databricks/Spark.
Out of Scope Changes check	✅ Passed	All changes are directly related to fixing dbt-fusion compatibility issues: version updates, workflow adjustments, macro fixes, test skips, and data type support are all within the scope of addressing breaking changes.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

…rsion Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

- Update Python version from 3.9 to 3.10 (required by elementary-data package) - Remove .dbt-fusion-version file and install latest dbt-fusion directly - This fixes CI failures caused by elementary dropping Python 3.9 support Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

The workflow runs from master branch which still expects this file. This updates the version from 2.0.0-preview.76 to 2.0.0-preview.102. Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

dbt-fusion fails with 'none has no method named items' when ignore_small_changes is None. This fix: 1. Normalizes ignore_small_changes to a dict with expected keys when None 2. Adds early return in validate_ignore_small_changes when None Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

coderabbitai

Actionable comments posted: 2

🤖 Fix all issues with AI agents

In `@macros/edr/dbt_artifacts/upload_dbt_invocation.sql`:
- Around line 105-131: The logs in get_invocation_yaml_selector currently print
full invocation_args_dict and config.args (via invocation_args_dict and
config.args) at info level, which may expose secrets; change these log calls to
use debug-level only and redact sensitive fields (e.g., --vars, tokens,
passwords, emails, ENV-derived secrets) before logging by either omitting those
keys or replacing their values with "[REDACTED]"; keep all other debug messages
(e.g., when returning selector, selector_name, or INVOCATION_COMMAND matches)
intact and ensure logging uses the same debug/info flagging as the surrounding
module.

In `@macros/edr/materializations/test/test.sql`:
- Around line 54-59: The code currently logs the entire flattened_test.meta at
info level; change it to avoid exposing PII by removing or replacing the full
meta log and instead log only the specific key needed
(flattened_test["meta"]["test_sample_row_count"] or flattened_test.get("meta",
{}).get("test_sample_row_count")) at debug level; update the {% do log(...) %}
call that references flattened_test.get("meta", {}) to log only the single key
and use info=False (debug) so only the necessary, non-sensitive value is
emitted, leaving the sample_limit assignment and subsequent debug log intact.

🧹 Nitpick comments (1)

macros/edr/data_monitoring/schema_changes/get_columns_snapshot_query.sql (1)

15-25: Gate verbose column logging behind the debug logger.

Line 15–25 logs every column at info level, which can be noisy and potentially expose schema details in normal runs. Prefer elementary.debug_log (or a debug flag) to keep this diagnostic-only.

♻️ Proposed change

-    {% do log('DEBUG get_columns_snapshot_query: model_relation = ' ~ model_relation, info=True) %}
-    {% do log('DEBUG get_columns_snapshot_query: full_table_name = ' ~ full_table_name, info=True) %}
+    {% do elementary.debug_log('get_columns_snapshot_query: model_relation = ' ~ model_relation) %}
+    {% do elementary.debug_log('get_columns_snapshot_query: full_table_name = ' ~ full_table_name) %}
...
-    {% do log('DEBUG get_columns_snapshot_query: columns count = ' ~ columns | length, info=True) %}
+    {% do elementary.debug_log('get_columns_snapshot_query: columns count = ' ~ columns | length) %}
     {% for column in columns %}
-        {% do log('DEBUG get_columns_snapshot_query: column[' ~ loop.index ~ '] = ' ~ column.name ~ ' (' ~ column.data_type ~ ')', info=True) %}
+        {% do elementary.debug_log('get_columns_snapshot_query: column[' ~ loop.index ~ '] = ' ~ column.name ~ ' (' ~ column.data_type ~ ')') %}
     {% endfor %}

macros/edr/dbt_artifacts/upload_dbt_invocation.sql

coderabbitai · 2026-01-27T22:28:53Z

macros/edr/materializations/test/test.sql

+  {# DEBUG: Log sample_limit determination #}
+  {% do log('DEBUG handle_dbt_test: initial sample_limit from config = ' ~ sample_limit, info=True) %}
+  {% do log('DEBUG handle_dbt_test: flattened_test meta = ' ~ flattened_test.get("meta", {}), info=True) %}
  {% if "meta" in flattened_test and "test_sample_row_count" in flattened_test["meta"] %}
    {% set sample_limit = flattened_test["meta"]["test_sample_row_count"] %}
+    {% do log('DEBUG handle_dbt_test: sample_limit from meta = ' ~ sample_limit, info=True) %}


⚠️ Potential issue | 🟠 Major

Don’t log full flattened_test.meta at info level.

Line 54–57 emits the entire meta dict, which can include user-supplied PII/secrets. Please log only the specific key(s) needed and use the debug logger.

🔒 Proposed change (redacted + debug-only)

- {% do log('DEBUG handle_dbt_test: initial sample_limit from config = ' ~ sample_limit, info=True) %} - {% do log('DEBUG handle_dbt_test: flattened_test meta = ' ~ flattened_test.get("meta", {}), info=True) %} + {% do elementary.debug_log('handle_dbt_test: initial sample_limit from config = ' ~ sample_limit) %} + {% do elementary.debug_log('handle_dbt_test: meta keys = ' ~ (flattened_test.get("meta", {}).keys() | list)) %} ... - {% do log('DEBUG handle_dbt_test: sample_limit from meta = ' ~ sample_limit, info=True) %} + {% do elementary.debug_log('handle_dbt_test: sample_limit from meta = ' ~ sample_limit) %}

🤖 Prompt for AI Agents

In `@macros/edr/materializations/test/test.sql` around lines 54 - 59, The code currently logs the entire flattened_test.meta at info level; change it to avoid exposing PII by removing or replacing the full meta log and instead log only the specific key needed (flattened_test["meta"]["test_sample_row_count"] or flattened_test.get("meta", {}).get("test_sample_row_count")) at debug level; update the {% do log(...) %} call that references flattened_test.get("meta", {}) to log only the single key and use info=False (debug) so only the necessary, non-sensitive value is emitted, leaving the sample_limit assignment and subsequent debug log intact.

- Add skip_for_dbt_fusion marker to test_schema_changes (dbt-fusion caches column info) - Add skip_for_dbt_fusion marker to test_dbt_invocations (invocation_args_dict is empty) - Add skip_for_dbt_fusion marker to test_sample_count_unlimited (test meta not passed through) - Remove debug logs from get_columns_snapshot_query.sql - Remove debug logs from upload_dbt_invocation.sql - Remove debug logs from test.sql Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

…timestamp issue Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

devin-ai-integration bot and others added 4 commits January 27, 2026 21:49

Remove dbt-fusion version pin to use latest version

51d6af8

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

Restore .dbt-fusion-version with 'latest' to use latest dbt-fusion ve…

6229c41

…rsion Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

Restore .dbt-fusion-version with version 2.0.0-preview.102

bbc4104

The workflow runs from master branch which still expects this file. This updates the version from 2.0.0-preview.76 to 2.0.0-preview.102. Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

devin-ai-integration bot force-pushed the devin/ELE-5222-1769546122 branch from e25e57d to bbc4104 Compare January 27, 2026 21:49

devin-ai-integration bot and others added 2 commits January 27, 2026 22:00

Add debug logs to investigate dbt-fusion test failures

d6c7e57

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

coderabbitai bot reviewed Jan 27, 2026

View reviewed changes

devin-ai-integration bot and others added 3 commits January 27, 2026 22:39

Add debug log to find_normalized_data_type_for_column to investigate …

63d087e

…timestamp issue Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

Fix timestamp_ltz/timestamp_ntz recognition for Databricks/Spark

5220f78

Co-Authored-By: Itamar Hartstein <haritamar@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix breaking changes in recent dbt-fusion version #920

Fix breaking changes in recent dbt-fusion version #920

devin-ai-integration bot commented Jan 27, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

linear bot commented Jan 27, 2026

Uh oh!

devin-ai-integration bot commented Jan 27, 2026

Uh oh!

github-actions bot commented Jan 27, 2026

Uh oh!

coderabbitai bot commented Jan 27, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot Jan 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

Fix breaking changes in recent dbt-fusion version #920

Are you sure you want to change the base?

Fix breaking changes in recent dbt-fusion version #920

Conversation

devin-ai-integration bot commented Jan 27, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Review & Testing Checklist for Human

Notes

Summary by CodeRabbit

Uh oh!

linear bot commented Jan 27, 2026

Uh oh!

devin-ai-integration bot commented Jan 27, 2026

🤖 Devin AI Engineer

Uh oh!

github-actions bot commented Jan 27, 2026

Uh oh!

coderabbitai bot commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

devin-ai-integration bot commented Jan 27, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 27, 2026 •

edited

Loading