dbx_ingestion_monitoring #126

cbotev-databricks · 2025-11-19T22:48:24Z

This directory contains common code and DABs to deploy observability ETL and dashboards for Databricks ingestion projects. The goal is to provide an example and a starting point for building ingestion observability across pipelines and datasets.

In particular, the package provides:

Tools to ETL observability data from a variety of sources, such as the SDP event log, Auto Loader cloud_file_states, system tables, and others.
Tag-based pipeline discovery: specify pipelines to monitor using flexible tag expressions with OR-of-ANDs logic (e.g., "tier:T0;team:data,tier:T1") instead of maintaining lists of pipeline IDs.
A collection of observability tables built on top of the above data using the medallion architecture.
Out-of-the-box AI/BI Dashboards based on the above observability tables.
Code and examples to integrate the observability tables with third-party monitoring providers such as Datadog, New Relic, Azure Monitor, and Splunk.

Currently Generic SDP pipelines and Lakeflow CDC Connector pipelines are supported.
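
To make the OR-of-ANDs tag expressions above more concrete, here is a minimal sketch of how such an expression could be evaluated against a pipeline's tags. It is not the code from this PR. The function names are hypothetical, and the description does not say which separator means OR and which means AND, so both are passed in explicitly rather than assumed.

```python
from typing import Dict, List, Tuple

TagGroup = List[Tuple[str, str]]


def parse_tag_expression(expr: str, *, or_sep: str, and_sep: str) -> List[TagGroup]:
    """Split an expression into OR-groups, each a list of (key, value) AND-terms.

    Hypothetical helper: the separator roles are supplied by the caller because
    the PR description does not spell them out.
    """
    groups: List[TagGroup] = []
    for raw_group in expr.split(or_sep):
        terms: TagGroup = []
        for raw_term in raw_group.split(and_sep):
            raw_term = raw_term.strip()
            if not raw_term:
                continue  # tolerate stray separators
            key, _, value = raw_term.partition(":")
            terms.append((key.strip(), value.strip()))
        if terms:
            groups.append(terms)
    return groups


def pipeline_matches(tags: Dict[str, str], groups: List[TagGroup]) -> bool:
    """True if at least one group has all of its key:value terms present in tags."""
    return any(all(tags.get(k) == v for k, v in group) for group in groups)


# Assumption for this example only: "," separates OR-groups, ";" separates AND-terms.
groups = parse_tag_expression("tier:T0;team:data,tier:T1", or_sep=",", and_sep=";")
print(pipeline_matches({"tier": "T1"}, groups))
print(pipeline_matches({"tier": "T0", "team": "ml"}, groups))
```

Under that assumed reading, the expression means (tier:T0 AND team:data) OR (tier:T1). Swapping the two separator arguments gives the other possible reading without changing the matching logic.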

..._connector_monitoring_dab/dashboards/CDC Connector Monitoring Dashboard Template.lvdash.json

...stion_monitoring/cdc_connector_monitoring_dab/monitoring_etl/cdc_monitoring_pipeline_main.py

lennartkats-db · 2025-11-26T16:14:05Z

...stion_monitoring/cdc_connector_monitoring_dab/monitoring_etl/cdc_monitoring_pipeline_main.py

+import sys
+import logging
+
+sys.path.append("../../lib")


The alternative to this is to use the trick at https://github.com/databricks/cli/blob/c55065cc0ddc280146038446cf256af7afcc9eaf/libs/template/templates/default/template/%7B%7B.project_name%7D%7D/resources/%7B%7B.project_name%7D%7D_etl.pipeline.yml.tmpl#L40-L43

Honestly, it's a bit too much magic IMO. This way one can execute the notebook manually too.
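
For reference, the sys.path approach discussed in this thread can be made slightly more defensive. The sketch below is an illustration, not the code in the PR: the helper name `add_lib_to_path` and its `base_dir` parameter are hypothetical, and it assumes the relative path is resolved against the current working directory, as the bare `sys.path.append("../../lib")` effectively does.

```python
import os
import sys
from typing import Optional


def add_lib_to_path(relative_lib: str = "../../lib", base_dir: Optional[str] = None) -> str:
    """Resolve relative_lib against base_dir (default: cwd) and add it to sys.path once.

    Hypothetical helper for illustration; returns the absolute path that was added.
    """
    base = base_dir if base_dir is not None else os.getcwd()
    lib_path = os.path.abspath(os.path.join(base, relative_lib))
    if lib_path not in sys.path:  # avoid duplicate entries on repeated runs
        sys.path.append(lib_path)
    return lib_path


# Same effect as the diff above when run from the notebook's directory.
add_lib_to_path()
```

Storing an absolute path means later changes to the working directory do not affect imports, and the membership check keeps repeated manual runs of the notebook from growing sys.path.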

...gestion_monitoring/generic_sdp_monitoring_dab/monitoring_etl/sdp_monitoring_pipeline_main.py

contrib/databricks_ingestion_monitoring/common/src/build_pipeline_tags_index.ipynb

contrib/databricks_ingestion_monitoring/common/resources/build_pipeline_tags_index.job.yml

contrib/databricks_ingestion_monitoring/NOTICE

lennartkats-db

Please review the directory structure: should jobs/ not be called src/? Otherwise this LGTM

cbotev-databricks · 2025-12-12T01:07:37Z

Please review the directory structure: should jobs/ not be called src/? Otherwise this LGTM

Done. Renamed jobs/ to src/

This commit fixes formatting issues in the databricks_ingestion_monitoring files that were introduced in PR #126. The files were merged without being properly formatted according to ruff standards, causing CI checks to fail.

Changes:
- Reformatted 14 files (Python files and Jupyter notebooks) using ruff format
- No functional changes, only formatting improvements

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

dbx_ingestion_monitoring release 0.3.4

d3b226d

lennartkats-db reviewed Nov 26, 2025

Address code review comments

3b9889b

cbotev-databricks requested a review from lennartkats-db December 3, 2025 00:05

lennartkats-db approved these changes Dec 11, 2025

Address code review comments

c4d4fe5

cbotev-databricks requested a review from lennartkats-db December 12, 2025 01:07

lennartkats-db merged commit 1cf3dba into databricks:main Dec 12, 2025

lennartkats-db mentioned this pull request Dec 19, 2025

Fix ruff formatting issues in databricks_ingestion_monitoring files #139

Merged
