Git Product home page Git Product logo

google-datacatalog-dbt-tag's Introduction

Google Cloud Data Catalog and dbt

Example to create or update a Google Cloud Data Catalog tag on BigQuery tables or views with dbt Cloud metadata via a Python Cloud Function.

Data Catalog tag : dbt Run Metadata tag attached to the BigQuery table or view and containing information from the dbt Run used to create or update the BigQuery table or view : Run durations and date, dbt Project and Model, Cloud job, Cloud project and approximative size and rows count.

To activate, learn and use Cloud Data Catalog, go to https://cloud.google.com/data-catalog and https://console.cloud.google.com/datacatalog.

This repository contains the Cloud Function Python code to create or update the Data Catalog tag.

This Cloud Function uses:

In your Cloud Function, you need the 5 files:

Before runing the Cloud Function (and create or update tags), you need to create the Data Catalog Tag Template for dbt Run Metadata.

You can use:

To use the Cloud Function you just have to pass the dbt Cloud Run ID and the dbt Cloud Account ID in a JSON format like {"dbt_run_id":"13161733","dbt_account_id":"11442"}.

When the Data Catalog template tag is created and when a tag is created or updated on BigQuery tables or views, you can find all results from https://console.cloud.google.com/datacatalog.

Finally, you can also search BigQuery tables or views in Cloud Data Catalog with a dbt tag from your own application like https://github.com/victorcouste/dbt-datacatalog-explorer


Happy tagging !


image

google-datacatalog-dbt-tag's People

Contributors

victorcouste avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.