Git Product home page Git Product logo

demo-etl-dbt-omop's Introduction

ohdsi-2022

The work of OMOP on the dbt for demo in OHDSI Symposium 2022

Abstract available at: https://www.ohdsi.org/2022showcase-2/ image

About

This repo include:

  • A part of productionize pipeline of OMOP CDM conversion at Siriraj Hospital (the 'Dev' box in the figure above.)
  • Sub-repo of the dbt project and model that handling ETL in SQL.

Only for the demonstration, We use data pipeline and ETL convension from OHDSI/ETL-Synthea.

Learn more about dbt.

Features

Data Lineage

The dbt docs serve is providing full documentation with graph of data lineage, ease developer to maintain their conversion. image

Scheduled Pipeline

From dbt manifest to Apache Airflow, Wrapping dbt project into DAG of tasks dynamically per each models from the dbt with its execution order. image

Jinja Macro Templating

Some ETL pattern is redundant (example: Mapping Concepts), Define parameterized funtions at one place to keep maintainability by not edit on every .sql file that operate the same pattern. image

Unit Test

Developer can quickly run dry test for uniqueness in ID column, relationship between concept ID and concept table (PK and FK) with dbt test before proceed on DQD. image

Containerized and Version Control

Back-end infrastructure was wrapped up in Dockerfile allow to deploy on any container platform (Docker, K8, etc.) and version controlled via GitHub or GitLab.

Disclaimer

This article is an independent publication and has not been authorized, sponsored, or otherwise approved by dbt Labs, Inc., the owner of dbtTM, or any owners of the products mentioned therein.

demo-etl-dbt-omop's People

Contributors

thanepi avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

b8heng

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.