mdw-dataops's Introduction

page_type: sample
languages: python, C#, TypeScript, bicep
products: Azure, Azure-Data-factory, Azure-Databricks, Azure-Stream-Analytics, Azure-Data-Lake-Gen2, Azure-Functions
description: Code samples showcasing how to apply DevOps concepts to the Modern Data Warehouse Architecture leveraging different Azure Data Technologies.

DataOps for the Modern Data Warehouse

This repository contains numerous code samples and artifacts on how to apply DevOps principles to data pipelines built according to the Modern Data Warehouse (MDW) architectural pattern on Microsoft Azure.

The samples either focus on a single Azure service (Single Tech Samples) or showcase an end-to-end data pipeline solution as a reference implementation (End to End Samples). Each sample contains code and artifacts relating to one or more of the following:

  • Infrastructure as Code (IaC)
  • Build and Release Pipelines (CI/CD)
  • Testing
  • Observability / Monitoring

Single Technology Samples

End to End samples

  • Parking Sensor Solution - This demonstrates a batch, end-to-end data pipeline following the MDW architecture, along with a corresponding CI/CD process. See here for the presentation, which includes a detailed walk-through of the solution.
  • Temperature Events Solution - This demonstrates a high-scale, event-driven data pipeline with a focus on how to implement Observability and Load Testing.
  • Dataset Versioning Solution - This demonstrates how to use DataFactory to Orchestrate DataFlow, to do DeltaLoads into DeltaLake On DataLake (DoDDDoD).
  • MDW Data Governance and PII data detection - This sample demonstrates how to deploy the infrastructure of an end-to-end MDW pipeline using Azure DevOps pipelines, with a focus on Data Governance and PII data detection.
    • Technology stack: Azure DevOps, Azure Data Factory, Azure Databricks, Azure Purview, Presidio

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information, see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

mdw-dataops's People

Contributors

devlace, hannesne, kiwibayer, akirakakar, tejado, deniscep, microsoftopensource, jmostella, jsburckhardt, dependabot[bot], balteravishay, herman-wu, elenaterenzi, nick287, davidburela, jomit, nt-d, namitms, maye-msft, shawndeggans, tessferrandez, azadehkhojandi, microsoft-github-operations[bot]
