Git Product home page Git Product logo

Hello 👋

I am develloping a series of data engeneering projects aiming to connect source systems, warehouse platforms and reporting systems like PowerBI, Tableau. There is going to be a great journey. Great thanks to Dmitry from Syrfanalytics who initiated my move to exploring and documenting my journey.

The aim of this project to learn main functionalities of two large and fast system in ELT tools: Snowflake and dbt. I am going to concentrate how to connect these to systems, how to marry them, and how to get main functionalities working

I am following along this course on Udemy: https://www.udemy.com/course/complete-dbt-data-build-tool-bootcamp-zero-to-hero-learn-dbt/?referralCode=659B6722C93EF4096D11 and supplement it with my notes what has worked and what doesn't.

A few words about these systems with some help of chatGpt.

Snowflake is a cloud-based data warehousing platform that allows users to store and analyze large volumes of data in a scalable and cost-effective way. It supports both structured and semi-structured data and is known for its simplicity and ease of use

dbt - or "data build tool" - is an open-source software tool that helps data analysts and engineers transform data in their warehouse more effectively. It is particularly popular in the context of data analytics and data warehousing. dbt focuses on the transformation step of the data analytics workflow, allowing users to define transformations on raw data within their data warehouse.

I will widely apply Python and SQL in my development as well as use AWS S3 to store data.

Python - is a high-level, interpreted programming language known for its readability, versatility, and ease of learning. It was created by Guido van Rossum and first released in 1991. Python has become one of the most popular programming languages, widely used in automation, data engineering, data science, artificial intelligence.

SQL - Structured Query Language - is a standard programming language used for managing and manipulating relational databases. It provides a standardized way to interact with databases, enabling users to perform tasks such as querying data, updating data, inserting data, and deleting data

**Amazon S3 **- Amazon Simple Storage Service - is a scalable cloud storage service provided by Amazon Web Services (AWS). It is designed to store and retrieve any amount of data. S3 is often used for backup, archiving, content distribution, and as a data storage backend for various applications.

sashamos's Projects

blog icon blog

Data, code, and scripts for the analysis in the Mode blog.

covid19 icon covid19

Resources for the Udemy Course - Azure Data Factory For Data Engineers - Project on Covid19 by Ramesh Retnasamy

datagen icon datagen

Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.

faker-cli icon faker-cli

Command-line interface to quickly generate fake CSV and JSON data

jmeter icon jmeter

Apache JMeter open-source load testing tool for analyzing and measuring the performance of a variety of services

oh-my-posh icon oh-my-posh

The most customisable and low-latency cross platform/shell prompt renderer

oletools icon oletools

oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and debugging.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.