Git Product home page Git Product logo

Hi there, I'm Lewis 👋


  • 📫 How to reach me: LinkedIn
  • ⚡ Fun fact: Two of my favorites books are A Billion Wicked Thoughts by Ogi Ogas & Sai Gaddam, and How Will You Measure Your Life by Clayton Christensen!
  • 📚 I'm currently reading Streaming Data by Andrew Psaltis, Designing Cloud Data Platforms by Danil Zburivsky & Lynda Partner, and Building the Data Lakehouse by Bill Inmon

Ofili Lewis's Projects

analyzing-visualizing-data-powerbi icon analyzing-visualizing-data-powerbi

This repository contains the lab files and other resources for the free Microsoft course DAT207x: Analyzing and Visualizing Data with Power BI. To learn how to connect, explore, and visualize data with Power BI, sign up for this course on edX.

behavior-analytics icon behavior-analytics

This project builds a data pipeline to populate the user_behavior_metric table. The user_behavior_metric table is an OLAP table, meant to be used by analysts, dashboarding.

bigquery-etl icon bigquery-etl

Simple ETL script to migrate data from BigQuery to Postgres database.

data-lake icon data-lake

This project builds an ETL pipeline for a data lake hosted on S3. We will load data from S3, process the data into analytics tables using Spark, and load them back into S3. We will deploy this Spark process on a cluster using AWS.

data-pipeline-with-gcp icon data-pipeline-with-gcp

This project implements a data ingestion and processing pipeline to collect, store and process time-series data. The pipeline consists of a publisher, a message queue (Pub/Sub), a consumer, a data warehouse (BigQuery) and a data extractor. The pipeline is designed to be scalable, efficient and easy to maintain.

data-warehouse icon data-warehouse

This project creates an ETL pipeline to build a data warehouse hosted on Redshift.

data_pipeline_with_airflow icon data_pipeline_with_airflow

This project builds a data pipeline that ingests Sparkify's music data into an AWS Redshift Data Warehouse. The ETL pipeline will be run on an hourly basis, scheduled using Airflow.

jetbrain-activation-code icon jetbrain-activation-code

jetbrain software全家桶激活码activation code, including intellij idea,pycharm,datagrip, webstorm...

learningsparkv2 icon learningsparkv2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

mysqlbackup icon mysqlbackup

This script performs automated backups of a MySQL database using the mysqldump command. It allows you to schedule daily backups and provides a notification upon successful completion.

nyc-taxi-data icon nyc-taxi-data

This etl pipeline extracts and integrates NYC Taxi Trip Data with Taxi Zone Lookup Data to create a dataset that can be used for descriptive and predictive analysis. For example, to predict the number of trips per day for a given taxi zone.

pyspark-template icon pyspark-template

Structured Streaming app that can read files from the local system folder as new files are added to the folder as stream data and apply all the operations on the new data and, finally, write the results in an output directory.

real-time-analytics-platform icon real-time-analytics-platform

The Real-Time Analytics Platform is a robust and scalable solution designed to ingest, process, and visualize high-volume data streams in real-time.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.