Git Product home page Git Product logo

airflow-deployment's Introduction

airflow-deployment

This repository contains deployments for running a dockerized version of airflow. There are two deployment options (local & production). The local directory has a compose file with all the necessary services required for a first time run whereas the production directory only has the core airflow components and assumes you have your own installation of Postgresql(airflow's internal database)

Requirements

  • Docker
  • PostgreSQL (When running from the production directory)

Running airflow

  1. It's possible to run the airflow containers using the official images but you should consider builing a custom image if you intend to run the DWH spark applications. The Dockerfile can be found in the dockerfiles directory. Remember to use the defined image tag when running the local/production container. Below is an example of how to generate the image.
cd dockerfiles
docker build -t khmis/airflow:0.0.1 .
  1. You can define environment varibles using the .env file which can be found in both the local and production directories.

  2. The dag code folder contains the main dag file (dwh_etl_dag.py) and component task files. When airflow runs fo the first time, the dags,plugins,logs and sparkapps directories should be automatically generated. Copy the dag file to the dags directory, and the plugin files to the plugin directories. The sparkapps directory contains the spark applications used in the DWH ETL. You can download them here and copy them to the directory

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.