Sparkify

This repository contains my capstone project for the Udacity Data Scientist Nanodegree Program. In this project, I analyze data from Sparkify to predict customer churn.

Sparkify is a simulated dataset of a subscription-based company that provides a music service similar to Spotify or Apple Music. Predicting customer churn is a common and challenging task that data scientists and analysts take on to improve a company's business. Processing and analyzing large amounts of data with Spark is also an essential skill in the data field.

🚀 Table of contents

  1. Prerequisites
  2. Project Motivation
  3. Instructions
  4. Results
  5. Acknowledgements

Prerequisites

These are the libraries used in this project:

  • PySpark

Instructions

  1. Install PySpark
  2. Run the notebook Sparkify.ipynb
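
Before opening the notebook, it can help to confirm that PySpark is importable in the current environment. The snippet below is a minimal sketch, not part of the project: the helper name `missing_packages` is hypothetical, and it assumes PySpark is installed with `pip install pyspark`. It uses only the Python standard library.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level packages in `names` that cannot be found.

    Hypothetical helper for illustration; it only looks the packages up
    and does not import them.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # PySpark is the only library listed under Prerequisites.
    missing = missing_packages(["pyspark"])
    if missing:
        print("Missing packages:", ", ".join(missing))
        # Assumption: packages are installed with pip.
        print("Install them with: pip install " + " ".join(missing))
    else:
        print("PySpark is installed; open Sparkify.ipynb in Jupyter.")
```

If the check reports that `pyspark` is missing, install it first and then run the notebook.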

Results

The findings of this project have been published here.

Acknowledgements

This project uses simulated user data from Sparkify.

The code is inspired by the Udacity Data Scientist Nanodegree Program.

🔨 Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project.
  2. Create your Feature Branch (git checkout -b feature/Feature).
  3. Commit your Changes (git commit -m 'Add some feature').
  4. Push to the Branch (git push origin feature/Feature).
  5. Open a Pull Request.

📫 Contact

Contributors

dhuy237
