Kickstarter Campaign Classification [Machine Learning]

Project Status: Active

Project Intro/Objective

Kickstarter is a US-based global crowdfunding platform focused on bringing funding to creative projects. Since the platform’s launch in 2009, the site has hosted over 159,000 successfully funded projects with over 15 million unique backers. Kickstarter uses an “all-or-nothing” funding system: funds are only disbursed for projects that meet the original funding goal set by the creator.

Kickstarter earns a 5% commission on projects that are successfully funded. Currently, fewer than 40% of projects on the platform succeed. The objective of this project is to predict which projects are likely to succeed so that they can be highlighted on the site through either 'staff picks' or 'featured product' lists.

Collaborators

Name            GitHub Page     Personal Website
Nateé Johnson   nateej1         ---
Misha Berrien   mishaberrien    www.mishaberrien.com

Methods Used

  • Machine Learning
  • Data Visualization
  • Predictive Modeling

Technologies

  • Python
  • pandas, Jupyter

Project Description

To increase the number of successful campaigns, we propose two related solutions:

  • Predict which campaigns will succeed, and promote those among them with the lowest predicted probability of success.
  • Contact the creators of campaigns that fall just below the “success” margin and give them insights that will help them succeed.
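The two proposals above can be sketched as a simple split over predicted probabilities. This is a minimal illustration, not the project's implementation: it assumes some classifier has already produced a probability of success per campaign, and the `Prediction` class, the `split_for_action` function, the 0.5 threshold, the 0.1 margin, and the example values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Prediction:
    campaign_id: str   # hypothetical identifier
    p_success: float   # predicted probability of success, from any classifier


def split_for_action(predictions, threshold=0.5, margin=0.1, top_k=3):
    """Return (to_promote, to_contact) lists of campaign ids.

    to_promote: campaigns predicted to succeed (p >= threshold),
                lowest probability first, capped at top_k.
    to_contact: campaigns predicted to fail but within `margin`
                below the threshold, closest to the threshold first.
    """
    predicted_success = [p for p in predictions if p.p_success >= threshold]
    to_promote = sorted(predicted_success, key=lambda p: p.p_success)[:top_k]

    near_miss = [
        p for p in predictions
        if threshold - margin <= p.p_success < threshold
    ]
    to_contact = sorted(near_miss, key=lambda p: p.p_success, reverse=True)

    return (
        [p.campaign_id for p in to_promote],
        [p.campaign_id for p in to_contact],
    )


# Invented values, for illustration only.
example = [
    Prediction("a", 0.92),
    Prediction("b", 0.55),
    Prediction("c", 0.47),
    Prediction("d", 0.20),
    Prediction("e", 0.61),
]
promote, contact = split_for_action(example, top_k=2)
print(promote, contact)
```

The threshold and margin would need to be tuned against the real data; the values here only show the shape of the selection.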

Getting Started

  1. Clone this repo.
  2. A sample of the deduplicated dataset can be found in the data_sample folder.
  3. To reproduce the results, first open the "results" file located in the results folder. Then change the two file paths at the beginning of the file

from:

kick_deduped = pd.read_csv('../../data/02_intermediate/kick_deduped.csv.zip')
cluster_features_df = pd.read_csv('../../data/03_processed/KNN_cluster_features_.csv')

to:

kick_deduped = pd.read_csv('../../data_sample/kick_deduped_sample.csv.zip')
cluster_features_df = pd.read_csv('../../data_sample/KNN_cluster_features_.csv')

then run the results file.

  4. The data processing/transformation scripts are kept in the src folder.

  5. A data dictionary can be found in the references folder.
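As an alternative to editing the two paths in step 3 by hand, the switch between the full and sample datasets could be wrapped in a small helper. The sketch below is an assumption about how one might do this, not part of the repository: `resolve_paths`, `load_inputs`, and the `use_sample` and `base_dir` parameters are hypothetical names, and only the four file paths are taken from the snippet in step 3.

```python
from pathlib import Path

# Paths copied from step 3; they are relative to the folder holding the results file.
FULL_PATHS = {
    "kick_deduped": "../../data/02_intermediate/kick_deduped.csv.zip",
    "cluster_features": "../../data/03_processed/KNN_cluster_features_.csv",
}
SAMPLE_PATHS = {
    "kick_deduped": "../../data_sample/kick_deduped_sample.csv.zip",
    "cluster_features": "../../data_sample/KNN_cluster_features_.csv",
}


def resolve_paths(use_sample=True):
    """Return the two input paths for either the sample or the full dataset."""
    return dict(SAMPLE_PATHS if use_sample else FULL_PATHS)


def load_inputs(use_sample=True, base_dir="."):
    """Read both CSV files, resolving the paths relative to base_dir."""
    # Imported here so the path helper above works without pandas installed.
    import pandas as pd

    paths = resolve_paths(use_sample)
    # pandas infers zip compression from the ".zip" suffix by default.
    kick_deduped = pd.read_csv(Path(base_dir) / paths["kick_deduped"])
    cluster_features_df = pd.read_csv(Path(base_dir) / paths["cluster_features"])
    return kick_deduped, cluster_features_df
```

Calling `load_inputs(use_sample=True)` from the folder that contains the results file would then load the sample data, assuming the repository layout matches the paths above.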

Featured Notebooks/Analysis/Deliverables


This file structure is based on the DSSG machine learning pipeline.
