Git Product home page Git Product logo

cse450team02's Introduction

CSE450Team02

Team 2 work

Module 02

Dataset

Spend some time with your team evaluating the data. Be sure to look at data types, ranges, and meanings of each feature from the data dictionary.

Links

Stakeholders

The core task we're interested in is identifying those customers most likely to subscribe to a term deposit.

  • find any actionable patterns in our results.
    • Should we only call single people on Saturdays?
    • Does it make sense to call students at all?
  • does contacting people too frequently for these marketing campaigns have an adverse affect on the outcome?

One last thing, there are a bunch of social and economic indicators in the data.

  • We may want to see separate models for times when, for example, the consumer confidence index is high compared to when it is low.

We'll definitely want to know if it's better to use a particular model during different economic situations.

once you have the models trained and tested, that you persist those trained models to a file so we can load them into our production systems.

run the list through your model and make some predictions for us?

Data

Then, please submit a csv file that has a single column, with the header "predictions" and a prediction (one per row) for each individual in this file. If we should contact the individual, predict a 1. If we shouldn't contact the individual, predict a 0 for that row. There should be 4,119 predictions in the csv file when completed.

DATA DICTIONARY

Our database analyst put together this data dictionary to help explain the values and sources of different columns in the bank dataset, so be sure to review that.

BINNING

There are multiple ways to do this, but previously we used the map() function.

DECISION TREES

You can find documentation on how to use decision trees with sci-kit learn on these pages:

User Guide Entry for Decision Trees API Reference for DecisionTreeClassifier Tips for Practical Use, from the User Guide

MODEL PERSISTENCE

You'll want to save these trained models using python's pickle module, as shown here.

However, rather than using pickle's default protocol version, you should use protocol version 5, which was introduced in Python 3.8 and is optimized for dealing with structures that contain numpy arrays and pandas data frames.

MODEL ENSEMBLES, BAGGING, AND BOOSTING

For details on how to use these techniques with Sci-Kit Learn, see this page.

We will need this collection of Google Colab notebooks that might be useful.

cse450team02's People

Contributors

awarbler avatar justicewheelwright avatar jarvisk032 avatar leocg87 avatar

Watchers

 avatar

cse450team02's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.