Git Product home page Git Product logo

decision_trees's Introduction

Decision_Trees

C++ implementation of Decision Trees and Random Forests for classification of Insurance Dataset

Please visit HERE for detailed analysis of code and experiment results.

We build decision trees and random forests for a insurance dataset, evaluating it for various experiments such as adding noise and tree pruning. Dataset taken from : https://archive.ics.uci.edu/ml/datasets/Insurance+Company+Benchmark+%28COIL+2000%29

HOW TO RUN :

  1. Compile the program by entering the following command :
   g++  -o ID3 ID3.cpp
  1. The exceutable requires two command line arguments. The first argument should be the name of the file containing the dataset. The .txt file should be in the same folder as the executable file. The second argument should specify the experiment number, i.e. 1, 2, 3 or 4 ;

  2. Run the executable by entering the following command :

   ./ID3  ticdata2000.txt  experiment_no

ticdata2000.txt contains the dataset for creating the tree.

  1. Press enter to print the output. Although the code has been optimized using appropriate data structures to minimize processing time, still some experiments may take more than a few seconds to run.

  2. Please refer to the Results and Conclusion file to see the final results of all the experiments.

Experiments :

  1. We vary the "stopping criteria" that prevents further splitting of node. Changes in accuracy and complexity of model are observed.
  2. Add noise to the dataset and evaluate the accuracy of the model along with the change in its complexity (number of nodes)
  3. Perform "Reduced Error Pruning" on the tree and measure the change in accuracy of the tree.
  4. Create a random forest using "Feature Bagging" approach where we select a subset of features, make multiple trees, and take majority vote for the result.

decision_trees's People

Contributors

devendrapratapyadav avatar

Stargazers

 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.