The mini_projet_exemple from madclam

mini_projet_exemple's Introduction

This is a sample starting kit for the Iris challenge. It uses the well known Iris dataset from Fisher's classic paper (Fisher, 1936). The data set contains 3 classes of 50 instances each, where each class refers to a type of iris plant. One class is linearly separable from the other 2; the latter are NOT linearly separable from each other.

References and credits: R. A. Fisher. The use of multiple measurements in taxonomic problems. Annual Eugenics, 7, Part II, 179-188 (1936).

Prerequisites: Install Anaconda Python 2.7, including jupyter-notebook

Usage:

(1) If you are a challenge participant:

The file README.ipynb contains step-by-step instructions on how to create a sample submission for the Iris challenge. At the prompt type: jupyter-notebook README.ipynb
modify sample_code_submission to provide a better model
zip the contents of sample_code_submission (without the directory, but with metadata), or
download the public_data and run: python ingestion_program/ingestion.py public_data sample_result_submission ingestion_program sample_code_submission then zip the contents of sample_result_submission (without the directory).

(2) If you are a challenge organizer and use this starting kit as a template, ensure that:

you modify README.ipynb to provide a good introduction to the problem and good data visualization
sample_data is a small data subset carved out the challenge TRAINING data, for practice purposes only (do not compromise real validation or test data)
the following programs run properly: python ingestion_program/ingestion.py sample_data sample_result_submission ingestion_program sample_code_submission python scoring_program/score.py sample_data sample_result_submission scoring_output
the metric identified by metric.txt in the utilities directory is the metric used both to compute performances in README.ipynb and for the challenge.

Recommend Projects