This repository contains a Jupyter notebook and the accompanying dataset for a Kaggle competition on predicting an individual's smoking status from bio-signal data. The model uses the Random Forest algorithm and achieves an accuracy of 85%.
The dataset consists of various features such as age, height, weight, waist circumference, eyesight, hearing, blood pressure, blood sugar levels, cholesterol levels, liver function tests, and dental caries information. The target variable is the smoking status.
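As a rough illustration of the modeling approach, the sketch below trains a Random Forest classifier with scikit-learn and reports accuracy on a held-out split. It is not the notebook's code: the data is synthetic (generated with `make_classification` as a stand-in for the bio-signal features), and the split ratio, `n_estimators`, and `random_state` values are assumptions chosen for the example, so the printed score will not match the 85% figure.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the bio-signal features (assumption: this is NOT
# the competition data, only a binary classification problem of similar shape).
X, y = make_classification(
    n_samples=1000, n_features=20, n_informative=8, random_state=0
)

# Hold out 20% of the rows for validation, keeping the class balance.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Hyperparameters are illustrative defaults, not the notebook's settings.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# Accuracy is the fraction of validation rows predicted correctly.
val_accuracy = accuracy_score(y_val, model.predict(X_val))
print(f"Validation accuracy: {val_accuracy:.3f}")
```

With the real dataset, `X` would hold the feature columns listed above and `y` the smoking status column.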
- Clone this repo.
- Make sure Anaconda and Jupyter Notebook are installed on your system.
- Open the Jupyter notebook and update the path to the training dataset so that it points to the file's location on your machine.
- Run the main code inside the notebook.
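
A wrong dataset path is the most likely cause of a failed first run, so it can help to check the path before running the notebook. The helper below is a minimal sketch using only the Python standard library. The path `data/train.csv` and the column name `smoking` are hypothetical placeholders, not values taken from this repository; replace them with the actual file location and target column.

```python
import csv
from pathlib import Path

# Hypothetical placeholders: replace with the real file location and the
# real name of the target column in the training dataset.
TRAIN_PATH = Path("data/train.csv")
TARGET_COLUMN = "smoking"


def read_header(csv_path):
    """Return the column names from the first row of a CSV file."""
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        return next(csv.reader(handle), [])


def check_dataset(csv_path, target_column):
    """Return the header, or raise a clear error if the file or column is missing."""
    path = Path(csv_path)
    if not path.is_file():
        raise FileNotFoundError(f"Training dataset not found at {path.resolve()}")
    header = read_header(path)
    if target_column not in header:
        raise ValueError(f"Column {target_column!r} not found in {path.name}")
    return header
```

Calling `check_dataset(TRAIN_PATH, TARGET_COLUMN)` in a notebook cell before the main code either returns the list of column names or raises an error that says what is missing.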