The prediction task is to predict the number of rings on an abalone, a kind of shellfish. Full information of the dataset is provided here: https://archive.ics.uci.edu/ml/datasets/Abalone
I have followed all the steps as listed below :
- Data Understanding
- Data Retreival
- Exploratory Data Analysis
- Missing Value Analysis
- Outlier Analysis
- Feature Engineering
- Feature Scaling
- Modeling and Evaluation
- Dimensionality Reduction
- Modeling with Hyperparameters for tuning
Dataset provided (Abalone.data) had features which was highly correlated with each other, hence i had to use PCA to get higher dimensional data features which got me good accuracy as well as prediction power.
I have shown every graph in the jupyter notebook attached as well in R program.
Every code is properly commented for your better understanding.
I have successfully achieved the best accuracy (~99%) with less RMSE and R^2 as high as 0.99.