This project includes implementation of multiple classification machine learning models on spam twitter account data set. I have used Naive Bayes, LogisticRegression, RandomForest, Neural Networks, Decision Tree and Extra Trees Classifier.
Describe Data
Info
I removed the outliers present in the dataset using z-score method.
Before
After
Classification Report of Different Models:
- Logistic Regression
- Naive Bayes
- Random Forest
- Neural Network
- Decision Tree Classifier
- Extra Trees Classifier
Out of all the classification models, Extra Trees Classifier has given the highest accuracy followed by Random Forest Classifier.