This is just a small project to classify the spam messages present in the data set. Used the NLP text preprocessing techniques such as :
- Cleaning the text , removing all letters otehr than alphabets(lower and upper)
- Removing the stop words from each messages
- Lemmatized the remaining words
- encoded the predictor column
- trained Multinomial Naive Bayes classifier
https://archive.ics.uci.edu/dataset/228/sms+spam+collection
Multinomial Naive bayes
97.66