Solution for Amazon ML Challenge 2021
Team Name - CRP
Team members: Paritosh Dahiya, Rishaab Kalra and Chaitanya Gupta.
- Did the basic data cleaning - lower case, replacing symbols with spaces, deleted stopwords from text.
- Replaced NaNs with empty string ''.
- Concatenated all the strings together.
- Created SKlearn pipeline using CountVectorizer, tfifdTransformer.
- Tried multiple sklearn classifiers and XgBoost too, but best results were given with LinearSVC classifier.
- Trained on 50,000 samples.
- Obtained 55.47958 accuracy on test set.
- Achieved 141th rank on the leaderboard out of 3290 teams.