jayshah5696 / kaggle_homedepot


Jupyter Notebook 93.84% R 6.16%

Kaggle - Home Depot Product Search Relevance

Shoppers rely on Home Depot’s product authority to find and buy the latest products and to get timely solutions to their home improvement needs. From installing a new ceiling fan to remodeling an entire kitchen, with the click of a mouse or tap of the screen, customers expect the correct results to their queries – quickly. Speed, accuracy and delivering a frictionless customer experience are essential.

In this competition, Home Depot is asking Kagglers to help them improve their customers' shopping experience by developing a model that can accurately predict the relevance of search results.

Search relevancy is an implicit measure Home Depot uses to gauge how quickly they can get customers to the right products. Currently, human raters evaluate the impact of potential changes to their search algorithms, which is a slow and subjective process. By removing or minimizing human input in search relevance evaluation, Home Depot hopes to increase the number of iterations their team can perform on the current search algorithms.

About the Project

This project was my first experience working with text data. Through it, I gained a good grasp of text manipulation and of building models on top of text.

Approach

Basic pre-processing of text data

  1. Lower casing
  2. Punctuation removal
  3. Stopwords removal
  4. Frequent words removal
  5. Spelling correction
  6. Tokenization
  7. Stemming
  8. Lemmatization
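
A few of the steps above (lower casing, punctuation removal, tokenization, stopword removal and frequent-word removal) can be sketched with the Python standard library alone. This is a minimal illustration, not the notebook's actual code: the function names `preprocess` and `remove_frequent` and the tiny `STOPWORDS` set are assumptions made for the example. Spelling correction, stemming and lemmatization are left out because they normally rely on an external NLP library.

```python
import string
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real run would use a fuller list).
STOPWORDS = {"a", "an", "the", "for", "of", "in", "and", "to", "with"}


def preprocess(text):
    """Lowercase, strip ASCII punctuation, tokenize on whitespace, drop stopwords."""
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    tokens = text.split()
    return [t for t in tokens if t not in STOPWORDS]


def remove_frequent(docs, top_n=1):
    """Drop the top_n most frequent tokens, counted across all tokenized documents."""
    counts = Counter(t for doc in docs for t in doc)
    frequent = {word for word, _ in counts.most_common(top_n)}
    return [[t for t in doc if t not in frequent] for doc in docs]


if __name__ == "__main__":
    docs = [preprocess("The Ceiling Fan, for a kitchen!"),
            preprocess("Ceiling light with a fan")]
    print(docs)
    print(remove_frequent(docs, top_n=1))
```

Frequent-word removal is done over the whole corpus rather than per document, since a word is only "frequent" relative to the full set of queries or product descriptions.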

Basic feature extraction using text data

  1. Number of words
  2. Number of characters
  3. Average word length
  4. Number of stopwords
  5. Number of special characters
  6. Number of numerics
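
The six features listed above could be computed along these lines. This is a hedged sketch rather than the project's implementation: the name `basic_features`, the dictionary keys and the small `STOPWORDS` set are hypothetical, and the character count here includes spaces, which is only one possible convention.

```python
import string

# Tiny illustrative stopword list (an assumption for this example).
STOPWORDS = {"a", "an", "the", "for", "of", "in", "and", "to", "with"}


def basic_features(text):
    """Return simple count-based features for one piece of text."""
    words = text.split()
    n_words = len(words)
    return {
        "num_words": n_words,
        # Character count includes whitespace and punctuation.
        "num_chars": len(text),
        "avg_word_len": sum(len(w) for w in words) / n_words if n_words else 0.0,
        "num_stopwords": sum(w.lower() in STOPWORDS for w in words),
        # "Special characters" are taken to be ASCII punctuation here.
        "num_special": sum(ch in string.punctuation for ch in text),
        # Only whitespace-separated tokens made purely of digits are counted.
        "num_numerics": sum(w.isdigit() for w in words),
    }


if __name__ == "__main__":
    print(basic_features("2 in. x 4 in. the board"))
```

Such features would typically be computed separately for the search term and for the product text, so the model can compare them.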

Results

Two algorithms are implemented:

  1. XGBoost
  2. Feed-forward neural network

The H2O API was not working, so the neural network is implemented in R.

We achieved an RMSE score of 0.47.
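
For reference, RMSE (root mean squared error) is the square root of the average squared difference between predicted and true relevance scores, so lower is better. A minimal sketch of the metric follows; the function name `rmse` and the sample values are illustrative only and are not taken from the project's results.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences of numbers."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must not be empty")
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


if __name__ == "__main__":
    # Made-up relevance scores, purely for illustration.
    print(rmse([1.0, 3.0], [2.0, 2.0]))
```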

Please check out the code. Your feedback is appreciated.
