Git Product home page Git Product logo

Rishabh Taneja's Projects

analyzing-us-accidents-data-using-pyspark-and-databricks icon analyzing-us-accidents-data-using-pyspark-and-databricks

I have analysed the accidents dataset related to United States of America. To handle the dataset requirements of million+ records, I decided to analyse the dataset using DataBricks which is a cloud-based tool that makes use of PySpark to process the data.

identifying-risky-bank-loans-using-random-forest icon identifying-risky-bank-loans-using-random-forest

Illustration of Random Forest model on credits data for identifying risky bank loans. Random forest combines the decision trees and train them on different set of observations. By using this I will compare this with the C5.0 algorithm and determine which one is the best among them depending on the accuracy.

predictive-analysis icon predictive-analysis

Performed predictive analysis using linear and logistic regression where I compared which algorithm worked best. Later, I also managed to perform boosting algorithm to improve the efficiency.

real-estate-price-prediction-using-python icon real-estate-price-prediction-using-python

A real estate company wants to predict the prices for the future based on the data available. This way it will help them adjust their budget, change the way they work so as to earn profit and also help them decide where to invest in. As a Data Scientist, I will explore the data available, perform preprocessing, check for manipulation and finally apply and evaluate machine learning algorithms to see which algorithm provides the best result.

sentiment-analysis-using-n-gram-and-naive-bayes icon sentiment-analysis-using-n-gram-and-naive-bayes

The focus is on the sentiment analysis, and by using the ratings 0 or 1 with negative or positive sentiment. GOAL - Finally, would able to conclude which movie, product or a restaurant is good based on the sentiments. Additionally, doing the n-gram analysis while trying to answer โ€œWhether n-gram analysis is important in every text analysis?โ€

sql-project icon sql-project

Performing normalisation and analysing the different datasets to gather insights for decision making

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.