rish332 Goto Github PK
Name: Rishabh Taneja
Type: User
Bio: Data Analytics graduate enthusiastic about analysis, data engineering and machine learning. Proficient in a range of technologies including Python, R and AWS.
Location: Greater Boston
Name: Rishabh Taneja
Type: User
Bio: Data Analytics graduate enthusiastic about analysis, data engineering and machine learning. Proficient in a range of technologies including Python, R and AWS.
Location: Greater Boston
I have analysed the accidents dataset related to United States of America. To handle the dataset requirements of million+ records, I decided to analyse the dataset using DataBricks which is a cloud-based tool that makes use of PySpark to process the data.
Illustration of C5.0 decision trees algorithm which is used to identify risky bank loans. C5.0 is used because of the benefits it provides which is, that it is quite opinionated towards pruning.
Illustration of Random Forest model on credits data for identifying risky bank loans. Random forest combines the decision trees and train them on different set of observations. By using this I will compare this with the C5.0 algorithm and determine which one is the best among them depending on the accuracy.
Performed predictive analysis using linear and logistic regression where I compared which algorithm worked best. Later, I also managed to perform boosting algorithm to improve the efficiency.
A real estate company wants to predict the prices for the future based on the data available. This way it will help them adjust their budget, change the way they work so as to earn profit and also help them decide where to invest in. As a Data Scientist, I will explore the data available, perform preprocessing, check for manipulation and finally apply and evaluate machine learning algorithms to see which algorithm provides the best result.
The focus is on the sentiment analysis, and by using the ratings 0 or 1 with negative or positive sentiment. GOAL - Finally, would able to conclude which movie, product or a restaurant is good based on the sentiments. Additionally, doing the n-gram analysis while trying to answer โWhether n-gram analysis is important in every text analysis?โ
Performing normalisation and analysing the different datasets to gather insights for decision making
Basic template to scrape the web using python.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.