rl16193
Name: Rahul Lal
Type: User
Bio: I am a data analyst with an excellent understanding of programming fundamentals and machine learning concepts.
Location: Toronto
Work on the Watches dataset and use PySpark to perform the ETL process to extract the dataset, transform the data, connect to an AWS RDS instance, and load the transformed data into pgAdmin. Next, you’ll use PySpark, Pandas, or SQL to determine if there is any bias toward favorable reviews from Vine members in your dataset.
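One way to check for bias toward favorable reviews is to compare the share of 5-star reviews between Vine and non-Vine reviewers. The following is a minimal plain-Python sketch of that comparison, not the project's actual code; the field names `vine` and `star_rating`, the helper `five_star_share`, and the sample rows are assumptions made for illustration.

```python
def five_star_share(reviews, vine_flag):
    """Return the percentage of 5-star reviews among reviews with the given Vine flag."""
    subset = [r for r in reviews if r["vine"] == vine_flag]
    if not subset:
        return 0.0  # no reviews in this group
    five_star = sum(1 for r in subset if r["star_rating"] == 5)
    return 100.0 * five_star / len(subset)


# Hypothetical rows; the real dataset's columns may be named differently.
reviews = [
    {"vine": "Y", "star_rating": 5},
    {"vine": "Y", "star_rating": 3},
    {"vine": "N", "star_rating": 5},
    {"vine": "N", "star_rating": 5},
    {"vine": "N", "star_rating": 4},
    {"vine": "N", "star_rating": 1},
]

vine_pct = five_star_share(reviews, "Y")
non_vine_pct = five_star_share(reviews, "N")
print(f"Vine: {vine_pct:.1f}% five-star, non-Vine: {non_vine_pct:.1f}% five-star")
```

A noticeably higher percentage for Vine reviewers would suggest a bias worth investigating further, though the comparison alone does not establish one.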
We need to identify the top 10 bacterial species in their belly buttons. That way, if Improbable Beef identifies a species as a candidate to manufacture synthetic beef, Roza's volunteers will be able to identify whether that species is found in their navel.
The main purpose of this report is to study the New York CitiBike data for August, the busiest month, highlight the main talking points such as user types and CitiBike users by gender, and prepare a comparative study for a similar project/opportunity in Des Moines, Iowa.
Credit risk is an inherently unbalanced classification problem, as good loans easily outnumber risky loans. Therefore, you’ll need to employ different techniques to train and evaluate models with unbalanced classes. Using the credit card credit dataset from LendingClub, a peer-to-peer lending services company,
Accountability Accounting, a prominent investment bank, is interested in offering a new cryptocurrency investment portfolio for its customers. The company, however, is lost in the vast universe of cryptocurrencies. We create a report that includes what cryptocurrencies are on the trading market and how they could be grouped
Write code in VS Code to analyze the provided election data and obtain the results.
The aim of this study is to visualize the gun violence data collected for the year 2018 and highlight what I think are the most important factors contributing to gun violence.
Statistical Analysis of Boston Housing
Analyze data for Kickstarter campaigns across the globe and present Louise with information on trends
Plot the earthquake data in relation to the tectonic plates’ location on the earth, show all the earthquakes with a magnitude greater than 4.5 on the map, and display the data on a third map.
Perform multiple linear regression analysis to identify which variables in the dataset predict the mpg of MechaCar prototypes. Collect summary statistics on the pounds per square inch (PSI) of the suspension coils from the manufacturing lots. Run t-tests to determine if the manufacturing lots are statistically different from the mean population.
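The t-test step can be illustrated with a one-sample t statistic, which measures how far a lot's mean PSI sits from the population mean in units of standard error. This is a minimal sketch using only the Python standard library; the language choice, the helper name `one_sample_t`, and the PSI readings are assumptions for illustration, not the project's actual code or data.

```python
import math
import statistics


def one_sample_t(sample, population_mean):
    """Return (t statistic, degrees of freedom) for a one-sample t-test."""
    n = len(sample)
    if n < 2:
        raise ValueError("need at least two observations")
    sample_mean = statistics.mean(sample)
    sample_sd = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)
    standard_error = sample_sd / math.sqrt(n)
    t_stat = (sample_mean - population_mean) / standard_error
    return t_stat, n - 1


# Hypothetical PSI readings for one manufacturing lot.
lot_psi = [1498.0, 1502.0, 1500.0, 1496.0, 1504.0]
t_stat, dof = one_sample_t(lot_psi, 1500.0)
print(f"t = {t_stat:.3f} with {dof} degrees of freedom")
```

The t statistic is then compared against the t distribution with the given degrees of freedom to obtain a p-value; a statistics package would normally handle that last step.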
Use BeautifulSoup and Splinter to scrape full-resolution images of Mars’s hemispheres and the titles of those images, store the scraped data on a Mongo database, use a web application to display the data, and alter the design of the web app to accommodate these images.
Amazing Prime loves the dataset and wants to keep it updated on a daily basis. We create one function that takes in the three files (Wikipedia data, Kaggle metadata, and the MovieLens rating data) and creates an automated pipeline that takes in new data, performs the appropriate transformations, and loads the data into existing tables.
With your knowledge of machine learning and neural networks, you’ll use the features in the provided dataset to help create a binary classifier that is capable of predicting whether applicants will be successful if funded by Alphabet Soup.
Determine the number of retiring employees per title, and identify employees who are eligible to participate in a mentorship program. Write a report that summarizes your analysis and helps prepare Bobby’s manager for the “silver tsunami” as many current employees reach retirement age.
Analyze all the rideshare data from January to early May of 2019 and create a compelling visualization.
Use Jupyter Notebook and the Pandas library to analyze standardized test results for a school district.
Use VBA to write code that helps perform automated analysis of the stock options.
Create temperature reports to help determine if the surf shop can operate throughout the year, leading to a sustainable business.
Python Project for Data Scientists- IBM Labs
The goal of this study is to find the most common type and location of parking infraction. We can then determine any alternate parking available at that location. We can also analyse the sociodemographic trends of the neighborhoods with the most infractions.
The purpose of this project is to create a webpage that allows us to filter the table containing UFO sightings. Multiple filters on the website (date, city, state, country, and shape) can be used at the same time.
The aim of this paper is to review existing methods already available in the literature and present research into the use of smartphones for classification of unpaved roads using machine learning techniques, including K-Nearest Neighbor (KNN) and Support Vector Machines (SVM).
Use beta testers’ input statements to filter the data for their weather preferences, which will be used to identify potential travel destinations and nearby hotels. The beta tester will then choose four cities to create a travel itinerary. Finally, using the Google Maps Directions API, we will create a travel route between these cities.