Git Product home page Git Product logo

jayshah6699 / datascience-mashup Goto Github PK

View Code? Open in Web Editor NEW
41.0 1.0 53.0 575.9 MB

In this repo I will try to gather all of the projects related to data science with clean datasets and high accuracy models to solve real world problems.

License: MIT License

Python 0.21% Jupyter Notebook 99.68% CSS 0.01% Shell 0.01% HTML 0.11%
datascience-mashup accuracy-models data-science machine-learning dwoc2020 deep-neural-networks deep-learning age-detector python machine-learning-algorithms

datascience-mashup's Introduction

Hi ๐Ÿ‘‹, I'm Jay

Here are some ideas that can help you to get to know me.
  • ๐Ÿ”ญ Iโ€™m currently working on a website.
  • ๐ŸŒฑ Iโ€™m currently learning Node JS
  • ๐Ÿ‘ฏ Iโ€™m looking to collaborate on Data science/ML projects.
  • ๐Ÿค” Iโ€™m looking for help with pretty much everything(lol).
  • ๐Ÿ’ฌ Ask me about Community, Data science, ML, Software Engineering and My Youtube Channel!
  • ๐Ÿ“ซ How to reach me: Get the link to twitter.
  • ๐Ÿ˜„ Pronouns: He/Him
  • โšก Fun fact: In my free time I try to do experiments with softwares and hardwares(Arduino, raspberry pi etc.)
  • I am experienced engineer as I have done several short circuits while implementing Home Automation!

Jay's GitHub stats Top Langs

By the way don't judge me based on the stats, I do lot of stuff. Currently working on my portfolio website...

datascience-mashup's People

Contributors

07yuvraj avatar allcontributors[bot] avatar amansingh0-0 avatar amit366 avatar avishkar2001 avatar bhagyashri2000 avatar bharath-acchu avatar charvyjain avatar dhriti-roy avatar gargnityansh avatar imgbotapp avatar indrapaul824 avatar infiniteoverflow avatar jayshah6699 avatar kanishkaa24 avatar kenkirito avatar keshav340 avatar ksdkamesh99 avatar madhurima99 avatar manvi0308 avatar muthuannamalai12 avatar nitya123-github avatar oshi36 avatar pranai2279 avatar pranjal-2001 avatar priyanka2109 avatar shraiyya avatar tharunc avatar vaishnavi-1 avatar vashuev avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

datascience-mashup's Issues

Shakespeare style text generation

I would like to implement a Bi-directional LSTM neural network architecture to generate Shakespeare styled text.
The dataset I'll be using is from kaggle and it contains almost 1,11,396 lines from Shakespeare's plays from the various Act-Scenes.

@Jayshah6699 I'm participating in DWOC. It would be nice if you could assign this issue to me.
Thank you.

Graphs for Visualisation

A file containing all types of graphs(along with implementation) so that everyone can look at it and use in their own projects.

Apple Stock price Prediction using Linear Regression

The model takes the input of the 'Apple' stock prices and outputs the prediction of the Closing price of the stock. It is implemented on Python using Linear Regression Models.Used the Quandl API to apply ML to the stock market.
Please label this issue as DWOC.
Also please add the level.

car price prediction

I had made car price prediction project and deployed it using Flask framework.
Please allow me to contribute that here.

Data Science Project "Dogs vs Cats"

I can work on this issue. I have trained the dataset from Kaggle and created a CNN model to classify whether the image is of dog or cat. @Jayshah6699 Kindly, assign the labels to this issue and then assign it to me.

Data Science Project "The Traffic Sign Prediction"

This project is all about predicting the traffic signs. The tech stack used here is python,tensorflow,tkinter,cnn and son on. The dataset is obtained from kaggle. It has test and train dataset on which model is going to be trained . Apart from traing the model a tkinter [ python GUI] is made to upload the picture and predict the symbol. So basically it is a standalone application for classifying or predicting the various traffic signs.

ADD ISSUE_TEMPLATE

Hello there,
I would like to add an issue template for your repository. This template would have four issues namely bug, documentation, feature, proposal. Kindly assign me this issue as a part of DWOC.
THANK YOU

TMDb Movie Analysis and Prediction

Abstract:

In this machine learning project, I cleaned, analyzed, and predicted two target variables โ€“ both revenue (numerical) and profitability (categorical), from the dataset of THE MOVIE DATABASE (TMDb).

Dataset:

The dataset contains around 5000 movies with 22 features and is obtained from Kaggle (here). The information available about each movie include its budget, revenue generated, genres, rating, vote count, popularity, actors and actresses and any more. However, I used an unclean version of the dataset for our project (Link used within notebook).

Implementation:

In this project, I will use this dataset to clean, analyze and determine whether any information about a movie can predict the total revenue of a movie. I will then attempt to predict whether a movie's revenue will exceed its budget (profitability). Also, I will test two different models for each prediction to check which predicts our target variable better.

The results obtained from this project will be helpful for the Movie Production Teams to analyze the rubrics of their Movie Idea before it moves on to the Production Phase.

New Data Science Project "Coronary Heart Disease Prediction Web App"

In this project, we will use various models for predicting the 10-year CHD of a person based on the features provided in the dataset and use a web app for users to select any of the models and view the results obtained from them. Users won't be able to input data for prediction since there are a lot of features but will be able to input hyperparameters of their choice if required. Options will also be provided to choose among various metrics available for viewing.

Dataset:

Framingham from Kaggle, a single CSV file.

Models:

Using both scikit-learn and TensorFlow.

Web App:

Using either Streamlit or Flask (yet to be decided)

Web Hosting:

Using either Streamlit-sharing or Heroku

Any suggestions are always welcome.

Earth quake prediction

It is well known that if a disaster occurs in one region, it is likely to happen again. Some regions have frequent earthquakes, but this is only a comparative amount compared to other regions.

So, predicting the earthquake with date and time, latitude and longitude from previous data is not a trend that follows like other things, it happens naturally.

So basically if we have past data we can predict future disasters.
Make machine learning model do it.

I can provide dataset if needed.

Add Issue Template

Hai,
I would love to add an issue template for your repository. This template would have four issues namely bug, documentation, feature, proposal and question

Amazon Review Analyzer

Implemented and compared three types of linear classifiers to use for sentiment analysis of Amazon product reviews using self-made linear classifiers from scratch.

The goal of this project is to design a classifier to use for sentiment analysis of product reviews. The training set consists of reviews written by Amazon customers for various food products. The reviews, originally given on a 5 point scale, have been adjusted to a +1 or -1 scale, representing a positive or negative review, respectively.

In order to automatically analyze reviews, I did the following tasks:

Construct the three classifiers: the perceptron algorithm, the average perceptron algorithm, and the Pegasos algorithm.

Use these self-made classifiers on the food review dataset, using some simple text features.

Experiment with additional features and explore their impact on classifier performance.

Stock Price Prediction

Hi @Jayshah6699

I want to work on predicting the stock closing price of a company using bidirectional LSTM's.
Please assign me this issue so that I can start working

Predicting the survival of the passenger of Titanic

This project is about predicting the survival on the Titanic cruise. The tech stack used here is python, sklearn, pandas etc. The dataset is obtained from kaggle. It has test and train dataset on which model will be trained and tested.

Logo

I would Love to create a logo for this project. Kindly add the labels and assign it to me if you're fine with the suggestion. @Jayshah6699

A python script which refine a dataset to feed it into a model

There are issues with the scikit learn library that it can't be fed with NULL (missing values) and String (Object) data-type.
Hence a Data Scientist(beginners or who don't have any trick yet) has to give his extra time to clean the Data (or dataset),

I want to build a python script that does the following tasks:

  1. take input data
    -it can take any format (like .xslx, .csv )
  2. drop duplicates rows, columns
  3. Impute missing values.
  4. Change the Object data-type to a model fed datatype( uint )
  5. Output the file ( with same name and extension as given)

Whoo! the newly created file is ready to feed into any model

You may check one of my predicting models' file, where I have been gone through to clean the dataset before feeding into it a model.
check the file here.
@Jayshah6699 Please check it and put the tags accordingly and assign to me.

Digit recognition(0-9) using Logistic-Regression

Hello again,
This is a multiclass classification problem

dataset Logistic-Regression: sklearn.dataset
There will be preprocessing and visualization of the data, and then tuning the model for the right accuracy
It will be achieved by changing various hyperparameters for the models,
at last accuracy with various metric like, MSE, R2, Confusion matrix,
@Jayshah6699 Please add the label and assign.

Rain fall predictor

Rainfall Prediction is one of the difficult and uncertain tasks that have a significant impact on human society.

Timely and accurate forecasting can proactively help reduce human and financial loss.

This study will present a set of experiments that involves the use of common machine learning techniques to create models that can predict whether it will rain tomorrow or not based on the weather data for that day.

New data science project

New data science project folder.
Create a new issue with the project nameand in that explain your data science project(which is basically your folder name).

Rain fall prediction

Rainfall Prediction is one of the difficult and uncertain tasks that have a significant impact on human society.

Timely and accurate forecasting can proactively help reduce human and financial loss.

This study will present a set of experiments that involves the use of common machine learning techniques to create models that can predict whether it will rain tomorrow or not based on the weather data for that day.

Add New project

I want to add my project which reviews your Twitter tweet with the help of a graph or any other chart type by google sheets in terms of the Number of retweets, Positive tweets, Negative Tweets, and their difference as a Net score.

Integrate Welcome Bot

Hello there,
I can add a welcome bot config file having a proper message that will show up when any user will open up an issue or pull-request for the first time as a part of DWOC. Please assign me this issue. For reference, kindly check out: https://github.com/apps/welcome
THANK YOU

Movie Recommendation System

The model takes input data of various users, the movies they have watched and if they like it or not. Later on, the model is trained on that data by implementing Real Boltzmann Machine using Pytorch to predict the movies in the test set that the concerned user may like to watch.

Model in different languages( Python, R etc.)

You can add models in different languages but for that, in the specific project folder if that language exists (for example python file or python model) then make another folder named that language and add the model in that folder with project name(for example if the project name is face mask detection then file name will be facemaskdetector.py or facemaskdetector.r)

Labels of the pull request will be assigned as per the accuracy of the model.

Data science project "Handwritten Digits Recognition"

I want to work on this issue. The accuracy of my model was approximately 98%. Kindly @Jayshah6699 put the labels of DWOC and level and then assign it to me. I have made a CNN model for recognising the handwritten digits and used keras, tensorflow for training the MNIST dataset. I had trained and tested the model on Google Colab.

Iris-flower species prediction using Decision Tree ( multiclass Classification )

This is also a multiclass classification problem

dataset DecisionTree: https://www.kaggle.com/uciml/iris

Steps

  • There will be preprocessing and visualization of the data,
  • tuning the model for the right accuracyIt will be achieved by changing various hyperparameters for the models like depth of the tree, no. of the element on the node
  • creating 3 models of the decision tree which uses the following split methods,
    • Gini Impurity
    • Chi-square
    • entropy
  • accuracy with various metric like, MSE, R2, Confusion matrix,
  • At last, a .png file showing the whole Decision tree
    @Jayshah6699 Please add the label and assign.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.