shikhar0605 Goto Github PK

followers: 8.0 following: 3.0 repos: 22.0 gists: 0.0

Name: Shikhar Tiwari

Type: User

Shikhar Tiwari's Projects

applying-regression-model-on-house-sales-data

Applied Turicreate Linear Regression model on house sales dataset. Examined the effect of feature selections on model accuracy.

classifier-model-for-cancer-detection-malignant-or-benign

Applied Scikit learn K-Nearest Neighbor classification algorithm to develop a model for Breast Cancer diagnosis

creating-and-manipulating-graph-using-networkx

credit-card-fraud-detection

Applied SVC classifier and Logistic Regression classifier algorithm onto credit card transaction dataset to detect any fraud.

custom-data-visualization-using-matplotlib

Created a dynamic graph using matplotlib to better judge probabilistic data generated through the election dataset. A generated graph changes its colour w.r.t change in y-axis values.

documents-similarity-prediction-using-wikipedia-s-people-dataset

Applied Turicreate's Nearest Neighbor Model to predict the similarity between any two documents taken from Wikipedia's People Dataset.

hypothesis-testing-using-t-test

Hypothesis: University towns have their mean housing prices less effected by recessions. Performed a T-test to compare the ratio of the mean price of houses in university towns the quarter before the recession starts compared to the recession bottom.

long-term-stock-price-growth-prediction-using-nlp-on-10-k-financial-reports

A 10-K FInancial Report is a comprehensive report which must be filed annually by all publicly traded companies about its financial performance. These reports are filed to the US Securities Exchange Commission (SEC). This is even more detailed than the annual report of a company. The 10K documents contain information about the Business' operations, risk factors, selected financial data, the Management's discussion and analysis (MD&A) and also Financial Statements and supplementary data. I have been expected to build an NLP pipeline that ingests 10-K reports of various publicly traded companies and build a machine learning model which can uncover the hidden signals to predict the long term stock performance of a company from the 10-K docs using the ‘Loughran McDonald Master Dictionary’. The Dictionary contain words that are specifically curated in the context of financial reports

movie-recommender-engine

Created a movie recommender engine based on cosine similarity

network-connectivity

Importing and analyzing an internal email communication network between employees of a mid-sized manufacturing company. Each node represents an employee and each directed edge between two nodes represents an individual email. The left node represents the sender and the right node represents the recipient.

nlp-on-10k-documents

This projects helps scraping and analysing the 10K and 10Q documents filed by publicly traded companies to the SEC.

pandas-for-data-science-assignment-1

Basics of Pandas for Data Analysis on Census Data

predicting-property-maintenance-fines-using-logistic-regression

Applied Scikit learn Logistic Regression algorithm to predict whether a given blight ticket will be paid on time

programmingassignment2

Repository for Programming Assignment 2 for R Programming on Coursera

project-report

This project's sole aim is to find out whether there exists any relationship between the World's University Ranking and the expenditure made by each country for their respective education system.

regex-assignment

The goal of this assignment is to correctly identify all of the different date variants encoded in this dataset and to properly normalize and sort the dates.

salary-and-new-connections-predictions-using-networkx

By using Networkx and ML algorithms created a model to predict whether or not employees in a given company are receiving a management position salary. Also predicted future connections between the employees of the network.

sec-10k-item-1a-ml-kmean-clustering

This project was my final project for the UOFM data analytics certificate program used ML to cluster text files and validated those clusters using stock market data

sentiment-analysis

Exploratory sentiment analysis of a firm's management discussion from 10K annual SEC filing

social-media-sentiment-analysis

Using Text Mining and Natural Language Processing Techniques pre- processed 50k tweets. Visualized the impact of hashtags on tweets sentiment using Seaborn. Applied machine learning models, calculated f1_scores, accordingly used the best model for sentiment prediction.

spelling-recommender

Created three different spelling recommenders, that each take a list of misspelled words and recommends a correctly spelled word for every word in the list. Each spelling recommender uses different Jaccard distance metrics. For every misspelled word, the recommender find the word in correct spellings that has the shortest distance, and starts with the same letter as the misspelled word, and return that word as a recommendation.

temperature-analysis-using-ncei-dataset

Analyzed the temperature variations of "Ann Arbor, Michigan, U.S." over 2005-2014 using NCEI Dataset.

shikhar0605 Goto Github PK

Shikhar Tiwari's Projects

Recommend Projects

Recommend Topics

Recommend Org