Kedar Ghule's Projects
The project implements Unsupervised Machine Learning techniques to develop customer segmentation using credit card data to develop marketing strategy.
The project implements a ResNet to detect brain tumours from MRI images and then uses ResUNet model to perform localization of the identified brain tumours.
Cartography is a Python tool that consolidates infrastructure assets and the relationships between them in an intuitive graph view powered by a Neo4j database.
Assignment Solutions of my coursework of CIVE 7100 Time Series and Geospatial Data Sciences course at Northeastern University
EECE 5645 Project: Performing community detection on Reddit Hyperlink network dataset and leverage the power of Spark and GraphFrames
This project implements a logistic regression model using grid search for hyperparameter tuning to get a AUPRC score of 0.70. This is further improved to a value of 0.82 by an XGBoost Classifier after using Hyperopt for hyperparameter tuning.
The project implements and tests various machine learning algorithms to better classify and detect DDoS attacks.
An EDA and Data Visualization Project which collects data from DevOps job postings on Indeed and provides insights on the DevOps Job Market.
This is a document question answering app made with LangChain and deployed on Streamlit where you can upload a .txt or .docx file, ask questions based on the file and an LLM like Falcon-7B or Dolly-V2-3B answers it. ChromaDB is used as the vector database.
A Python client for the Neo4j Graph Data Science (GDS) library
A graph conversion tool for https://cloud.google.com/
Final Project for EECE 7205 Fundamentals of Computer Engineering
This project use multiple visualization techniques and visualizes statistics of each player in the English Premier League on a dashboard using Plotly and Dash and keeping in mind different visualization principles. Visualizations include bar charts, stacked bar charts, pie charts, tables, lollipop charts (to visualize expected goals and assists) and football field heatmaps. This project was done as a capstone project for EECE 5642 Data Visualization at Northeastern University.
Sentiment Analysis of 10-K documents using FinBERT
GPT coded and trained from scratch that outputs Shakespeare-like text.
CS 7150: Deep Learning project
This project explores the data of medical insurance claims. Descriptive Analysis, Exploratory data analysis, Univariate, Bivariate and multivariate analysis is performed to explore the data and how different features are correlated to each other. Finally, hypothesis testing is performed by employing t-test, Chi-squared test and One-way ANOVA.
The project aims to profile stocks with similar weekly percentage returns using K-Means Clustering. The project calculates realized volatility for each stock and predicts realized volatility for each stock using classical volatility models and machine learning models and comparing their performance. This is a capstone project for CIVE 7100 Time Series and Geospatial Data Sciences.
A database project implemented using PHP,MySQL with HTML and CSS implemented for the frontend. The mini project is a part of the curriculum in Third Year of Bachelor of Engineering as per Savitribai Phule Pune University (formerly University of Pune). The project is a group project.
The project aims to accurately predict the Zillow home value index of missing values in the dataset using Ridge Regression. The model is evaluated on the basis of various metrics like R2 score, root mean squared error, mean absolute percentage error and bias. This mini project was a part of CIVE 7100 course at Northeastern University.