Git Product home page Git Product logo

As a Data Analyst, I successfully applied my skills in SQL, Tableau, Python, and IBM Cognos Analytics to deliver accurate and reliable insights for various business needs. I have demonstrated end-to-end ownership of Tableau dashboards, ensuring data quality, design, and utilization by cross-functional teams. I have also leveraged SQL scripting to verify and report data, addressing ad hoc requests swiftly and effectively.

I recently graduated from Humber College with a post-graduate degree in Business Insights and Analytics, where I learned new concepts relevant to data science. I am looking for a full-time role as a Data Scientist or Data Analyst, where I can contribute to the growth and success of the organization with my passion and expertise in the subjects.

Hi there šŸ‘‹

  • šŸ”­ Iā€™m currently working on building data science applications
  • šŸŒ± Iā€™m currently learning Front end development
  • šŸ’¬ Ask me about Data analytics, statistics, python and Tableau
  • šŸ“« How to reach me: DM on LinkedIn or E-mail

Portfolio Projects

  • Description: Many factors influence health insurance premiums, and understanding these variables is crucial for predicting costs accurately. This project explores the relationships between age, gender, BMI, number of children, smoking habits, and region with health insurance charges.
  • Skills used: This project involves a diverse set of skills spanning data analysis, machine learning, data preprocessing, GitHub version control, Markdown documentation, statistical analysis, Jupyter Notebooks, Python programming, and regression modelling. The data analysis component includes exploratory data analysis (EDA) using Python libraries such as Pandas, NumPy, and Seaborn, visualizing data distributions, relationships, and summaries with Matplotlib and Seaborn, and interpreting statistical concepts like right-skewed distributions. Machine learning skills come into play with the implementation and evaluation of regression models, including Linear Regression, Ridge Regression, Lasso Regression, and Random Forest Regressor. Feature engineering, transformation, and selection, as well as handling categorical data using techniques like encoding, are part of the data preprocessing tasks. GitHub and version control facilitate collaborative development and code sharing. Documentation is created using Markdown, and the analysis is conducted in Jupyter Notebooks. The project also involves Python programming, specifically using libraries such as NumPy, Pandas, Matplotlib, Seaborn, and Scikit-Learn. The exploration of feature importance in machine learning models and the application of Polynomial Regression to capture non-linear relationships further enhance the depth and complexity of the project. Collectively, these skills contribute to a holistic approach in predicting health insurance charges based on various influencing factors.
  • Description: Analyzing some 911 call data from Kaggle. (Exploratory data analysis and Data visualization)
  • Skills used: the skills required include proficient data cleaning and preprocessing abilities to handle emergency call data effectively. Exploratory Data Analysis (EDA) skills are crucial to understand patterns and trends in the data. Additionally, strong data visualization skills using libraries like Matplotlib and Seaborn are essential to present insights visually. If the analysis involves temporal data, skills in time series analysis become relevant. For projects dealing with geographic data, proficiency in geographic data analysis is necessary to make sense of location-based information.
  • Description:This project revolves around the application of the K-Means clustering algorithm to perform customer segmentation. Customer segmentation is a crucial task in marketing and business strategy, allowing companies to categorize customers based on similar attributes and behaviours.
  • Skills used: Data Cleaning and Preprocessing, K-Means Clustering Algorithm, Exploratory Data Analysis (EDA), Data Visualization, Machine Learning with scikit-learn, Statistical Analysis
  • Description: Models used: Linear Regression Decision Tree Random Forest Gradient Boosting SVM KNN.
  • Skills used: machine learning skills are essential, particularly in classification tasks if predicting delays versus no delays. Time series analysis might be necessary if considering time-dependent factors in predicting flight delays. Feature engineering skills are crucial to extract relevant information from the dataset. Model evaluation and selection skills are needed to choose the best predictive model. Data cleaning and preprocessing skills are important to handle missing or noisy data effectively.
  • Description: Models used: Linear Regression Decision Tree Random Forest Gradient Boosting SVM KNN.
  • Skills used: machine learning skills are employed, specifically logistic regression for binary classification tasks. Model evaluation and selection skills are crucial for assessing the performance of the logistic regression model. Feature engineering is important to identify and use relevant features for classification. Data cleaning and preprocessing skills are necessary for preparing the data for training the logistic regression model.
  • Description: Models used: Linear Regression Decision Tree Random Forest Gradient Boosting SVM KNN.
  • Skills used: skills involve linear regression for regression analysis. Model evaluation and selection skills are crucial for assessing the performance of the linear regression model. Feature engineering is important for identifying and utilizing relevant features in the regression analysis. Data cleaning and preprocessing skills are necessary for preparing the data for training the linear regression model.
  • Description: Models used: Linear Regression Decision Tree Random Forest Gradient Boosting SVM KNN.
  • Skills used: The project involves a machine learning approach to predict death rates. Skills required include regression analysis, feature engineering to enhance predictive features, and the ability to evaluate and select the most suitable model. Statistical analysis skills are beneficial for understanding the underlying patterns in the data. Furthermore, expertise in data cleaning and preprocessing is essential to prepare the data for model training.

https://github.com/AdityaDabrase/DSPortfolioProjects/tree/main/DS-ML/Insurance

AD's Projects

ecom icon ecom

A repository for e-commerce projects using analytics/ data science

sql icon sql

A repository for SQL queries for the Code Signal Arcade Questions.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    šŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. šŸ“ŠšŸ“ˆšŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ā¤ļø Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.