
EXPERIENCE

BASIC DATA ANALYSIS | STATISTICS | VISUALIZATION

Research on Disability in Malawi

  • Drew conclusions using SciPy statistics
    • Linear Regression
    • Spearman and Pearson correlations
    • Wilcoxon signed-rank test
  • Collected data in person while distributing mobility devices
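A minimal sketch of how the tests listed above can be run with `scipy.stats`. The paired scores are made-up illustrative values, not data from the study, and the variable names are hypothetical.

```python
from scipy import stats

# Hypothetical paired scores for the same eight participants, measured
# before and after receiving a device (illustrative values only).
before = [12, 15, 11, 18, 14, 10, 16, 13]
after = [17, 21, 14, 26, 18, 12, 25, 20]

# Linear regression of the "after" score on the "before" score.
fit = stats.linregress(before, after)
print(f"slope={fit.slope:.2f} intercept={fit.intercept:.2f} r^2={fit.rvalue ** 2:.2f}")

# Pearson measures linear association; Spearman measures monotonic
# association computed on ranks.
pearson_r, pearson_p = stats.pearsonr(before, after)
spearman_rho, spearman_p = stats.spearmanr(before, after)
print(f"pearson r={pearson_r:.3f} spearman rho={spearman_rho:.3f}")

# Wilcoxon signed-rank test on the paired differences (two-sided by default).
w_stat, w_p = stats.wilcoxon(before, after)
print(f"wilcoxon statistic={w_stat} p={w_p:.4f}")
```

The Wilcoxon test suits paired samples like these because it does not assume the differences are normally distributed.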

DATA ENGINEERING | PYTHON DEVELOPMENT

As a Data Engineer / Analyst at SixTwentySix, I designed and developed a dynamic web application that supports budget management and helps identify cost savings. The application, hosted on AWS, automatically extracts budget data for past projects from Dropbox through the Dropbox API, transforms the disparate data into a uniform format, and visualizes it in an interactive Google Looker dashboard.

The dashboard updates automatically in real time, so users always work from the most current data when analyzing budgets and making decisions. I built this solution to identify potential areas for cost savings and to support more accurate budget estimates for future projects. The project integrates multiple data services (the Dropbox API and Google Looker) for automated data handling and visualization.
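The transform step, turning disparate budget records into one uniform format, might look roughly like the sketch below. It uses only the standard library and leaves out the Dropbox extraction and Looker loading. The column aliases, the `BudgetLine` fields, and the sample records are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BudgetLine:
    """Uniform target format (hypothetical field names)."""
    project: str
    category: str
    amount_usd: float


# Hypothetical column names that different source files might use.
FIELD_ALIASES = {
    "project": ("project", "Project Name", "job"),
    "category": ("category", "Line Item", "dept"),
    "amount_usd": ("amount", "Total ($)", "cost"),
}


def pick(record, aliases):
    """Return the value stored under the first alias present in the record."""
    for key in aliases:
        if key in record:
            return record[key]
    raise KeyError(f"none of {aliases} found in record")


def parse_amount(value):
    """Convert values such as '$1,250.50' or 980 to a float."""
    if isinstance(value, (int, float)):
        return float(value)
    return float(str(value).replace("$", "").replace(",", "").strip())


def normalize(record):
    """Map one raw record onto the uniform BudgetLine format."""
    return BudgetLine(
        project=str(pick(record, FIELD_ALIASES["project"])).strip(),
        category=str(pick(record, FIELD_ALIASES["category"])).strip().lower(),
        amount_usd=parse_amount(pick(record, FIELD_ALIASES["amount_usd"])),
    )


# Two made-up records with different layouts.
raw_records = [
    {"Project Name": "Spot A", "Line Item": "Catering", "Total ($)": "$1,250.50"},
    {"job": "Spot B", "dept": "catering", "cost": 980},
]
uniform = [normalize(r) for r in raw_records]
print(uniform)
```

Keeping the alias table separate from the parsing logic means a new source layout only needs a new alias entry.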

I developed a Flask API, hosted on AWS Elastic Beanstalk for scalability and reliable performance, that integrates AWS S3 buckets, the YouTube Data API, and the Google Sign-In API. The API handles communication between a livestream and an application that lets users customize their appearance. With Flask, I built robust endpoints for authentication, data retrieval, and updates. AWS S3 stores and retrieves user data securely, the YouTube Data API retrieves channel information, and the Google Sign-In API provides secure authentication. This project showcases my expertise in Flask, AWS services, API integration, and secure data management.
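A minimal sketch of what retrieval and update endpoints of this kind could look like in Flask. The route paths and payload shape are hypothetical, an in-memory dictionary stands in for the S3-backed storage, and authentication is omitted entirely.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

# In-memory stand-in for persistent storage. Assumption: a real service
# would read and write objects in an S3 bucket instead.
APPEARANCES = {}


@app.route("/users/<user_id>/appearance", methods=["GET"])
def get_appearance(user_id):
    """Return the stored appearance settings for one user."""
    appearance = APPEARANCES.get(user_id)
    if appearance is None:
        return jsonify(error="user not found"), 404
    return jsonify(appearance)


@app.route("/users/<user_id>/appearance", methods=["PUT"])
def update_appearance(user_id):
    """Replace the appearance settings for one user."""
    payload = request.get_json(silent=True)
    if not isinstance(payload, dict):
        return jsonify(error="expected a JSON object"), 400
    APPEARANCES[user_id] = payload
    return jsonify(payload)


# Exercise the endpoints with Flask's built-in test client.
client = app.test_client()
put_response = client.put("/users/u1/appearance", json={"hat": "red"})
get_response = client.get("/users/u1/appearance")
print(put_response.status_code, get_response.get_json())
```

The test client exercises the routes without starting a server, which keeps the sketch runnable on its own.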

MACHINE LEARNING

I developed machine learning models to predict sentiment about various products on Twitter, enabling businesses to quickly and accurately analyze social media data and gain valuable insights. I combined and filtered two datasets, cleaned and processed the data, and engineered features to improve model performance. I trained and evaluated binary and multiclass classification models using accuracy, precision, recall, and macro-averaged F1 score as evaluation metrics. Using the sentiment analysis models, I identified areas for improvement, responded to customer feedback, informed product development, and monitored competitors. The project demonstrated the value of sentiment analysis for businesses looking to make data-driven decisions based on customer feedback.
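For reference, macro-averaged F1 is the unweighted mean of the per-class F1 scores, so every sentiment class counts equally regardless of how many tweets it contains. A small standard-library sketch, with made-up labels and predictions:

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN)
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Made-up multiclass example (not project data).
y_true = ["pos", "pos", "neg", "neu", "neg", "neu"]
y_pred = ["pos", "neg", "neg", "neu", "neg", "pos"]

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"accuracy={accuracy:.3f} macro_f1={macro_f1(y_true, y_pred):.3f}")
```

Here the per-class F1 scores are 0.8 (neg), 2/3 (neu), and 0.5 (pos), giving a macro F1 of about 0.656 against an accuracy of about 0.667.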

This project uses machine learning to predict the likelihood of readmission within 30 days of initial discharge for diabetes patients. By analyzing admission, demographic, clinical, medication, and discharge data, the model identifies high-risk patients so that targeted interventions can reduce their likelihood of readmission. The project demonstrates the potential of data science and machine learning in healthcare, and contributes to efforts to improve patient outcomes and reduce healthcare costs.
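The prediction target can be made concrete with a short sketch that derives a "readmitted within 30 days" label from admission and discharge dates. The record layout and sample encounters are hypothetical, and treating exactly 30 days as inside the window is an assumption. A dataset may also ship with this label precomputed.

```python
from datetime import date


def label_readmissions(encounters, window_days=30):
    """Label each discharge by whether the same patient's next admission
    falls within `window_days` of it.

    `encounters` is a list of (patient_id, admit_date, discharge_date).
    Returns a list of (patient_id, discharge_date, readmitted).
    """
    by_patient = {}
    for patient_id, admitted, discharged in encounters:
        by_patient.setdefault(patient_id, []).append((admitted, discharged))

    labeled = []
    for patient_id, stays in by_patient.items():
        stays.sort()  # chronological order by admission date
        for i, (_, discharged) in enumerate(stays):
            next_admit = stays[i + 1][0] if i + 1 < len(stays) else None
            readmitted = (
                next_admit is not None
                and 0 <= (next_admit - discharged).days <= window_days
            )
            labeled.append((patient_id, discharged, readmitted))
    return labeled


# Made-up encounters for two patients.
encounters = [
    ("p1", date(2024, 1, 1), date(2024, 1, 5)),
    ("p1", date(2024, 1, 20), date(2024, 1, 25)),  # 15 days after discharge
    ("p2", date(2024, 2, 1), date(2024, 2, 3)),
    ("p2", date(2024, 4, 1), date(2024, 4, 4)),    # 58 days after discharge
]
labels = label_readmissions(encounters)
print(labels)
```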

The goal of this project was to develop a machine learning classification model that predicts whether a user's activity on a company website will lead to a purchase. I trained several ensemble tree models, including XGBoost and LightGBM, on a dataset of 6165 user sessions. Through preprocessing, hyperparameter tuning, and model optimization, I achieved an accuracy of 93.76% with good precision. Deploying this model would enable companies to better target their marketing, optimize their website, and incorporate dynamic pricing to boost revenue.
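A rough sketch of hyperparameter tuning for an ensemble tree classifier. It swaps in scikit-learn's `GradientBoostingClassifier` for XGBoost and LightGBM to stay within one library, and it runs on synthetic data, so its scores say nothing about the 93.76% result. The grid values are arbitrary.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score, precision_score
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic, imbalanced stand-in for session data (most sessions do not
# end in a purchase). Not the project's dataset.
X, y = make_classification(
    n_samples=600, n_features=10, n_informative=5,
    weights=[0.85, 0.15], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Small, arbitrary grid; each combination is scored with 3-fold CV.
param_grid = {
    "n_estimators": [50, 100],
    "max_depth": [2, 3],
    "learning_rate": [0.05, 0.1],
}
search = GridSearchCV(
    GradientBoostingClassifier(random_state=0),
    param_grid,
    scoring="precision",
    cv=3,
)
search.fit(X_train, y_train)

# Evaluate the refitted best model on the held-out split.
y_pred = search.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred, zero_division=0)
print(search.best_params_, f"accuracy={accuracy:.3f} precision={precision:.3f}")
```

With imbalanced classes like these, accuracy alone can look strong even for a weak model, which is why the sketch reports precision alongside it.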

TECHNICAL SKILLS

Python 3 | pandas, scikit-learn, Keras, TensorFlow, SciPy, Matplotlib, NumPy, OpenCV (cv2)

Graphic Design | Affinity Designer and Affinity Photo

Godot / GDScript

Spanish | Conversational

Aaron Bastian's Projects

particles2d_plus

A simple class that adds a "particles_cycle_finished" signal, eliminating the need to add timers to all of your particle nodes.

pyicloud

A Python + iCloud wrapper to access iPhone and Calendar data.

xmlreader

A high-level class for reading and parsing XML files.
