Git Product home page Git Product logo

resume's Introduction

Isaac Campbell-Smith




Technical Skills

Languages:

  • Python, SQL

Frameworks/Libraries:

  • Pandas, NumPy, SciPy, Tensorflow, PySpark, AWS EC2 | S3 | ECS | RDS, Selenium, NLTK, Docker, Tableau, Excel

Conceptual:

  • NLP, Bayesian and Frequentist Modeling, Statistical Analysis, Data Modeling, Regression - Classification - Clustering, Sequence Modeling, A/B Testing

Data Science and Analytics

Data Scientist | AAK Tele-Sciences

November 2020 - Present, Remote

  • Designed, tested & integrated Python APIs to search and store scientific researcher profile information for patented inventions, academic papers & SEC filings
  • Worked with batch streaming data and Apache query engines to analyze & build SQL Patents database
  • Collaborated with Front-End team to integrate data pipeline into website user profiles within FastAPI & Unicorn framework
  • Implemented Spark parallel ETL processes to greatly reduce runtime of existing pure Python scripts
  • Managed team of 4 data engineers and set up code reviews, testing, and database integrity checks

Data Analyst Consultant | Leafly

July 2020 - Present, Seattle, WA

  • Built Tableau dashboards and statistical models to identify significant changes & trends in listenership for non-technical audience
  • Wrote Python scripts to transfer streaming data stored on multiple AWS S3 and to SQL Database
  • Proposed several useful content and format changes be made based on past and current trends and collaborated with social media producer for listener survey about recent downturn in listenership
  • Proposed several content and format changes based on data insights, resulting in increased downloads • Collaborated with social media producer to develop new KPIs

Database Engineer | Fuel & Ox

July 2020 - November 2020, Seattle, WA

  • Wrote Python scripts to source user data stored on multiple AWS S3 buckets and write to AWS SQL server
  • Collaborating with stakeholders to write queries for identifying optimal target audiences for new independent movie marketing campaigns

Vision Zero 2030: Seattle Traffic Collisions Data Analysis

July 2020 - September 2020, Seattle, WA

  • Collaborated with 4 data scientists seeking to identify road features associated with higher rates of collisions order to make impactful reccomendations to the City of Seattle's 'Vision Zero 2030' project
  • Joined data sourced from multiple dataframes to visualize collision rates and feature variance on each street using Matplotlib and Pandas
  • Used Machine Learning modelling to suggest that traffic circles have no bearing on collision rates in Seattle and found evidence to support SDOT's plan to reduce speed limits city-wide
  • Presented findings as part of public event

Project Management

Construction Production Manager | York Enterprises

2017 - 2019, Tacoma, WA

  • Managed 20 employees, client meetings, and improved company permit issuance rates leading to record-breaking positive customer satisfaction survey
  • Performed market research on Google Analytics to validate necessary re-branding of language which lead to website SEO improvements from 3rd page to #1 search result on Google locally
  • Reviewed contracts, cost proposals and served as point-of-contact for all subcontractors & suppliers
  • Prepared quarterly budget reports using Excel & Salesforce, Gantt charts for tracking project timelines
  • Managed inventory and truck fleet and developed automated checkout processes

Events & Marketing | Kamalaya Koh Samui

2014 - 2017, Thailand

  • Managed CRM technologies and implemented new customer segments which increased email inbox open rates by 50 %
  • Researched and produced B2B marketing videos and pamphlets to recruit international event partners in the UK, Singapore, Germany, Hong Kong and Australia
  • Coordinated with multiple departments to deliver resort's retreat packages, hotel events program and high-performing marketing content

Applications

Pokestars

Visualizing 6 Years of Competitive Pokemon Battles

  • Utilized BeautifulSoup to mine over 6 years of battling statistics
  • Built and stored PostgreSQL database on AWS to model an ETL analytics workflow
  • Creating monthly Tableau Dashboards to visualize the state of the metagame
  • Used the database as part of the bi-weekly 'Wine & SQL Night' remote meet-up that I organize to teach advanced functions and techniques

Technology Stack: Psycopg2 | Beautiful Soup | AWS (RDS) | Tableau

Martyr Politics

First Place entry for Galvanize Data Science 2020 Competition

  • Introduced Sentiment Analysis and Machine Learning to answer whether Trump’s Covid-19 diagnosis benefitted him politically
  • Delivered custom Python Class to load, transform and clean Twitter dataset from 2gb JSONL file to Pandas DataFrame object based on features relevant to the ‘business question’
  • Designed and interpreted results of T-Test on compound sentiment scores from VADER model to determine increased average sentiment for Trump did not benefit his campaign
  • Prepared video presentation of repository, resulting in winning first place

Technology Stack: Python | VADER | ScikitLearn | Tableau | AWS EC2

Comedy of Errors

A recommender engine designed to improve stand-up comedy recommendations using natural language processing

  • Utilized BeautifulSoup and Selenium to scrape transcripts and IMDB reviews
  • Performed multiple NMF and KMeans clustering transformations to better categorize comedy specials by genre
  • Deployed recommender web app using Flask and AWS

Technology Stack: Selenium | BeautifulSoup | NLTK | SciKitLearn | Flask | AWS (EC2)

Technology Stack: Python | Matplotlib | NumPy | Pandas | SciKitLearn | CatBoost | XGBoost | LightGBM


Education

Data Science Advanced Certificate | Galvanize

2020, Seattle

  • 13 week immersive with 700+ hours of coding, weekly Case Studies, and 3 capstones
  • Python-based curriculum focused on machine learning and best practices in statistical analysis, including frequentist and Bayesian methods
  • Utilizes regression, classification and clustering to model real-world structured and unstructured data
  • Explores NLP and Deep Learning techniques, in addition to Spark on AWS

Bachelor of Arts | Lewis & Clark College

2010 - 2014, Portland

resume's People

Contributors

isaac-campbell-smith avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.