- Data Science Experience
- Project Management Experience
- Applications
- Education
Languages:
- Python, SQL
Frameworks/Libraries:
- Pandas, NumPy, SciPy, Tensorflow, PySpark, AWS EC2 | S3 | ECS | RDS, Selenium, NLTK, Docker, Tableau, Excel
Conceptual:
- NLP, Bayesian and Frequentist Modeling, Statistical Analysis, Data Modeling, Regression - Classification - Clustering, Sequence Modeling, A/B Testing
Data Scientist | AAK Tele-Sciences
November 2020 - Present, Remote
- Designed, tested & integrated Python APIs to search and store scientific researcher profile information for patented inventions, academic papers & SEC filings
- Worked with batch streaming data and Apache query engines to analyze & build SQL Patents database
- Collaborated with Front-End team to integrate data pipeline into website user profiles within FastAPI & Unicorn framework
- Implemented Spark parallel ETL processes to greatly reduce runtime of existing pure Python scripts
- Managed team of 4 data engineers and set up code reviews, testing, and database integrity checks
Data Analyst Consultant | Leafly
July 2020 - Present, Seattle, WA
- Built Tableau dashboards and statistical models to identify significant changes & trends in listenership for non-technical audience
- Wrote Python scripts to transfer streaming data stored on multiple AWS S3 and to SQL Database
- Proposed several useful content and format changes be made based on past and current trends and collaborated with social media producer for listener survey about recent downturn in listenership
- Proposed several content and format changes based on data insights, resulting in increased downloads • Collaborated with social media producer to develop new KPIs
Database Engineer | Fuel & Ox
July 2020 - November 2020, Seattle, WA
- Wrote Python scripts to source user data stored on multiple AWS S3 buckets and write to AWS SQL server
- Collaborating with stakeholders to write queries for identifying optimal target audiences for new independent movie marketing campaigns
Vision Zero 2030: Seattle Traffic Collisions Data Analysis
July 2020 - September 2020, Seattle, WA
- Collaborated with 4 data scientists seeking to identify road features associated with higher rates of collisions order to make impactful reccomendations to the City of Seattle's 'Vision Zero 2030' project
- Joined data sourced from multiple dataframes to visualize collision rates and feature variance on each street using Matplotlib and Pandas
- Used Machine Learning modelling to suggest that traffic circles have no bearing on collision rates in Seattle and found evidence to support SDOT's plan to reduce speed limits city-wide
- Presented findings as part of public event
Construction Production Manager | York Enterprises
2017 - 2019, Tacoma, WA
- Managed 20 employees, client meetings, and improved company permit issuance rates leading to record-breaking positive customer satisfaction survey
- Performed market research on Google Analytics to validate necessary re-branding of language which lead to website SEO improvements from 3rd page to #1 search result on Google locally
- Reviewed contracts, cost proposals and served as point-of-contact for all subcontractors & suppliers
- Prepared quarterly budget reports using Excel & Salesforce, Gantt charts for tracking project timelines
- Managed inventory and truck fleet and developed automated checkout processes
Events & Marketing | Kamalaya Koh Samui
2014 - 2017, Thailand
- Managed CRM technologies and implemented new customer segments which increased email inbox open rates by 50 %
- Researched and produced B2B marketing videos and pamphlets to recruit international event partners in the UK, Singapore, Germany, Hong Kong and Australia
- Coordinated with multiple departments to deliver resort's retreat packages, hotel events program and high-performing marketing content
Visualizing 6 Years of Competitive Pokemon Battles
- Utilized BeautifulSoup to mine over 6 years of battling statistics
- Built and stored PostgreSQL database on AWS to model an ETL analytics workflow
- Creating monthly Tableau Dashboards to visualize the state of the metagame
- Used the database as part of the bi-weekly 'Wine & SQL Night' remote meet-up that I organize to teach advanced functions and techniques
Technology Stack: Psycopg2 | Beautiful Soup | AWS (RDS) | Tableau
First Place entry for Galvanize Data Science 2020 Competition
- Introduced Sentiment Analysis and Machine Learning to answer whether Trump’s Covid-19 diagnosis benefitted him politically
- Delivered custom Python Class to load, transform and clean Twitter dataset from 2gb JSONL file to Pandas DataFrame object based on features relevant to the ‘business question’
- Designed and interpreted results of T-Test on compound sentiment scores from VADER model to determine increased average sentiment for Trump did not benefit his campaign
- Prepared video presentation of repository, resulting in winning first place
Technology Stack: Python | VADER | ScikitLearn | Tableau | AWS EC2
A recommender engine designed to improve stand-up comedy recommendations using natural language processing
- Utilized BeautifulSoup and Selenium to scrape transcripts and IMDB reviews
- Performed multiple NMF and KMeans clustering transformations to better categorize comedy specials by genre
- Deployed recommender web app using Flask and AWS
Technology Stack: Selenium | BeautifulSoup | NLTK | SciKitLearn | Flask | AWS (EC2)
Technology Stack: Python | Matplotlib | NumPy | Pandas | SciKitLearn | CatBoost | XGBoost | LightGBM
Data Science Advanced Certificate | Galvanize
2020, Seattle
- 13 week immersive with 700+ hours of coding, weekly Case Studies, and 3 capstones
- Python-based curriculum focused on machine learning and best practices in statistical analysis, including frequentist and Bayesian methods
- Utilizes regression, classification and clustering to model real-world structured and unstructured data
- Explores NLP and Deep Learning techniques, in addition to Spark on AWS
Bachelor of Arts | Lewis & Clark College
2010 - 2014, Portland