Git Product home page Git Product logo

Hi, Iโ€™m Dai-Phuong Ngo (Liam Ngo) ๐Ÿ‘‹ ๐Ÿ‡จ๐Ÿ‡ฆ ๐Ÿ ๐Ÿ‘

Contact BI ETL Cloud / ML Hackathon Server / Automation E- Learning
Email Microsoft Certified: Power BI Data Analyst Associate Alteryx Certified: Advanced Designer SQL Certified Advanced HackerRank Alteryx Certified: Server Implementation Alteryx 9-Comet & Completed Challeges
Linkedin Tableau Certified: Desktop Specialist Alteryx Certified: Advanced Designer Cloud Microsoft Certified: Azure Data Scientist Associate (soon) Python Certified Problem Solving Intermediate HackerRank Alteryx Certified: Server Administration GitHub
Alteryx Certified: Foundational Micro-Credential Excel Certified Expert 2019 (soon) Databricks Certified: Fundamentals of Databricks Lakehouse Platform Google Certified: Tensorflow Developer (soon) R Certified Intermediate HackerRank (soon) Tableau Public
Tableau Data Analyst (soon) Alteryx Certified: Core Designer Microsoft Certified: Azure Data Fundamentals HackerRank Credly
Alteryx Ceritified: Machine Learning Fundamentals Alteryx Certified: Designer Cloud Core Microsoft Certified: Azure AI Fundamentals CodeSignal Six Sigma Certified White Belt
Microsoft Certified: Azure Enterprise Data Analyst (soon) SAS Safe Roads 2022 Competition Participant

"Don't let what you think you canโ€™t do interfere with what you can do."

Education & Experience:

Apr 2024 - now - Analyst, Business Insights, Accounting, Tax & Finance - Hudson's Bay Company - Toronto, Ontario, Canada ๐Ÿ๐Ÿ‡จ๐Ÿ‡ฆ

  • Responsible for initiatives, applications and realizations of technologies and languages for multiple US & Canada teams: Indirect Tax, Corporate Tax, Accounting, Operations Logistics, and management of involved member & development, testing and production phases
  • Develop, upgrade, scale up Alteryx workflows to enable automation on complex reporting processes from manual 2-5 working days to 15-30 minutes of running/processing time (in average)
  • Preprocess data in Alteryx, integrate with visualization tool, build up Tableau dashboards to visualize audit of US & Canada business entities' Reconciliation summaries based on locations and states/provinces across multiple dimensions, tax categories
  • Enhance, modify workflows which checking and refactoring codes, configurations, functions, abnormalities to audit and optimize Alteryx performance, Oracle & Snowflake SQL analytical queries and Python codes on monthly basis for different projects: Account Payable Tax Recovery, Provincial Tax Rate Adjustment, Courier's Freight Fee and Tax Reconciliation, Business Unit Tax Computation, Cost Below Sales Analysis, Cash Forecast, Tax Returns, Corporate Tax Provision, Card-Related Loyalty Recoveries, Vendors' Tax Code Compliance, State-wise Tax Validation, etc
  • Integrate, combine Alteryx nodes with Python's (Extract-Transform-Load) ETL, Machine Learning, Natural Language Processing (NLP) using multiple libraries, models, techniques, decision tree diagrams on multi-classifying US & Canada SKU Tax Codes, Use Tax Rates at SKU, POS, product, category, invoice levels
  • Accomplish, perform automatable Alteryx processes to replicate human's traditional accounting tasks, automatically generate sophisticated reports and deliver insights & stories while exchanging discussions with stakeholders across departments, teams and borders
  • Cooperate with IT Teams to access and enable Robotic Processing Automation (RPA) flows using UiPath powered by AI Assistant and business intelligence dashboards for Tax-related activities
  • Skills: Alteryx, Dataiku, Power BI, Tableau, SQL, Python, Machine Learning

Jan 2023 - Apr 2024 - Alteryx Administrator, AWS Cloud Ops Data Migration - Billennium IT Inc for Roche (Swiss BioTech), Data Engineering - Integration, Data Services & Insights Foundational Domain - Toronto, Ontario, Canada ๐Ÿ๐Ÿ‡จ๐Ÿ‡ฆ

  • Translate business needs to technical requirements & synthesize insights, solutions through ServiceNow to technical & non-technical global Roche stakeholders while researching new tools & technologies in business intelligence & ETL areas
  • Perform maintenance, upgrade, backup, installation of Designer workflows from Roche on Alteryx Server, MongoDB, AWS
  • Develop batch scripting, Rest API in Python, R with complex, efficient SQL queries & extract insights from Tableau reports
  • Alteryx User ID 288253: working towards Alteryx Designer Expert Exam and participating in multiple weekly challenges

Jan 2021 - Aug 2022 - Business Insights & Analytics Post-Graduate Program - Humber College - Toronto, Ontario, Canada ๐Ÿ๐Ÿ‡จ๐Ÿ‡ฆ

Jan 2021 - now - Data Science Intern (remote) - Cohost AI (founded in San Francisco, USA, based in Ha Noi, Viet Nam) - Toronto, Ontario, Canada ๐Ÿ๐Ÿ‡จ๐Ÿ‡ฆ

  • Completed 2 projects using Python, SQL, Power BI, Alteryx, Excel to research hidden patterns, trends, insights for travel agencies & refactor the code base by developing functions, discussing with stakeholders for each topic to optimize reuse & navigation by 90%
  • Validated 95% of new metrics: Inventory, Room Night & Sold, Average Daily, Occupancy, RevPAR specialized on each topic for pricing recommendations, & growing seasonal sales, analysis of sources, properties, booking behavior, stay period & length

Jan'22 - Apr 2022 - Data Analyst Intern - iRestify Inc. (based in Toronto, Canada) - Toronto, Ontario, Canada ๐Ÿ๐Ÿ‡จ๐Ÿ‡ฆ

  • Supported cross-departmental projects by Power BI to analyze insights, & performance from customer surveys & end-user discussion with customized charts, tables, reports using DAX, MDX queries with engineered KPIs, ratios, & conditional features
  • Identified & reported trends, & patterns, measured stakeholdersโ€™ compliance, early, late, and on-time completeness for the operations & customer success to increase productivity by 70% for users to reuse, update, & interact on Power BI Service App

Aug-Dec 2021 - Data Engineering & Analytics Intern (remote) - Center of Talent in AI (CoTAI, based in Ho Chi Minh City, Viet Nam) - Toronto, Ontario, Canada ๐Ÿ๐Ÿ‡จ๐Ÿ‡ฆ

  • Brainstormed with AI Scientist & developed Data Engineering pipeline & database structure in Python, SQLAlchemy, SQL to retrieve, preprocess big data in million rows, & generate it from API faster by 10h per load, & function loops in Python for Sentiment Analysis
  • Built 90% new Tableau charts & metrics to discover driven factors & intentions, minimize complaints, & negative feedback
  • Compiled Machine & Deep Learning classifiers tackling imbalanced datasets to detect fraud for Bankingโ€™s Marketing Targets

Projects:

Topic more projects available on GitHub & Tableau Public
IEEE-CIS Fraud Detection (Capstone, Humber College) - Preprocessed data in Python, designed architecture solution, analyzed performance between ML classifiers to determine the best performers on the imbalanced dataset, Balanced Random Forest with ROC AUC around 0.9 & Random Forest with ROC AUC, Precision around 0.9
Safe Roads 2022 Competition - Toronto Police Service - Used Power BI, Python, Azure Machine Learning to analyze geospatial datasets, provide interpretation, conduct A/B testing, determine factors, recommend on road conditions, awareness, top fatal intersections to enhance traffic safety, prevent fatal accidents, achieve prediction using Random Forestโ€™s ROC AUC & Precision around 0.8
Sentiment Analysis - Conducted Sentiment Analysis on customerโ€™s comments & analyzed data generated from a system using Natural Language Processing through API on Fan Pagesโ€™ dialogs of diet products & participated in Data Operations, ETL in Python, SQL in MySQL, Azure, Visualization in Tableau to determine top customers, top efficient fan pages, most crucial intentions & demand entities, peak effective contact hours, peak periods of confirmations, common complaints
Banking Dataset โ€“ Marketing Targets - Used classification methods of ML, DL in Python to predict more accurately filing a claim while avoiding overfitting on an imbalanced dataset; - RUS Boost had the highest Balanced Accuracy, Geometric Mean, F1 scores & best Confusion Matrix among classifiers
SQL Murder Mystery - Determined the extract murder and killing planner with the shortest-possible SQL queries from basic to intermediate querying skills & approaches using: INNER/LEFT JOIN, GROUP BY, WITH, WHERE, Sub-Queries
Porto Seguroโ€™s Safe Driver Prediction - Used classification methods of ML, DL in Python to predict more accurately auto insurance policy holders filing a claim (predict the probability) while avoiding overfitting on imbalanced dataset - RUS Boost had the highest Balanced Accuracy, Geometric Mean, F1 scores & best Confusion Matrix among classifiers
Acquisition & Merger Analysis - Compared techniques between loading dataset in Pythonโ€™s SQL Alchemy to MySQL & loading it in SQL to Hadoop, investigated & identified organizations for the most profitable merger and acquisition by examining accumulated data sets in terms of Sales, Revenue, Product Line in SQL on Zeppelin, visualized charts in Tableau, Power BI
Pharma Portfolio Predictive Analysis - Coded in Python and AzureML to analyze time-series pharmaceutical sales data and forecast the key pharma product and predict the patterns in the future
Annual Sales Analysis & Visualization - Applied EDA in Python, visualized 200K datapoints to answer Revenue questions - Visualized & compared results between charts in Tableau & Power BI to determine that the variables which caused the highest Sales Value: December, San Francisco, peak hours placing orders, top sold products, correlation between Prices & Volumes
Income Analysis & Classification - Preprocessed, analyzed the Income background of all records in Python, SQL & visualized key variables in Tableau / Power BI to determine highlights, trends & predictions of Income types with ML, DL Classifiers
Eden Hotels & Resorts Group - Created a Sales Incentive Plan in Java: input, check password, calculate Salespersons, Revenues & export reports, calculated Hotel Revenueโ€™s metrics in Excel to analyze, visualize different types of KPIs - Designed Database and inserted sample data into tables of hotels, guests, employees & bookings in SQL queries
University Admission - Led a team & built a Java program (< 150 coding lines) to store information of the newly admitted students, prompted user to enter the student name & high school grades, calculated GPA & assigned to the Universityโ€™s schools
Investment Analysis of Shopify and Lightspeed in Canada - Managerial Finance & Accounting Report
Governance & Ethics in Data - Gained the highest grade of 95% in all Professor's classes analyzing ethics & governance models about data manipulated in Cybersecurity, COVID-19, Vaccination, etc. - Analyzed 3 aspects of the ethics model, data governance to mitigate potential challenges in the chosen context
TD Bank's Porterโ€™s Value Chain Analysis (available for being shown only in a section) - Conducted an analysis of TD Bank over history, vision, mission, strategic and financial objectives, External environment based on PESTEL and Five Forces analysis, Internal environment based on SWOT-analysis, resource and capability analysis, and a value chain analysis, the current strategic approach and its various strategic actions, the staffing practices and strategy execution, Organizational structure.
Better Working Word - EY, NASA, Microsoft - Using Python, Machine Learning, Azure Studio, Azure Machine Learning in 3 challenges for 3 months to help locate and protect the biodiversity of frogs by discovering and counting local and global frogs on weather data sampled over space and time (spatiotemporal sampling) with given preliminary F1 score.
US Medicaid Pharmacy Pricing Analysis - Establishing tables by nodes and Graph on Neo4j in Cypher, and on Azure in SQL to predict future prices/quantities and important pharmaceutical products of US Medicaid datasets in Python, AzureML
Home Credit Default Risk - Connected, transformed datasets, conducted EDA in SQL, Scala on Hive, Zeppelin on customized datasets on the to analyze the loan applicants' background and help expanding to those unable to access financial services - Determined on Zeppelin/ Tableau/ Power BI the most significant background check of applicants who got most loan approvals

Academic Progress:

Courses Details
Data Analytics Tools โœ… SAS, SPSS Modeler, SPSS, Excel, Cognos
Managerial Finance & Accounting โœ… Excel (Investment Analysis of Shopify and Lightspeed in Canada)
Big Data โœ… Hadoop, R, Neo4j, Cypher, Graph
Quantitative Research Methods I & II โœ… Descriptive & Inferential Statistics, Probability, Normal Distribution, Estimation, Hypothesis Testing
Database & SQL โœ… SQL, ERD, Normalization
Governance & Ethics in Data โœ… Reflection & Integration of Knowledge: Governance & Ethics of Analytics in in Data, AI & Technology - only available from hyperlink in my Resume - (graded 95/100 & feedbacked by Professor. Kathleen Mcginn ๐Ÿ˜ง : "My goodness Phuong,Thank you for sharing this with me. It is indeed a very deep, intelligent and meaningful piece of writing that deserves an excellent grade - 95 (!) - the highest grade I have given so far. Congratulations - you have truly earned it.")
Canadian Business & Strategy โœ… TD Bank's Porterโ€™s Value Chain Analysis & Nucor Corporation Analysis
Marketing โœ…
Predictive Analytics โœ… linear and multiple regression, decision trees, linear programming, factor analysis, cluster analysis, modelling
Machine Learning and Programming 1 & 2 โœ… Python: Data Mining, Data Science, Data Visualization, Dimension Reduction, CRM, Evaluation Predictive Performance, Multiple Linear Regression, K-NN, Naives Bayes Classifier, Classification, Regression Trees, Logistic Regression, Cluster Analysis
Communication & Data Visualization โœ… Excel, Tableau
Business Intelligence โœ… Power BI
Machine Learning and Programming 2 โœ… Python: Time Series Forecasting, Market Basket Analysis, Natural Language Processing
Capstone Course โœ… IEEE-CIS Fraud Detection (Capstone, Humber College)
Project Management โœ… Boeing Aviation Case Report of Sales and Supply Boost

Languages, Technologies, Skills:

Criteria Details
Programming Certified SQL, Python (Pandas, Numpy, Matplotlib, Keras, SkLearn), Tensorflow Developer (in progress), T-SQL, PL/pgSQL, Java, Scala, R, HTML
Viz & ETL Certified Power BI, Tableau Desktop, Alteryx Advanced Designer, Alteryx Designer Cloud Advanced, Alteryx Machine Learning Fundamentals, Tableau Prep, SPSS (Modeler, Statistics), SAS (Studio, Enterprise Miner), Cognos, Qlik
Big Data Certified Azure Data Fundamentals, Azure AI Fundamentals, Alteryx Server Administration, Databricks Accredited Lakehouse Fundamentals, AWS (ML & Data Analytics), Azure (ML, Synapse), MySQL, MongoDB, MS SQL, Oracle, PostgreSQL, Hadoop (Hive, Zeppelin), Neo4j, Splunk
Collaboration wiki Atlassian Confluence, Jira, Trello
Languages English ๐Ÿ‡บ๐Ÿ‡ฒ (fluent), Vietnamese (native), French ๐Ÿ‡จ๐Ÿ‡ฆ๐Ÿ‡จ๐Ÿ‡ต (basic overall, intermediate reading), German ๐Ÿ‡ฉ๐Ÿ‡ช (basic overall, intermediate reading)
Others Certified Six Sigma White Belt, Excel (Solver, GoalSeek, Macros), GDPR, ServiceNow, Confluence, Jira, Trello, Machine & Deep Learning, AI, Teamwork, Statistics, Probability, Sales, Accounting, Finance, Project Management, Hospitality, Presentation, Communication, Marketing

Other Certificates:

Earned ๐Ÿ… Details
ProtonX Tensorflow Developer (Statistics, Probability, Algebra, Machine Learning, Deep Learning, AI)
Center of Talent in AI Python, Machine Learning, Deep Learning, AI, Reinforcement Learning
Nordic Coder Python, Tableau
DataCamp SQL Intermediate
Microsoft Office Specialist Word, Excel, Powerpoint
Udemy Power BI for Business Intelligence

Dai-Phuong Ngo (Liam)'s Projects

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.