Git Product home page Git Product logo

👋 Hello, I'm Kenneth Leung

  • Thanks for popping by! As an avid learner, bold builder, curious explorer, and driven doer with a bias towards action, I enjoy seeking and solving meaningful problems with data and technology while having fun at the same time.
  • I welcome you to join me on a journey of data science discovery! Follow me on GitHub, Medium, and LinkedIn to stay updated with more engaging and practical content.
  • You can find my data science portfolio here, where every project and article was born out of inspiration, curiosity, and motivation. Feel free to connect for a chat (coffee or virtual) to discuss shared interests and topics!

Project Count

How to reach me

    Buy Me A Coffee 

Portfolio Contents

  1. Computer Vision
  2. Database Management
  3. Data Extraction and Web Scraping
  4. Data Science Certification Guides
  5. Data Science Toolkit
  6. Data Science in the Real World
  7. Generative AI
  8. Insights from Data Science Talks
  9. Machine Learning
  10. MLOps
  11. Natural Language Processing
  12. Networks and Graphs
  13. Sports Analytics
  14. Visualization
  15. Web Development
  16. Web3 and Metaverse
  17. Writing for DataCamp
  18. Writing Tips

Projects with ⭐ are my personal favourites, so do check them out!


Computer Vision 👁️

Title Article Repo
Classifying Images of Alcoholic Beverages with fast.ai v2 🔗 🔗
Russian Car Plate Detection with OpenCV and TesseractOCR 🔗 🔗
Evaluate OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER) 🔗 🔗
Top Python libraries for Image Augmentation in Computer Vision 🔗 🔗
⭐ PyTorch Ignite Tutorial - Classifying Tiny ImageNet with EfficientNet 🔗 🔗
Practical Guide to Transfer Learning in TensorFlow for Multiclass Image Classification 🔗 🔗

Database Management 🗄️

Title Article Repo
⭐ Definitive Guide to Creating a SQL Database on Cloud with AWS and Python 🔗 🔗
PyMySQL - Connecting Python and SQL for Data Science 🔗 🔗

Data Extraction and Web Scraping 🧰

Title Article Repo
Using OneMap API to extract Singapore postal codes, coordinates and travel distance - 🔗
A Detailed Web Scraping Walkthrough Using Python and Selenium 🔗 🔗
⭐ How to Web Scrape Wikipedia using LangChain Agents and Tools with OpenAI's LLMs and Function Calling 🔗 🔗

Data Science Certification Guides 👨‍🎓

Title Article Repo
3 Steps to Get AWS Cloud Practitioner Certified in 2 Weeks 🔗 🔗
3 Steps to Get Tableau Desktop Certified in 2 Weeks 🔗 -
⭐ No-Frills Guide to Passing the AWS Certified Machine Learning Specialty Exam 🔗 -

Data Science Toolkit 🛠️

Title Article Repo
Common Python codes for Data Wrangling - 🔗
Enhance your Python code’s readability with pycodestyle 🔗 -
Free Resources for Generating Realistic Fake Data 🔗 -
Most Starred and Forked GitHub Repos for Data Science and Python 🔗 -
Most Starred and Forked GitHub Repos for Data Science and R 🔗 -
Automatically Generate Machine Learning Code with Just a Few Clicks 🔗 -
Read and Modify Image Metadata with Python 🔗 🔗
Top Tips to Google Search Like a Seasoned Data Scientist 🔗 -
How to Swap Day and Month of Incorrectly Formatted Excel Dates 🔗 -

Data Science in the Real World 🌏

Title Article Repo
Exploring Illegal Drugs in Singapore — A Data Perspective 🔗 🔗
Pharmacokinetic Modeling of Drug Concentration Trajectories using Ordinary Differential Equations (ODE) and Global Optimization with Differential Evolution - 🔗
Healthcare’s AI Future — In Conversation with Andrew Ng and Fei-Fei Li 🔗 -
Real-World Data Science Use Cases in the Insurance Industry 🔗 -
⭐ Failed-ML: Compilation of high-profile real-world examples of failed machine learning projects 🔗 🔗

Generative AI 🤖

Title Article Repo
Generative AI Pharmacist - Macy 🔗 🔗
⭐ ChatPod - Q&A over your Podcasts with Whisper, FAISS, and LangChain 🔗 🔗
⭐ Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A 🔗 🔗
Domain LLMs - Compilation of Customized LLMs for Specific Domains and Industries - 🔗
⭐ Text-to-Audio Generation with Bark, Clearly Explained 🔗 🔗
Guide to ChatGPT's Advanced Settings — Top P, Frequency Penalties, Temperature, and More 🔗 -
Inside the Leaked System Prompts of GPT-4, Gemini 1.5, Claude 3, and More 🔗 -

Insights from Data Science Talks 👨‍🏫

Title Article Repo
Bridging AI’s Proof-of-Concept to Production Gap — Insights from Andrew Ng 🔗 -

Machine Learning 🎰

Title Article Repo
Exploring Condominium Rental Prices with Web Scraping and Exploratory Data Analysis 🔗 🔗
Using Ensemble Regressors to Predict Condominium Rental Prices 🔗 🔗
The Dying ReLU Problem, Clearly Explained 🔗 -
Why Bootstrapping Actually Works 🔗 -
⭐ Assumptions of Logistic Regression, Clearly Explained 🔗 🔗
Data-Centric AI Competition - Tips and Tricks of a Top 5% Finish 🔗 🔗
Credit Card Fraud Detection with AutoXGB 🔗 🔗
⭐ Micro, Macro & Weighted Averages of F1 Score, Clearly Explained 🔗 -
Principal Component Regression - Clearly Explained and Implemented 🔗 🔗
⭐ Feature Selection with Simulated Annealing in Python, Clearly Explained 🔗 🔗
Quick Primer on Types of Missing Data and Imputation Techniques 🔗 -
Imputation of Missing Data in Tables with DataWig 🔗 🔗

MLOps - Machine Learning Operations 👨‍🔧

Title Article Repo
Key Learning Points from MLOps Specialization — Course 1/4 🔗 🔗
Key Learning Points from MLOps Specialization — Course 2/4 🔗 🔗
Key Learning Points from MLOps Specialization — Course 3/4 🔗 🔗
Key Learning Points from MLOps Specialization — Course 4/4 🔗 🔗
⭐ End-to-End AutoML Pipeline with H2O AutoML, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell 🔗 🔗
⭐ How to Dockerize Machine Learning Applications Built with H2O, MLflow, FastAPI, and Streamlit 🔗 🔗
⭐ Building and Managing an Isolation Forest Anomaly Detection Pipeline with Kedro 🔗 🔗

Natural Language Processing 📑

Title Article Repo
COVID-19 Vaccine — What’s the Public Sentiment? 🔗 🔗
Keyword Extraction and Analysis Pipeline with KeyBERT and Taipy 🔗 🔗

Networks and Graphs 🌐

Title Article Repo
⭐ Network Analysis and Visualization of Drug-Drug Interactions 🔗 🔗
How to Deploy Interactive Pyvis Network Graphs on Streamlit 🔗 🔗
A No-Code Approach to Building Knowledge Graphs 🔗 🔗

Sports Analytics ⚽

Title Article Repo
⭐ Analyzing English Premier League VAR Football Decisions 🔗 🔗
Combining Python and R for FIFA Football World Ranking Analysis 🔗 🔗

Visualization 📈

Title Article Repo
Uniform Singapore Energy Price and Demand Forecast Dashboard (with Plotly Dash) - 🔗
Visualizing Fortune 500 Companies in a Bar Chart Race 🔗 🔗
How to Easily Draw Neural Network Architecture Diagrams 🔗 🔗

Web Development 🖥️

Title Article Repo
⭐ Post COVID-19 Vaccination Wait-Time Tracker (with Python Flask) 🔗 🔗
From HTTP to HTTPS — Easily Secure Flask Web Apps With Talisman 🔗 -
⭐ Food King Directory (in collaboration with Night Owl Cinematics) 🔗 🔗

Web3 and Metaverse 👨‍💻

Title Article Repo
The Web3 / Metaverse Glossary — A Keyword Guide to the Tech Future 🔗 -

Writing for DataCamp ✍️

Title Article Repo
⭐ What Mature Data Infrastructure Looks Like 🔗 -
Democratizing Data in Government Agencies 🔗 -
A Survey Into Data Governance Tools 🔗 -
Scaling Data Science With Data Governance 🔗 -
3 Reasons Why All Teams Should Learn SQL 🔗 -
3 Reasons Why All Teams Should Learn R 🔗 -
How Tableau Helps Your Organization Achieve Greater Data Insights 🔗 -
How PowerBI Helps Your Organization Achieve Greater Data Insights 🔗 -

Writing Tips 📜

Title Article Repo
Create a Clickable Table of Contents for Your Medium Posts 🔗 -

Kenneth Leung's Projects

aws-rds-mysql-python icon aws-rds-mysql-python

Integrating Amazon RDS, MySQL Workbench, and PyMySQL to build and deploy a database on the cloud

domain-llms icon domain-llms

Comprehensive Compilation of Customized LLMs for Specific Domains and Industries

failed-ml icon failed-ml

Compilation of high-profile real-world examples of failed machine learning projects

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.