Git Product home page Git Product logo

A passionate researcher in efficient machine learning design with a focus on large language models (text, vision), classical machine learning, mlops, and parallel computing.


🎓 Pursuing PhD in Electrical and Computer Engineering @ the Kansas State University.

📖 Graduate research assistant @ the ISCAAS Lab.

💻 Currently developing lightweight large language models using knowledge distillation.

🌱 Love to make research projects, tutorials, and insightful technical blogs. Personal website

⚡ Fun fact: I love to travel and attend various community festivals.

Technical Skills:

Connect with me:

Email LinkedIn Medium

Muhammad Ali Shafique's Projects

alishafique3.github.io icon alishafique3.github.io

Welcome to my personal technical website! Here, you'll find a curated collection of my research projects, tutorials, and insightful technical blogs. As a passionate researcher and learner in the field of Generative AI, I aim to share my knowledge and research with fellow enthusiasts and aspiring learners.

distributed_training_of_llm_using_deepspeed icon distributed_training_of_llm_using_deepspeed

In this project, LLM (model: distilbert) is finetuned on a multiple GPUs for text classification task. Distributed training is performed using deepspeed (ZeRO 1, 2, and 3) with profiling in wandb.

llm-evaluations-hub icon llm-evaluations-hub

A repository that provides a thorough collection of approaches and methods used for evaluating Large Language Models (LLMs).

ml_and_dl_made_easy icon ml_and_dl_made_easy

Here, I talk about cool stuff of deep learning. I try to explain tricky things in a storytelling way. Whether you know a lot about these concepts or just a bit, my blogs will help you to learn or refresh your concepts.

pytorch_training_optimization_with_memory_analysis icon pytorch_training_optimization_with_memory_analysis

In this project, training stage is optimized using memory analysis in Pytorch Tensorboard. Automatic mixed precision, increased batch size, reduced H2D copy, multiprocessing and pinned memory techniques are used for optimizations.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.