📝 I regularly write articles on GFG webiste on topics related to ML/DL/LLM
📫 Reach me [email protected]
Index
-
End-to-end Machine Learning Projects: Repo containing projects that are built in a production-grade style by using modular coding, code containerization, deployment on cloud technology and open source ML tools.
E2E Ml project using MLflow, Pytorch Lightning, GCP cloud, Docker
Data Preprocessing Techniques for ETL using Dask data frame to process 3 Million records
Predict House price using ML regression techniques
-
MLOps Projects: Repo containing projects that focus on using specific open-source ML Ops tools. The below repo can be used as a template to incorporate the tool within an end-to-end ML project.
Deploy MLflow tracking server on GCP instance with PostgreSQL database and GCP cloud storage as backend
Use of DVC for versioning data and storing it into GCP cloud storage.
Steps to install self-managed Kubernetes cluster on GCP and then install Kubeflow and Dask Operator. After the infrastructure setup we can use the kubeflow notebook to run distributed training on a dask cluster
Use Github actions to auto-deploy code on AWS Beanstalk
-
Classical ML Projects: Classical ML algorithm applied to business problems
Fraudulent transaction identification using classical ML algorithms like KNN,LR DT,RF,XG
Churn Prediction using Logistic Regression and PCA for a telecom company
RFM analysis + Kmeans Clustering on Customer Invoice data
Clustering of Countries based on their important Economic Parameters
Using various Auto EDA tools like YDATA PROFILING, AutoViz , Sweetviz, Data prep, Dtale
-
LLM/NLP/DL projects:
Image Classification using the Tensorflow framework and Efficient Net model for a vehicle insurance company during claims processing
Automatic Number Plate Recognition (ANPR) system using YOLO (You Only Look Once) for automatic reading of number plate
Using LLM to build applications like chat with pdf and YouTube video summarizer
-
- DAE -> Use DAE to generate new images
- MCTCT -> Fine tune MCTC model to translate audio to text
- PEFT -> Use PEFT to finetune flan-t5-base model for dialog summarization
- Reinforcement Learning -> Use open ai gym to train an agent in Taxi-v3 environment
- Semantic Similarity -> Use of DOC2VEC, SBERT, INFERSENT, USE for encoding sentences and prediction similarity score
- SeqtoSeq -> Fine tune whisper model for translating audio to text
- Text Generation -> Implement FNet from scratch for text generation
- Wav2Vec2 -> Use of wav2vec2 model for automatic speech recognition
- Translation -> Translate text from English to Hindi
-
-
Python Concepts: Python concepts required for AI/ML/DS
Collection of Python Basic and Advanced Concepts, Programming Questions
Data Structure and algorithms using Python - Basic and advanced level
-
SQL Concepts: SQL concepts required for data analysis
Basic intermediate and advanced concepts for data analysis
-
GENAI_NOTES : Collection of my personal notes plus documents from the net to understand the working of LLM and LIM
LIM and LLM documents
-
CheatSheets:
Collection of good cheatsheets sourced from the internet related to Statistics, ML, Python, SQL