PeterPham's Projects
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
π WebRTC - P2P - Simple, Secure, Fast Real-Time Video Conferences Up to 4k and 60fps, compatible with all browsers and platforms.
[EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.
Code and Data for WWW'23 paper Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation
Machine Learning Engineering Guides and Tools
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization"
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
A collection of the the best ML news every week (research, news, resources)
Explanation to key concepts in ML
π₯Highlighting the top ML papers every week.
The code from the Machine Learning Bookcamp book and a free course based on the book
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Open source platform for the machine learning lifecycle
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Free MLOps course from DataTalks.Club
MLX: An array framework for Apple silicon
Examples in the MLX framework
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
This is the offiicial code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight
stock market models - have fun
Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
π₯ (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.
Multi-Joint dynamics with Contact. A general purpose physics simulator.
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
Muzic: Music Understanding and Generation with Artificial Intelligence
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch