Doctor GPT

Overview

DoctorGPT is a Large Language Model that can pass the US Medical Licensing Exam. This is an open-source project with a mission to provide everyone their own private doctor. DoctorGPT is a version of Meta's Llama2 7 billion parameter Large Language Model that was fine-tuned on a Medical Dialogue Dataset, then further improved using Reinforcement Learning & Constitutional AI. Since the model is only 3 Gigabytes in size, it fits on any local device, so there is no need to pay an API to use it. It's free, made for offline usage which preserves patient confidentiality, and it's available on iOS, Android, and Web. Pull requests for feature additions and improvements are encouraged.

Dependencies

Numpy (Use matrix math operations)
PyTorch (Build Deep Learning models)
Datasets (Access datasets from huggingface hub)
Huggingface_hub (access huggingface data & models)
Transformers (Access models from HuggingFace hub)
Trl (Transformer Reinforcement Learning. And fine-tuning.)
Bitsandbytes (makes models smaller, aka 'quantization')
Sentencepiece (Byte Pair Encoding scheme aka 'tokenization')
OpenAI (Create synthetic fine-tuning and reward model data)
TVM (Tensor Virtual Machine, converts onnx model to efficient cross-platform use)
Peft (Parameter Efficient Fine Tuning, use low rank adaption (LoRa) to fine-tune)
Onnx (Convert trained model to universal format)

Installation

Install all dependencies in one line using pip

pip install numpy torch datasets huggingface_hub transformers trl bitsandbytes sentencepiece openai tvm peft onnx

Training

In order to train the model, you can run the training.ipynb notebook locally or remotely via a cloud service like Google Colab Pro. The training process requires a GPU, and if you don't have one then the most accessible option i found was using Google Colab Pro which costs $10/month. The total training time for DoctorGPT including supervised fine-tuning of the initial LLama model on custom medical data, as well as further improving it via Reinforcement Learning from Constitional AI Feedback took 24 hours on a paid instance of Google Colab. If you're interested in learning more about how this process works, details are in the training.ipynb notebook.

Cloud Training

click here: https://colab.research.google.com/github/llSourcell/DoctorGPT/blob/main/llama2.ipynb

Local Training

git clone https://github.com/llSourcell/DoctorGPT.git
jupyter training.ipynb

Get jupyter here

Usage

iOS

Step 1: Download the iOS Machine Learning Compilation Chat Repository
Step 2: Follow the installation steps
Step 3: Once the app is running on your iOS device or simulator, tap "add model variant"
Step 4: Enter the URL for the latest DoctorGPT model to download it: https://huggingface.com/llsourcell/doctorgpt
Step 5: Tap 'Add Model' and start chatting locally, inference runs on device. No internet connection needed!

Android

Step 1: Download the Android Machine Learning Compilation Chat Repository
Step 2: Follow the installation steps
Step 3: Tap "add model variant"
Step 4: Enter the URL for the latest DoctorGPT model to download it: https://huggingface.com/llsourcell/doctorgpt
Step 5: Tap 'Add Model' and start chatting locally! No internet needed.

Web (TODO)

As an experiment in Online Learning using actual human feedback, i want to deploy the model as a Flask API with a React front-end. In this case, anyone can chat with the model at this URL. After each query, a human can rate the model's response. This rating is then used to further improve the model's performance through reinforcement learning. to run the app, download flask and then you can run:

flask run

Then visit localhost:3000 to interact with it! You can also deploy to vercel

Credits

Meta, MedAlpaca, Apache, MLC Chat & OctoML

hengleeme / doctorgpt Goto Github PK

doctorgpt's Introduction

Doctor GPT

Overview

Dependencies

Installation

Training

Cloud Training

Local Training

Usage

iOS

Android

Web (TODO)

Credits

doctorgpt's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent