DoctorGLM

基于 ChatGLM-6B的中文问诊模型

训练数据

Dataset	Department	Language	Q&A	Chat	Number	Syn.	Size	Weight
CMD.	Surgical	CN	✔	×	116K	×	52MB
	Obstetrics and Gynecology	CN	✔	×	229K	×	78MB
	Pediatrics	CN	✔	×	117K	×	47MB
	Internal Medicine	CN	✔	×	307K	×	102MB
	Andriatria	CN	✔	×	113K	×	44MB
	Merged	CN	✔	×	1.9M	×		Doctor_GLM/ckpt
MedDialog	Multiple	CN&EN	✔	✔	3.4M	×	1.5GB	ptuning_weight
ChatDoctor	Multiple	EN	✔	×	5.4K	✔	2.9MB	Coming soon
HearlthcareMagic	Multiple	EN	✔	×	200K	×	216MB	Coming soon

https://github.com/Toyhom/Chinese-medical-dialogue-data

使用

lora

显存 >= 13G （未量化版本）
pip install deep_training cpm_kernels icetk transformers>=4.26.1
torch >= 1.12.0 (icetk依赖cpu版torch, 建议先安装icetk后安装gpu版torch)
lora的finetune代码来自 https://github.com/ssbuild/chatglm_finetuning

对于fp16模型，直接使用Doctor_GLM/chat_lora.ipynb，由于官方更新了chatglm的权重，我们将老版权重放在了 old_pretrain_model 可以下载后解压到old_pretrain_model目录

量化的模型我们打了个包，使用方便，但是效果目前来看很成问题：INT4需要大约6G显存，INT8需要大约8G显存，在Doctor_GLM/chat_lora_quant.ipynb下使用

from load_quantization import load_int
tokenizer, model = load_int('DoctorGLM-6B-INT8-6merge-int8.pt',8)
response, history = model.chat(tokenizer,
                               "我爷爷高血压可以喝咖啡吗",
                               history=[],
                               max_length=2048)
print(response)

模型下载链接： INT4 INT8 量化方法均为分层的线性量化。目前量化模型的性能仍有较大问题，后期我们会对量化方法和模型进行更新

p-tuningv2

我们实现p-tuningv2时基于官方新版本权重，可以在hugging face上下载，也可以从我们的链接下载 pretrain_model
p-tuningv2的权重在 ptuning_weight ，下载后解压到ckpt/ptuningv2目录下, 然后使用Doctor_GLM/chat_ptuning_v2.ipynb，根据需要调整quantization_bit为4或8

模型在线部署

为了方便部署并随时调整模型生成回答时的参数，我们提供了基于 Gradio 库的部署代码，路径为 Doctor_GLM/gradio.ipynb。运行之后，访问本机的7860或者代码声明的其他端口即可以运行Demo，模型在生成回答时的参数可以由用户自由调控。若想让部署的模型可以被局域网之外的其他用户访问，需要将sharing设置为 True（默认为False）。部署之后运行效果如下所示：

即将到来的更新

(2023.4.21) 对话中加入参考文献，模型上传到huggingface

第一次运行会下载chatGLM-6B权重, 如果已有chatGLM-6B权重可以将data_utils.py里的路径修改为自己的权重目录

结果示例

我们随机跑了100个结果，在 ./results目录下，两份json文件分别为由ChatGLM, DoctorGLM得到的结果，目前存在大量复读机。

开发者群

DoctorGLM开发者群，如果你也对基于ChatGLM的应用开发感兴趣，欢迎加入我们的讨论组。

引用

@article{xiong2023doctorglm,
      title={DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task}, 
      author={Honglin Xiong and Sheng Wang and Yitao Zhu and Zihao Zhao and Yuxiao Liu and Linlin Huang and Qian Wang and Dinggang Shen},
}

daxiajames / doctorglm Goto Github PK

doctorglm's Introduction

DoctorGLM

最近更新

训练数据

使用

lora

p-tuningv2

模型在线部署

最近更新

即将到来的更新

结果示例

开发者群

引用

doctorglm's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent