
flan-alpaca-lora's Introduction

๐Ÿฎ๐Ÿฆ™๐ŸคFlan-Alpaca-LoRA: Instruction Tuning from Humans and Machines with Low-Rank Adaptation

This repo trains google/flan-t5 on the Alpaca dataset with low-rank adaptation (LoRA), which reduces the GPU memory required and speeds up training.
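
To get a feel for how little of the model is actually trained, the sketch below attaches a LoRA adapter to Flan-T5 with peft and prints the trainable parameter count. The hyperparameters (r, alpha, dropout) are illustrative assumptions, not necessarily the exact settings in this repo's train.py.

import transformers
from peft import LoraConfig, TaskType, get_peft_model

# Load the frozen base model; only the small LoRA adapter weights are trained.
base_model = transformers.AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

# Example LoRA hyperparameters (assumed values); peft picks default target
# modules ("q", "v") for T5 when target_modules is not set.
lora_config = LoraConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,
    r=8,
    lora_alpha=32,
    lora_dropout=0.05,
)
peft_model = get_peft_model(base_model, lora_config)

# Reports on the order of 0.9M trainable parameters for flan-t5-base,
# consistent with the adapter params column in the table below.
peft_model.print_trainable_parameters()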

Jun 17, 2023: Added a notebook. You can now try flan-alpaca-lora via the Open In Colab notebook.

May 3, 2023: Trained flan-t5-xl on the alpaca-gpt4 dataset.

Apr 13, 2023: Trained flan-t5-xl on the GPTeacher dataset (Instruct and Roleplay), which seems to perform well.

Apr 5, 2023: Trained flan-t5-xxl using 8-bit quantization, so the model fits on a single RTX 3090 GPU (see the 8-bit loading sketch after the table below). All of the models can be found on Hugging Face.

model                     adapter params  data            GPU   training time
flan-alpaca-lora-base     0.9M            alpaca cleaned  3090  20 mins
flan-alpaca-lora-large    2.4M            alpaca cleaned  3090  50 mins
flan-alpaca-lora-xl       4.7M            alpaca cleaned  3090  2.5 hrs
flan-alpaca-lora-xxl      9.4M            alpaca cleaned  3090  10 hrs
flan-gpteacher-lora-xl    4.7M            GPTeacher       3090  80 mins
flan-alpaca-gpt4-lora-xl  4.7M            alpaca-gpt4     3090  3.25 hrs
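
For the xxl model, the base weights are too large for a 24 GB card in full precision, so 8-bit loading is the practical route at inference time as well. Below is a minimal sketch, assuming bitsandbytes and accelerate are installed; the adapter id is assumed by analogy with the -large example in the usage section, so check the Hugging Face Hub for the exact name.

import transformers
from peft import PeftModel

model_name = "google/flan-t5-xxl"
# Adapter id assumed by analogy with reasonwang/flan-alpaca-lora-large below.
peft_model_id = "reasonwang/flan-alpaca-lora-xxl"

tokenizer = transformers.AutoTokenizer.from_pretrained(model_name)
# load_in_8bit quantizes the frozen base weights with bitsandbytes so the
# ~11B-parameter model fits on a single 24 GB GPU; device_map places it automatically.
base_model = transformers.AutoModelForSeq2SeqLM.from_pretrained(
    model_name, load_in_8bit=True, device_map="auto"
)
peft_model = PeftModel.from_pretrained(base_model, peft_model_id)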

Dependencies

torch==1.13.1
transformers==4.29.1
peft==0.3.0
bitsandbytes==0.38.1
accelerate==0.19.0

The newest versions of these packages should also work fine.

Training

The following command fine-tunes Flan-T5-base in only about 20 minutes on a single RTX 3090 GPU:

python train.py \
    --model_name_or_path google/flan-t5-base \
    --data_path ./alpaca_data_cleaned.json \
    --bf16 True \
    --output_dir ./ckpts/ \
    --num_train_epochs 3 \
    --per_device_train_batch_size 8 \
    --gradient_accumulation_steps 8 \
    --evaluation_strategy "no" \
    --save_strategy "no" \
    --learning_rate 5e-4 \
    --weight_decay 0. \
    --warmup_ratio 0.03 \
    --lr_scheduler_type "cosine" \
    --logging_steps 50 \
    --tf32 True
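
Under the hood, each Alpaca record (instruction, optional input, output) has to be rendered into an encoder prompt and decoder labels. The exact prompt template lives in the repo's train.py; the sketch below shows one common way to do it with the datasets library (the template wording, max lengths, and use of datasets here are assumptions, not the repo's exact choices).

import transformers
from datasets import load_dataset

tokenizer = transformers.AutoTokenizer.from_pretrained("google/flan-t5-base")
dataset = load_dataset("json", data_files="./alpaca_data_cleaned.json")["train"]

def preprocess(example):
    # Assumed prompt template; train.py may word this differently.
    prompt = example["instruction"]
    if example.get("input"):
        prompt += "\n" + example["input"]
    model_inputs = tokenizer(prompt, truncation=True, max_length=512)
    # For seq2seq models the target text is tokenized separately as labels.
    labels = tokenizer(example["output"], truncation=True, max_length=512)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = dataset.map(preprocess, remove_columns=dataset.column_names)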

Example usage:

import transformers
from peft import PeftModel

# peft_model_id can be a local save directory or a Hugging Face model id.
model_name = "google/flan-t5-large"
peft_model_id = "reasonwang/flan-alpaca-lora-large"
tokenizer = transformers.AutoTokenizer.from_pretrained(model_name)
base_model = transformers.AutoModelForSeq2SeqLM.from_pretrained(model_name)
peft_model = PeftModel.from_pretrained(base_model, peft_model_id)

# Input an instruction or any other question.
inputs = tokenizer("List a few tips to get good scores in math.", return_tensors="pt")
outputs = peft_model.generate(**inputs, max_length=128, do_sample=True)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
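
The snippet above runs generation on the CPU. If a GPU is available, moving the model and inputs over is usually worthwhile; a minimal addition, assuming a CUDA device:

# Optional: run generation on the GPU instead of the CPU.
peft_model = peft_model.to("cuda")
inputs = tokenizer("List a few tips to get good scores in math.", return_tensors="pt").to("cuda")
outputs = peft_model.generate(**inputs, max_length=128, do_sample=True)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))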

flan-alpaca-lora's Issues

NameError: name 'bnb' is not defined

Getting the following error (traceback abridged):

<cell line: 8>:8
/usr/local/lib/python3.10/dist-packages/peft/peft_model.py:143 in from_pretrained
    model = MODEL_TYPE_TO_PEFT_MODEL_MAPPING[config.task_type](model, config)
/usr/local/lib/python3.10/dist-packages/peft/peft_model.py:642 in __init__
    super().__init__(model, peft_config)
/usr/local/lib/python3.10/dist-packages/peft/peft_model.py:79 in __init__
    self.base_model = LoraModel(peft_config, model)
/usr/local/lib/python3.10/dist-packages/peft/tuners/lora.py:118 in __init__
    self._find_and_replace()
/usr/local/lib/python3.10/dist-packages/peft/tuners/lora.py:148 in _find_and_replace
    if loaded_in_8bit and isinstance(target, bnb.nn.Linear8bitLt):
NameError: name 'bnb' is not defined

Training script takes more than 2 hours to finish

Hi. Thanks for your nice work!

I've tried to run your training script on an RTX 3090 with the exact dependencies you suggested, but it took more than 2 hours to finish instead of 20 minutes. I also tried training flan-t5-large, and it took more than 4 hours. What could be the reasons for this?

Question about training loss

Hi, I'm very interested in your project, but during training I found that the training loss is very large (more than 30). Is that normal?
