Light

xurui314 / glm4v-finetune Goto Github PK

View Code? Open in Web Editor NEW

9.0 1.0 0.0 31 KB

Support finetuning GLM4v with zero2

Python 100.00%

glm4v-finetune's Introduction

GLM4v-Finetune

Support finetuning GLM4v with zero2

简介

使用CogVLM2代码框架，支持GLM4v微调，具体的改动有如下：

GLM4v模型文件forward修改、数据预处理修改等
自定义数据集格式更简单

您需要从huggingface中下载的glm4v模型，把THUDM/glm-4v-9b中的modeling_chatglm.py替换为本仓库中的modeling_chatglm.py。

自定义数据集

组织格式的代码可以参考：

import os
import json
import random

folder_path = ''
# file_names = os.listdir(folder_path)
file_names = ['refexp_train.jsonl', 'widget_caption_maxlen_train.jsonl',  'taperception_train.jsonl']
step_i = 0
train_step = []
for file_name in file_names:    
    json_data = []
    with open(folder_path + file_name, 'r') as file:
        for line in file:
            json_data.append(json.loads(line))

    for sample in json_data:
        sample_json = {}
        conversations = []
        img_path = '' + '/' + sample['img_name']
        conv_user = {"role": "user", "content": ""}
        conv_user["content"] += sample['prompt']
        conv_ai = {"role": "assistant", "content": sample['text']}
        conversations.append(conv_user)
        conversations.append(conv_ai)
        sample_json['conversations'] = conversations
        sample_json['image'] = img_path
        sample_json['id'] = step_i

        train_step.append(sample_json)
        step_i += 1
random.shuffle(train_step)
print("Num of total step: " + str(len(train_step)))
json.dump(train_step, open('public_glm4_grouding_train.json', "w"), ensure_ascii=False)

运行方法

进入代码文件夹下，运行 deepspeed --include localhost:2,3,6,7 peft_lora_glm4v.py --ds_config ds_config.yaml

glm4v-finetune's People

Contributors

Stargazers

Watchers

glm4v-finetune's Issues

Inference AttributeError: 'ChatGLMForConditionalGeneration' object has no attribute 'build_conversation_input_ids'

微调包含vision模块吗

本仓库微调代码会训练vision模块吗？

请问内存要求是多大呢？8卡，每张卡40G能训起来么？

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.