Comments (12)
@HAWLYQ can you test for deepspeed stage 3 integrations, specifically for deadlock issues while training/fine-tuning?
Hi, @AR-javis , I'm debugging with deepspeed zero3~ I will try my best to release it within a week~
from mplug-docowl.
Hi, @coder4nlp , the training code is scheduled for release at the end of this month. If you are urgent to finetune our model, you can refer to the training code of mPLUG-Owl2 and make some revisions to adjust to our model. Some hyper-parameters can refer to our paper.
from mplug-docowl.
Hi, @coder4nlp @whalefa1I @AR-javis We have released training codes for finetuning docowl1.5 in https://github.com/X-PLUG/mPLUG-DocOwl/tree/main/DocOwl1.5. It's temporarily supported by DeepSpeed zero2. We meet deadlock issues with zero3, if you have any suggestions to share with us, we will appreciate very much~
from mplug-docowl.
Hi, @coder4nlp , the training code is scheduled for release at the end of this month. If you are urgent to finetune our model, you can refer to the training code of mPLUG-Owl2 and make some revisions to adjust to our model. Some hyper-parameters can refer to our paper.
almost there!
from mplug-docowl.
Hi, @coder4nlp , the training code is scheduled for release at the end of this month. If you are urgent to finetune our model, you can refer to the training code of mPLUG-Owl2 and make some revisions to adjust to our model. Some hyper-parameters can refer to our paper.
almost there!
training codes with DeepSpeed is under debugging and testing 。゚・ (>﹏<) ・゚。
from mplug-docowl.
Hi, @coder4nlp , the training code is scheduled for release at the end of this month. If you are urgent to finetune our model, you can refer to the training code of mPLUG-Owl2 and make some revisions to adjust to our model. Some hyper-parameters can refer to our paper.
almost there!
training codes with DeepSpeed is under debugging and testing 。゚・ (>﹏<) ・゚。
@HAWLYQ So sad......。゚・ (>﹏<) ・゚。
from mplug-docowl.
@HAWLYQ can you test for deepspeed stage 3 integrations, specifically for deadlock issues while training/fine-tuning?
from mplug-docowl.
where are the schedules?
from mplug-docowl.
where are the schedules?
Within this week~
from mplug-docowl.
@HAWLYQ Thank you very much!
from mplug-docowl.
hello, how about the venv requirements? I've not seen the requirements.txt.
from mplug-docowl.
hello, how about the venv requirements? I've not seen the requirements.txt.
Hi, @Coobiw , our environment is the same as mPLUG-Owl2, you can follow instructions at https://github.com/X-PLUG/mPLUG-Owl/tree/main/mPLUG-Owl2 to prepare environments.
from mplug-docowl.
Related Issues (20)
- Multilingual support HOT 1
- [Bug] Issue with inference HOT 2
- Tiny Chart inference code not available HOT 2
- Try to run sigclip model HOT 2
- M-Paper data HOT 1
- Error occurred while training DocOwl1.5 on my dataset HOT 4
- Does TinyChart inference work on a CPU? HOT 4
- requirements.txt for local running? HOT 1
- Question about OCR test results HOT 2
- DocOwl1.5-chat inference HOT 1
- DocOwl1.5-Omni Training HOT 1
- finetuning mPLUG-DocOwl for documnet data extraction HOT 2
- Finetuning Tinychart HOT 1
- When will opensrouce tinyChart data? HOT 2
- ERROR: safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
- IndexError: piece id is out of range HOT 1
- requirements file HOT 1
- About dataset when train model HOT 4
- issue while finetuning DocOwl1.5-Omni on dataset HOT 4
- Error with DocOwl1.5/app.py
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mplug-docowl.