Comments (4)
I made a fork with BOFT commented out for now: https://github.com/danielhanchen/peft
And here is a repro that works once BOFT is commented out: https://colab.research.google.com/drive/1Y_MdJnS73hIlR_t2DXgXCgqKVwXHPE82?usp=sharing
I manually added the below to every line and tried isolating the problem:
def install_llama_cpp_blocking(use_cuda = True):
    import subprocess
    import os
    import psutil
    # https://github.com/ggerganov/llama.cpp/issues/7062
    # Weirdly GPU conversion for GGUF breaks??
    # use_cuda = "LLAMA_CUDA=1" if use_cuda else ""
    commands = [
        "git clone --recursive https://github.com/ggerganov/llama.cpp",
        "make clean -C llama.cpp",
        # https://github.com/ggerganov/llama.cpp/issues/7062
        # Weirdly GPU conversion for GGUF breaks??
        # f"{use_cuda} make all -j{psutil.cpu_count()*2} -C llama.cpp",
        f"make all -j{psutil.cpu_count()*2} -C llama.cpp",
        "pip install gguf protobuf",
    ]
    # if os.path.exists("llama.cpp"): return
    for command in commands:
        with subprocess.Popen(command, shell = True, stdout = subprocess.PIPE, stderr = subprocess.STDOUT, bufsize = 1) as sp:
            for line in sp.stdout:
                line = line.decode("utf-8", errors = "replace")
                if "undefined reference" in line:
                    raise RuntimeError("Failed compiling llama.cpp")
                # print(line, flush = True, end = "")
            pass
        pass
    pass
Running this Python script reproduces the error on my machine:
import os
import subprocess
from peft import PeftModelForCausalLM

os.chdir("/tmp/")
commands = [
    "git clone --recursive https://github.com/ggerganov/llama.cpp",
    "make clean -C llama.cpp",
    "make all -j4 -C llama.cpp",
    "echo $?",
]
for command in commands:
    with subprocess.Popen(command, shell=True, stdout=subprocess.PIPE, stderr=subprocess.STDOUT, bufsize=1) as sp:
        for line in sp.stdout:
            line = line.decode("utf-8", errors = "replace")
            print(line, end = "")
            if "undefined reference" in line:
                raise RuntimeError("Failed compiling llama.cpp")
    print(f"-------------- finished: {command} --------------")
print("done")
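One note on the script above: each entry in `commands` runs in its own shell, so the final `echo $?` prints the status of a fresh shell rather than the exit status of `make`. A minimal sketch of a variant that checks the exit code directly is below. The helper name `run_and_scan` and its `marker` parameter are made up for illustration and are not part of PEFT or llama.cpp.

```python
import subprocess


def run_and_scan(command, marker="undefined reference"):
    """Run a shell command, echo its output, and fail on `marker` or a non-zero exit.

    Hypothetical helper: the name and signature are illustrative only.
    """
    with subprocess.Popen(
        command,
        shell=True,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr so linker errors are scanned too
        text=True,                 # decode to str instead of calling .decode() per line
        errors="replace",
    ) as sp:
        for line in sp.stdout:
            print(line, end="")
            if marker in line:
                sp.kill()  # stop the build before raising
                raise RuntimeError(f"Found {marker!r} while running: {command}")
    # Leaving the `with` block waits for the process, so returncode is set here.
    if sp.returncode != 0:
        raise RuntimeError(f"Command exited with status {sp.returncode}: {command}")
    return sp.returncode
```

With this helper, the loop in the repro could call `run_and_scan(command)` for each command and drop the `echo $?` entry.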
from peft.
Commenting out these lines seems to fix it for me:
peft/src/peft/tuners/boft/layer.py
Lines 34 to 35 in ae1ae20
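Since the build only fails after the import, one thing worth checking is whether the import changes process state that child processes inherit, such as environment variables. This is an assumption made for diagnosis and is not confirmed in this thread. The sketch below records `os.environ` before and after an arbitrary action and reports the difference. The helper name `env_changes` is hypothetical.

```python
import os


def env_changes(action):
    """Return {name: (before, after)} for every environment variable `action` changed.

    Hypothetical diagnostic helper. A value of None means the variable was unset.
    """
    before = dict(os.environ)
    action()
    after = dict(os.environ)
    return {
        name: (before.get(name), after.get(name))
        for name in set(before) | set(after)
        if before.get(name) != after.get(name)
    }
```

To apply it to this issue, something like `env_changes(lambda: importlib.import_module("peft"))` (after `import importlib`) would list any variables set during the import. An empty result would suggest the side effect lies elsewhere.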