Comments (2)
I believe the proper filename is configs/train/finetune.yaml
from gpt4all.
I believe the proper filename is
configs/train/finetune.yaml
Thank you @sbmsr . Would you mind help me with another issue? I tried using the Training Data Without P3 and Full Dataset with P3 in the dataset_path: raw_data_sanity_cleaned/data.jsonl
with model zpn/llama-7b
, but I got TypeError: len() of a 0-d tensor
while preparing data.
Map (num_proc=64): 99%|██████████████████████████████████▊| 761922/765889 [05:46<00:00, 17174.63 examples/s]
multiprocess.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/anaconda3/envs/gpt4all/lib/python3.10/site-packages/multiprocess/pool.py", line 125, in worker
result = (True, func(*args, **kwds))
File "/anaconda3/envs/gpt4all/lib/python3.10/site-packages/datasets/utils/py_utils.py", line 1349, in _write_generator_to_queue
for i, result in enumerate(func(**kwargs)):
File "/anaconda3/envs/gpt4all/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3329, in _map_single
batch = apply_function_on_filtered_inputs(
File "/anaconda3/envs/gpt4all/lib/python3.10/site-packages/datasets/arrow_dataset.py", line 3210, in apply_function_on_filtered_inputs
processed_inputs = function(*fn_args, *additional_args, **fn_kwargs)
File "/llm/gpt4all/data.py", line 84, in <lambda>
lambda ele: tokenize_inputs(config, tokenizer, ele),
File "/llm/gpt4all/data.py", line 19, in tokenize_inputs
input_len = len(input_tokens)
File "/anaconda3/envs/gpt4all/lib/python3.10/site-packages/torch/_tensor.py", line 908, in __len__
raise TypeError("len() of a 0-d tensor")
TypeError: len() of a 0-d tensor
"""
Solved in #53
from gpt4all.
Related Issues (20)
- [Feature] indicate the max context size of each model in the download list ?
- [Feature] check the compatibility of a hugging face model before fully downloading it ? HOT 1
- Idk what this is honestly HOT 1
- Python Bindings: Model no longer kept in cache HOT 2
- Reliable crash test in 2.7.5 and 2.8.0pre1 HOT 3
- Python bindings: add possibility to clear history of a chat_session HOT 4
- "availableGPUDevices: built without Kompute" error when installed via pip on macOS M2 HOT 2
- [Feature] Ability to populate previous chat history when using chat_session() HOT 7
- 增加对Intel ARC A770显卡推理支持 HOT 3
- Ver. 2.7.4 nad Ver. 2.8.0 pre not starting gui on Windows HOT 2
- API service response data missing
- Building GPT4all from source - Windows - Qt.dll errors HOT 13
- Is there a WebUI available? HOT 1
- Need `#include <algorithm>` to build `gpt4all-backend/llamamodel.cpp`
- Windows 11. Nothing happens HOT 7
- llama.cpp assertion fails: "non-causal attention requires n_ubatch >= n_tokens" HOT 8
- Is it possible to make the "Stop Generating" button stop everything? HOT 2
- Default model useless/not working HOT 1
- v2.8.0 crashes and disappears when using CUDA (OOM? PTX issue?) HOT 12
- Certain models with "code" in their name crash GPT4All 2.8.0 HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gpt4all.