Comments (7)
This fixes the issue for me with GGUFs:
https://www.reddit.com/r/LocalLLaMA/comments/1c7dkxh/tutorial_how_to_make_llama3instruct_ggufs_less/
Problem: Llama-3 uses two different stop tokens, but llama.cpp only supports one. The instruct models seem to always end their turn with <|eot_id|>, but the GGUF sets <|end_of_text|> as its stop token.
Solution: Edit the GGUF file's metadata so it uses the correct stop token (128009 is the token ID of <|eot_id|>):
./gguf-py/scripts/gguf-set-metadata.py /path/to/llama-3.gguf tokenizer.ggml.eos_token_id 128009
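To illustrate why the wrong stop token makes the model keep talking, here is a minimal sketch in plain Python. It does not use llama.cpp or GPT4All; the function name generate_until_stop and the token stream are hypothetical, and the non-special token IDs are arbitrary placeholders. Only 128001 (<|end_of_text|>) and 128009 (<|eot_id|>) are real Llama 3 token IDs.

```python
from typing import Iterable, List, Set

# Special token IDs from the Llama 3 tokenizer.
END_OF_TEXT_ID = 128001  # <|end_of_text|>
EOT_ID = 128009          # <|eot_id|>


def generate_until_stop(token_stream: Iterable[int],
                        stop_ids: Set[int],
                        max_tokens: int = 64) -> List[int]:
    """Collect tokens until a stop ID appears or max_tokens is reached.

    Hypothetical helper: it only mimics the stop check of a generation loop.
    """
    output: List[int] = []
    for token_id in token_stream:
        if token_id in stop_ids or len(output) >= max_tokens:
            break
        output.append(token_id)
    return output


# Placeholder stream: the model ends its answer with <|eot_id|>, and whatever
# it would produce afterwards follows (arbitrary IDs, for illustration only).
stream = [9906, 1917, EOT_ID, 40, 2846, 2103, 7556]

# Unfixed GGUF: only <|end_of_text|> counts as a stop token, so <|eot_id|>
# is treated like any other token and generation runs on.
unfixed = generate_until_stop(stream, {END_OF_TEXT_ID})

# Fixed GGUF: eos_token_id is 128009, so generation stops at <|eot_id|>.
fixed = generate_until_stop(stream, {EOT_ID})

print(len(unfixed), len(fixed))
```

With the unfixed setting, the only thing that ends generation in this sketch is the max_tokens cap, which matches the workaround of lowering the maximum token count or pressing stop manually.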
from gpt4all.
Link to an instruct model with the fix applied: https://huggingface.co/bartowski/Meta-Llama-3-8B-Instruct-GGUF
I tested the model above and it works.
from gpt4all.
No. In that case, I think you need to lower the maximum number of tokens to generate, or press stop manually.
from gpt4all.
It works for me, but there is an issue: after the first answer, the end of the answer does not seem to be detected, and the CPU stays at 100%.
A second question then goes unanswered. If you stop the model's answer before it finishes, it works.
from gpt4all.
Yes, very much hoping for Llama 3 in GPT4All!
from gpt4all.
Thank you, very helpful! I'm on an M1 MacBook Air with 16 GB of RAM.
I downloaded this model: https://huggingface.co/lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF/blob/main/Meta-Llama-3-8B-Instruct-Q5_K_M.gguf
and put it in the ~/Library/Application\ Support/nomic.ai/GPT4All/ directory.
It's working great at about 4.1 to 4.4 tokens per second, with my RAM full and about 7.5 GB of swap in use.
from gpt4all.
Maybe I spoke too soon: in another test it kept talking and didn't stop until I told it to. So maybe I need to set the correct stop token too.
from gpt4all.