Comments (3)
When I ask a question it is very slow; it takes forever to write one sentence. How can I make it faster? By the way, I'm using Vicuna 7B to keep it lightweight, and I'm on macOS with an M2 chip, and that doesn't even help :( So can I host gpt-llama.cpp on Render? If so, when I run
sh ./scripts/test-installation.sh
what should I put for the port and the location of the model file, since I would be using Render to host the model to make it faster?
Follow-up: if I use Render, for example, and on my PC or somewhere else I run sh ./scripts/test-installation.sh
and it asks me for the port I'm running on, how am I going to get this to work, since Render is URL-based? Should it be web-based, or should I host the backend/model, and if so, where?
from gpt-llama.cpp.
Try using mlock; that has historically helped me when I've had memory issues.
Also, lowering the thread count sometimes helps, because too many threads can oversaturate the CPU, or perhaps the work lands on a slower worker thread.
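To try both suggestions (mlock and a lower thread count) directly against llama.cpp, something like the sketch below might help. It only builds and prints a command line rather than running it. The model path is a placeholder, and using the Apple Silicon performance-core count as the starting thread count is my own assumption, not something confirmed in this thread. `-m`, `-t`, `--mlock`, `-n` and `-p` are flags of llama.cpp's `main` example.

```shell
#!/bin/sh
# Sketch: build a llama.cpp command that locks the model in RAM (--mlock)
# and uses a reduced thread count (-t). Nothing is executed here; the
# command is only printed so it can be inspected first.

# Placeholder model path (hypothetical); point this at your own file.
MODEL="${MODEL:-./models/vicuna-7b.bin}"

# Pick a starting thread count.
# Assumption: on Apple Silicon the performance-core count is a sensible start.
pick_threads() {
  n=$(sysctl -n hw.perflevel0.physicalcpu 2>/dev/null) ||
    n=$(getconf _NPROCESSORS_ONLN 2>/dev/null) ||
    n=4
  echo "${n:-4}"
}

# Allow a manual override, e.g. THREADS=4 sh this-script.sh
THREADS="${THREADS:-$(pick_threads)}"

build_cmd() {
  echo "./main -m $MODEL -t $THREADS --mlock -n 64 -p \"Hello\""
}

build_cmd
```

If the printed command is still slow when run from the llama.cpp directory, try it again with a smaller `THREADS` value and compare the timings.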