Comments (5)
FYI: q2_k is not working in my tests. I get a bad_alloc crash or "unexpectedly reached end of file" (with Extended Virtual Addressing enabled). q4_0 works fine.
I am also trying to integrate llama.cpp into my iOS apps (latest commit), and the problem looks the same as the one here.
You may have compiled llmfarm_core with -DGGML_QKK_64 in Package.swift. You need to rebuild the project without this flag, or requantize the models for QKK_64.
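As a rough illustration of where such a flag might sit, here is a minimal Package.swift sketch. The package name, target name, and path are hypothetical placeholders, and the real llmfarm_core manifest may pass the flag differently (for example through unsafeFlags), so treat this as an assumption rather than the project's actual configuration. In ggml, defining GGML_QKK_64 switches the k-quant super-block size from the default 256 to 64, which is why models quantized with the default setting would fail to load in such a build.

```swift
// swift-tools-version:5.5
import PackageDescription

// Hypothetical manifest: "ExampleCore", "ExampleCoreCpp", and the path are
// placeholders, not names taken from the real llmfarm_core package.
let package = Package(
    name: "ExampleCore",
    targets: [
        .target(
            name: "ExampleCoreCpp",
            path: "Sources/ExampleCoreCpp",
            cSettings: [
                .define("NDEBUG"),
                // With this define active, ggml uses 64-element k-quant
                // super-blocks, so q2_k files made with the default 256
                // will not load. Leave it out to use standard k-quant models.
                // .define("GGML_QKK_64"),
            ]
        )
    ]
)
```

After changing the manifest, a clean rebuild is needed so the C sources are recompiled without the define.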
from llmfarm.
if someone makes a ggml version
These models work fine.
FYI: q2_k is not working in my tests. I get a bad_alloc crash or "unexpectedly reached end of file" (with Extended Virtual Addressing enabled). q4_0 works fine.
I am also trying to integrate llama.cpp into my iOS apps (latest commit), and the problem looks the same as the one here.
FYI: q2_k is not working in my tests. I get a bad_alloc crash or "unexpectedly reached end of file" (with Extended Virtual Addressing enabled). q4_0 works fine.
I am also trying to integrate llama.cpp into my iOS apps (latest commit), and the problem looks the same as the one here.
In the current version of llmfarm, I use this commit. q2_k works fine on iPhone (without Metal) and Mac.