Git Product home page Git Product logo

Comments (18)

williamchai avatar williamchai commented on August 16, 2024 1

did the update solve the problem?

Yes! Tested in 0.5.2 it doesn't crash anymore! Thank you!

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

iPhone 13 Pro iOS 16.3.1 Orca mini 3B downloaded from readme link

Tried different params, use metal or not, always crash. Creating a new chat for each input works, but 2nd input in same chat will crash.

Thanks for the report.
Could you clarify?
Do you have the same issue with other models?
Does the error occur immediately during the sending of the message or with a delay?

from llmfarm.

williamchai avatar williamchai commented on August 16, 2024

iPhone 13 Pro iOS 16.3.1 Orca mini 3B downloaded from readme link
Tried different params, use metal or not, always crash. Creating a new chat for each input works, but 2nd input in same chat will crash.

Thanks for the report. Could you clarify? Do you have the same issue with other models? Does the error occur immediately during the sending of the message or with a delay?

I also tried vicuna-7b-q4 but it's super slow not usable (for 1st reply).
The error occur immediately after I input and click send icon

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

iPhone 13 Pro iOS 16.3.1 Orca mini 3B downloaded from readme link
Tried different params, use metal or not, always crash. Creating a new chat for each input works, but 2nd input in same chat will crash.

Thanks for the report. Could you clarify? Do you have the same issue with other models? Does the error occur immediately during the sending of the message or with a delay?

I also tried vicuna-7b-q4 but it's super slow not usable (for 1st reply). The error occur immediately after I input and click send icon

I think I found what the problem is. If you want, you can write to me at [email protected], I will invite you to early testing in TestFlight.

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

I also tried vicuna-7b-q4 but it's super slow not usable (for 1st reply). The error occur immediately after I input and click send icon

q4 too heavy for iphone
I test llama-2-chat-qK_3_M with metal, on my iPhone 12 pro max work fine

from llmfarm.

williamchai avatar williamchai commented on August 16, 2024

I think I found what the problem is. If you want, you can write to me at [email protected], I will invite you to early testing in TestFlight.

Just sent you an email, thanks!

from llmfarm.

tstanek390 avatar tstanek390 commented on August 16, 2024

Hi there, I'm experiencing same kind of issue with LLM Farm App. Tried several LLama2 models, including the original one. Im on Macbook Pro M2 16 GB, OS Ventura, using llama.cpp q4_0 quantization. Im able to load a model, and tweak the settings, but in the moment I send first prompt in the chat, app crashes with no additional info, I reported the crash using apple interface. Thx for any help:)

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

Hi there, I'm experiencing same kind of issue with LLM Farm App. Tried several LLama2 models, including the original one. Im on Macbook Pro M2 16 GB, OS Ventura, using llama.cpp q4_0 quantization. Im able to load a model, and tweak the settings, but in the moment I send first prompt in the chat, app crashes with no additional info, I reported the crash using apple interface. Thx for any help:)

Could you send a link to the model?

from llmfarm.

tstanek390 avatar tstanek390 commented on August 16, 2024

Ofc, https://huggingface.co/llSourcell/medllama2_7b.

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

Ofc, https://huggingface.co/llSourcell/medllama2_7b.

Did you quantize the model yourself? What version of llama.cpp did you use for quantization?

from llmfarm.

tstanek390 avatar tstanek390 commented on August 16, 2024

Yes, I quantized it myself a couple of hours ago, using the latest version of llama.cpp from master brach of its GitHub repository, and "make" command after that.

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

Yes, I quantized it myself a couple of hours ago, using the latest version of llama.cpp from master brach of its GitHub repository, and "make" command after that.

The latest version of llama.cpp has changed the format to gguf. I will add support for it soon, but in the meantime, could you requantize the model with this version? https://github.com/ggerganov/llama.cpp/tree/dadbed99e65252d79f81101a392d0d6497b86caa

from llmfarm.

tstanek390 avatar tstanek390 commented on August 16, 2024

I've tried requantize with the version you mentioned, but same issue :/ i guess the problem is in sth different

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

I've tried requantize with the version you mentioned, but same issue :/ i guess the problem is in sth different

There is definitely another problem, I'm trying to understand what it is. But for the current version of llmfarm, you need to use the old ggjtv3 quantization.
Have you tried running models from this list? https://github.com/guinmoon/LLMFarm/blob/main/models.md

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

Yes, I quantized it myself a couple of hours ago, using the latest version of llama.cpp from master brach of its GitHub repository, and "make" command after that.

If you have an Intel Mac then there is a problem with metal. Try turning it off.

from llmfarm.

tstanek390 avatar tstanek390 commented on August 16, 2024

I have m2 silicone Mac, but haven't used your models yet. I will give it a try !
P.S I wanted to avoid the bloke's models cause its large size and I primarily aim for an iPhone app, not Mac OS.

EDIT Still the same issue with the provided model. Any other suggestions?

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

I have m2 silicone Mac, but haven't used your models yet. I will give it a try ! P.S I wanted to avoid the bloke's models cause its large size and I primarily aim for an iPhone app, not Mac OS.

EDIT Still the same issue with the provided model. Any other suggestions?

Unfortunately, I could not repeat this error on my device, so I can only guess.

from llmfarm.

guinmoon avatar guinmoon commented on August 16, 2024

iPhone 13 Pro iOS 16.3.1 Orca mini 3B downloaded from readme link

Tried different params, use metal or not, always crash. Creating a new chat for each input works, but 2nd input in same chat will crash.

I think I found what the problem was. Tell me, did the update solve the problem?

from llmfarm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.