Comments (1)
Thanks for your issue. This issue will be fixed by this PR.
from ipex-llm.
Related Issues (20)
- [MTL][Internvl2-4B] GPU OOM for 3k input tokens HOT 1
- Please provide a method to benchmark Multimodal InternVL-4B on MTL‘s iGPU HOT 6
- Ollama already occupying port before running ./ollama serve HOT 4
- minicpm-v-2-6 can't run on A770 Ubuntu HOT 4
- MiniCPM-V-2.6 load_low_bit fails HOT 4
- GPU memory continue increase when in Deepspeed TP benchmark HOT 3
- ollama runs gemma:2b, asks a question, does not answer, and reports no error. HOT 2
- failure load the Qwen2-72B-Instruct with FP6 on 4 ARC GPU HOT 1
- Failure to load the LLM model in vLLM on 8 ARC HOT 4
- New model support request
- Result is wrong when running Qwen2-1.5B-Instruct on Intel NPU HOT 3
- `Qwen/Qwen2-7B-Instruct` gives garbled outputs in LongBench with `load_in_low_bit="fp16"` and `optimize_model=False`
- MiniCPM-V-2_6 load_low_bit mode.chat fails on MTL iGPU HOT 1
- Issue running throughput test with vllm on 4 Arc A770: "Current platform can NOT allocate memory block with size larger than 4GB! " HOT 2
- Inference produced repetitive and erroneous output by a fintuned qwen2 model HOT 4
- Running benchmark/all-in-one with GLM-4-9B-Chat model report "AutoTP not support for models" HOT 1
- [Qwen2-Audio-7B-Instruct] model support HOT 1
- Error: 'Not enough data for satisfy transfer length header.' during vllm serving on Arc HOT 2
- Qwen1.5-14B-Chat 支持问题,返回空 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ipex-llm.