Comments (4)
Hi, @tjtanaa I wonder how the roadmap is going on. I quite excited to use AWQ quantized format, when can it be supported?
@HAN-oQo Hi, vLLM authors said they are working on more efficient AWQ implementation on triton. So, we will address the AWQ on ROCm after they have released their new kernel.
from vllm-rocm.
Thank you for answer! @tjtanaa I also wonder why safetensor format is not supported, and do you have a plan to support it!
Thank you for offering the nice project.
@HAN-oQo The loading of safetensors is buggy on ROCm platform. The memory management during loading of safetensors might be causing the issue on ROCm platform. It often encounters this issue when tensor-parallelism is larger than 1; however, loading from pt
is totally fine.
from vllm-rocm.
Hi, @tjtanaa
I wonder how the roadmap is going on.
I quite excited to use AWQ quantized format, when can it be supported?
from vllm-rocm.
Thank you for answer! @tjtanaa
I also wonder why safetensor format is not supported, and do you have a plan to support it!
Thank you for offering the nice project.
from vllm-rocm.
Related Issues (13)
- Conflicts version of PyTorch on ROCm
- ImportError: cannot import name 'cuda_utils' from partially initialized module 'vllm' HOT 2
- AssertionError assert output == other_output HOT 3
- benchmark-latncy test bug??? HOT 2
- Merging with vLLM main branch HOT 2
- Compatible GPU architectures HOT 3
- Model architectures ['MixtralForCausalLM'] are not supported for now
- Unable to load models on RX 6800 HOT 2
- vllm 0.1.4 with ROCm 5.6
- vLLM >= 0.2.4 with ROCm 5.6.1
- [Installation]: Is Branch v0.4.0.post1-rocm available for rocm-5.7? HOT 1
- [Feature]: vllm 0.4.1 in ROCM
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vllm-rocm.