Git Product home page Git Product logo

Comments (4)

ys-2020 avatar ys-2020 commented on July 2, 2024

Hi @Tracin! Thanks a lot for bringing the issue to our attention. We have fixed the problem in #8 , please have a look! Once again, thank you for your interest in AWQ.

from llm-awq.

Tracin avatar Tracin commented on July 2, 2024

Hi @ys-2020! Very appreciated for your MR for this. It is very impressive both the AWQ method and this fast kernel.
However I am afraid there is still something wrong in it. When I continue doubling the M up to 2048 * 32, I met a CUDA error: an illegal memory access was encountered.
I believe you understand it is 32 batchsize and 2k sentence length which is quite often used for LLM inference.

from llm-awq.

ys-2020 avatar ys-2020 commented on July 2, 2024

Hi @Tracin , we have reproduced the illegal memory access issue and are trying to fix it. Please stay tuned for our updates. Thanks!

from llm-awq.

jamesdborin avatar jamesdborin commented on July 2, 2024

I got this error today as well for long generation. Do you know what the issue is, and if there is a patch / fix we can do locally?

Thanks!

from llm-awq.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.