Git Product home page Git Product logo

Comments (9)

Jack-Ye avatar Jack-Ye commented on May 23, 2024 4

"12GiB"改成"8GiB" 可以在4070ti 12GB的显卡上跑起来, 就是回答需要5分钟

from moss.

stevezhang88 avatar stevezhang88 commented on May 23, 2024 1

我使用load in 8 bit, 成功加载模型。运行速度也很快。比你这个方法的速度要快。基本上秒出。 我是3090, 24G , 单卡单机。

from moss.

stevezhang88 avatar stevezhang88 commented on May 23, 2024

人才啊。 GPU不够的地方用CPU来补充吗?

from moss.

lwh9346 avatar lwh9346 commented on May 23, 2024

我使用load in 8 bit, 成功加载模型。运行速度也很快。比你这个方法的速度要快。基本上秒出。 我是3090, 24G , 单卡单机。

不知道load_checkpoint_and_dispatchload_in_8bit能不能一起用?如果可以的话就可以在更低显存的设备上运行,在中等显存的的机器上避免内存带宽限制导致的性能下降了。

from moss.

licongguan avatar licongguan commented on May 23, 2024

我使用load in 8 bit, 成功加载模型。运行速度也很快。比你这个方法的速度要快。基本上秒出。 我是3090, 24G , 单卡单机。

请问如何修改代码?

from moss.

stevezhang88 avatar stevezhang88 commented on May 23, 2024

我使用load in 8 bit, 成功加载模型。运行速度也很快。比你这个方法的速度要快。基本上秒出。 我是3090, 24G , 单卡单机。

请问如何修改代码?

#38

from moss.

wktdwktd avatar wktdwktd commented on May 23, 2024

我买的阿里云gpu服务器,30GiB显存,回答都很慢 十几秒,你们怎么忍受的?
max_memory={0: "30GiB", "cpu": "60GiB"}

from moss.

PangXitong avatar PangXitong commented on May 23, 2024

请问您用的是windows系统吗,您能否将您更改后的moss_cli_demo.py发送过来,谢谢!

from moss.

wanglaiqi avatar wanglaiqi commented on May 23, 2024

尝试使用load_in_8bit 加载 int4的模型,在NVIDIA GeForce RTX 3090 24G一块卡上运行很慢,生成一篇600字的文章要4minute

from moss.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.