Comments (7)
I have the same problem; does anyone know how to solve it? 🌹
from lm-evaluation-harness.
Batching is handled internally by vLLM, so it shouldn't matter much. Have you tried reducing gpu_memory_utilization? If you aren't using data parallel, you could also try upgrading to the latest vLLM release if you're still on 0.3.2.
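As a minimal sketch of applying that suggestion, assuming the harness's vllm backend reads gpu_memory_utilization from the comma-separated --model_args string (the helper function and model name below are hypothetical, for illustration only):

```python
# Hypothetical helper that assembles the --model_args string passed to
# lm_eval's vLLM backend. Lowering gpu_memory_utilization below vLLM's
# default of 0.9 leaves headroom on the GPU and can avoid OOM.

def build_model_args(pretrained: str, gpu_memory_utilization: float = 0.8) -> str:
    """Return a comma-separated key=value string in --model_args format."""
    return f"pretrained={pretrained},gpu_memory_utilization={gpu_memory_utilization}"

# e.g. lm_eval --model vllm --model_args "<this string>" --tasks ...
print(build_model_args("meta-llama/Llama-2-7b-hf", 0.8))
```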
Could you explain how data_parallel works? I don't seem to see it in vLLM's documentation.
We create multiple vLLM models across different devices so each can process a subset of the data concurrently. It still needs updating for vLLM > 0.3.2, but the latest vLLM versions should work without it.
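A minimal sketch of the data-parallel idea described above (not the actual lm-evaluation-harness implementation): the request list is split into one shard per device, and each worker would run its own vLLM engine on its assigned GPU. Device assignment here is simulated so the data flow is visible.

```python
# Sketch of data parallelism: shard the workload, one shard per device.
# In the real setup each shard would go to a separate process that creates
# its own vLLM engine pinned to one GPU; here we process shards in a loop.

def shard_requests(requests, num_devices):
    """Distribute requests round-robin into num_devices shards."""
    shards = [[] for _ in range(num_devices)]
    for i, req in enumerate(requests):
        shards[i % num_devices].append(req)
    return shards

def run_data_parallel(requests, num_devices):
    """Tag each request with the (simulated) device that would handle it."""
    results = []
    for device_id, shard in enumerate(shard_requests(requests, num_devices)):
        # Real code: spawn a worker for `shard` on CUDA device `device_id`.
        results.extend((device_id, req) for req in shard)
    return results
```

The round-robin split keeps shard sizes within one request of each other, so no single device becomes a stranger bottleneck than necessary.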
Thanks for the clarification. Also, why would we want to lower gpu_memory_utilization? I thought that setting it close to 1 means it reserves almost no memory, leaving more available for inference. Is that not true?
I think it's the opposite 😅. Setting it lower helps with OOM most of the time.
My bad, you're absolutely right 🧎‍♀️. Let me try these new settings and see if that helps. Thanks a lot!