Decrease 131072 by 131071</

I tacked on a quick fix with <a class="issue-link js-issue-link" data-error-text="Fail

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

[BUG] Matmul gives wrong output for large sizes about mlx HOT 4 CLOSED

awni commented on July 28, 2024

[BUG] Matmul gives wrong output for large sizes

from mlx.

Comments (4)

jagrit06 commented on July 28, 2024 1

I tacked on a quick fix with #1058

from mlx.

awni commented on July 28, 2024

@jagrit06 this seems that we are overflowing an integer index into the output as it starts to break in the 2B range. INT_MAX is on the small side for the largest output we can support though.

Anything we can do to support larger sizes?

If not, we should put some throws in the ops as these are sneaky to debug.

from mlx.

jagrit06 commented on July 28, 2024

This particular case is simple since what happens is when we try to compute auto batch_size_out = out.size() / (M *N);, the int M and N multiple to overflow and then the batch_size_out comes out to 0
The simple fix here is do that in size_t and I can make a couple other changes to make sure we can handle the large shapes

The only things I'm wondering about is if batch_size_out >= UINT32_MAX, then we will need to launch multiple matmul kernels since the grid dims can only be uint

from mlx.

awni commented on July 28, 2024

The only things I'm wondering about is if batch_size_out >= UINT32_MAX, then we will need to launch multiple matmul kernels since the grid dims can only be uint

That seems like a much more rare case.

from mlx.

[BUG] Matmul gives wrong output for large sizes about mlx HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent