Hi, as far as I can see, there is no support for real-valued input a


Support for NFFTs of real-valued arrays (torchkbnufft issue, open, 6 comments)

mmuckley commented on May 18, 2024

Support for NFFTs of real valued Arrays


Comments (6)

roflmaostc commented on May 18, 2024

What is the estimated overhead in your case?
Is the interpolator really that slow? Ordinary cubic interpolation is much faster, so I wonder what is so different here.
Is it any better in other libraries?

I observed the following.

Does that match your experience? The NUFFT runs along the last two dims, with a leading batch dim.

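import torch

# Batch of 100 single-channel 768x768 images, made complex by adding 0j
voxels = torch.zeros(100, 1, 768, 768, device="cuda") + 0j
# 300 angles (prepare_nufft is a helper that is not shown in this thread)
nufft, adjnufft, ktraj = prepare_nufft(voxels, 300, numpoints=3, gridfactor=1.25, device="cuda")

%%timeit
torch.fft.fft2(voxels + 0j)
torch.cuda.synchronize()  # wait for the GPU so the timing is meaningful
# 9ms

%%timeit
nufft(voxels + 0j, ktraj)
torch.cuda.synchronize()
# 61ms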

In that case, an RFFT would really only save about 4 to 5 ms (maybe a little more because of the 1.25x padding).
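To put the two timings above in proportion, here is a rough back-of-the-envelope calculation. It assumes the 9 ms `fft2` measurement is representative of the FFT stage inside the NUFFT (ignoring the 1.25x oversampled grid) and that a real-input FFT costs about half a complex one. Both are simplifying assumptions, not measurements.

```python
fft_ms = 9.0     # measured torch.fft.fft2 time from above
nufft_ms = 61.0  # measured NUFFT time from above

# Share of the NUFFT time that the FFT alone does not explain
# (interpolation and everything else)
non_fft_share = (nufft_ms - fft_ms) / nufft_ms

# Assumption: an RFFT costs roughly half of a complex FFT
rfft_saving_ms = fft_ms / 2
best_case_speedup = nufft_ms / (nufft_ms - rfft_saving_ms)

print(f"non-FFT share of NUFFT time: {non_fft_share:.0%}")
print(f"RFFT saving: {rfft_saving_ms:.1f} ms, best-case speedup: {best_case_speedup:.2f}x")
```

Under these assumptions, roughly 85% of the NUFFT time lies outside the FFT, and swapping in an RFFT would speed up the whole call by less than 10%.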


mmuckley commented on May 18, 2024

Hello @roflmaostc, it wasn't a priority for me since MRI data is always complex. Today I'm unable to implement it due to time constraints, but I could possibly consider a PR if someone else did it.


roflmaostc commented on May 18, 2024

I can try to help, since I would love that feature! Maybe it could be selected with another parameter?
Doing that implicitly might be dangerous. For example, fftmethod="fft" or fftmethod="rfft".

What is required?

Would that be here?

torchkbnufft/torchkbnufft/_nufft/fft.py

Line 9 in 20c45a2

def fft_fn(image: Tensor, ndim: int, normalized: bool = False) -> Tensor:

There are probably more steps to do for the interpolation?
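A minimal sketch of what such an opt-in switch could look like, written as a standalone function rather than as a patch to torchkbnufft. The name `fft_fn_sketch` and the `fftmethod` parameter are hypothetical, and the normalization handling is an assumption, not the library's actual code. It covers only the FFT step; the interpolation would still have to cope with the half-spectrum output.

```python
import torch
from torch import Tensor


def fft_fn_sketch(image: Tensor, ndim: int, normalized: bool = False,
                  fftmethod: str = "fft") -> Tensor:
    """Hypothetical FFT step with an explicit real-input option."""
    norm = "ortho" if normalized else "backward"
    dims = tuple(range(-ndim, 0))  # transform the last `ndim` dimensions
    if fftmethod == "fft":
        return torch.fft.fftn(image, dim=dims, norm=norm)
    if fftmethod == "rfft":
        if image.is_complex():
            raise ValueError("fftmethod='rfft' requires a real-valued input")
        # Only the non-negative frequencies along the last dim are returned
        return torch.fft.rfftn(image, dim=dims, norm=norm)
    raise ValueError(f"unknown fftmethod: {fftmethod!r}")


x = torch.rand(2, 1, 8, 8)
full = fft_fn_sketch(x, ndim=2)
half = fft_fn_sketch(x, ndim=2, fftmethod="rfft")
print(full.shape, half.shape)  # last dim shrinks from 8 to 8 // 2 + 1 = 5
```

Making the choice explicit means a complex input can never be silently truncated to its real part.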


mmuckley commented on May 18, 2024

I think the interpolator could be a little bit involved. First you'd want to check that it has no imaginary values (I don't remember). Then there might be a question of reflecting the results to the Hermitian-symmetric part of Fourier space so that the k-nearest neighbors algorithm works correctly.

After doing these things I'm actually guessing there would be no speedup, because the interpolator is the bottleneck anyway. Maybe a deeper look into the NUFFT algorithm to consider real inputs would be necessary to get an efficient implementation. I don't think it's as simple as changing the forward FFT op.
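The reflection step can be illustrated on a plain grid, assuming a 2D real image and PyTorch's `rfft2` layout (only non-negative frequencies along the last dimension). This sketch rebuilds the missing half-plane from the conjugate symmetry F[-k1, -k2] = conj(F[k1, k2]). It says nothing about how torchkbnufft's interpolator would consume the result.

```python
import torch

x = torch.rand(6, 8, dtype=torch.float64)  # real-valued test image
half = torch.fft.rfft2(x)                  # shape (6, 5): half spectrum
full_ref = torch.fft.fft2(x)               # shape (6, 8): reference full spectrum

n1, n2 = x.shape
full = torch.zeros(n1, n2, dtype=half.dtype)
full[:, : n2 // 2 + 1] = half              # columns that rfft2 already provides

# Each missing column k2 is the conjugate of column (n2 - k2),
# read with the row index negated modulo n1.
k1 = torch.arange(n1)
for k2 in range(n2 // 2 + 1, n2):
    full[:, k2] = torch.conj(half[(-k1) % n1, n2 - k2])

assert torch.allclose(full, full_ref)
```

Note that rebuilding the full grid like this gives back most of the memory and bandwidth saved by the RFFT, which is consistent with the expectation above that there may be no net speedup.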


mmuckley commented on May 18, 2024

85% overhead seems a little high but could be possible for some trajectories. It's actually trajectory-dependent. If you only have one radial spoke then it's possible the FFT takes longer.


roflmaostc commented on May 18, 2024

I actually used 300 radial spokes of length 768 each.

And a batch size of maybe 100 at most.

What should be the shape of ktraj in this case?
In my case it was (2, 768*300).

Tbh, in the documentation I was confused about coils, etc.

Should I rearrange the spokes along another dim for more parallelization?
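For reference, a sketch of how a radial trajectory with that shape can be built. It assumes, as I understand the torchkbnufft documentation, that the trajectory is shaped `(ndim, klength)` in radians/voxel within [-pi, pi], and that images are shaped `(batch, coil, height, width)`, so non-MRI data would use a coil dimension of 1. The helper name `radial_ktraj` and the angle spacing are illustrative and not taken from the thread.

```python
import math

import torch


def radial_ktraj(n_spokes: int, spoke_len: int) -> torch.Tensor:
    """Illustrative radial trajectory of shape (2, n_spokes * spoke_len)."""
    # Sample positions along one spoke, spanning [-pi, pi) radians/voxel
    radius = torch.linspace(-math.pi, math.pi, spoke_len + 1)[:-1]
    # Assumption: angles evenly spaced over a half circle, since each
    # spoke already covers both directions through the origin
    angles = torch.arange(n_spokes) * (math.pi / n_spokes)
    kx = radius[None, :] * torch.cos(angles)[:, None]  # (n_spokes, spoke_len)
    ky = radius[None, :] * torch.sin(angles)[:, None]
    # Flatten the spokes into one long list of k-space points
    return torch.stack((kx.reshape(-1), ky.reshape(-1)), dim=0)


ktraj = radial_ktraj(n_spokes=300, spoke_len=768)
print(ktraj.shape)  # torch.Size([2, 230400]), i.e. (2, 768*300)
```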

