FFT using pyvkfft and use loopy callables #114

Merged: 67 commits merged into inducer:main on Aug 1, 2022

Conversation

isuruf (Collaborator) commented May 2, 2022

isuruf marked this pull request as draft May 2, 2022 03:43

isuruf (Collaborator, Author) commented May 25, 2022

@inducer, this is ready for another look

sumpy/fmm.py Outdated
Comment on lines 809 to 811
evt_fft = self.run_opencl_fft(queue,
    preprocessed_source_mpoles_view,
    inverse=False, wait_for=wait_for)
Owner

Convert to out-of-place to be nice to our future, lazy-eval selves?

sumpy/fmm.py Outdated
Comment on lines 809 to 811
evt_fft = self.run_opencl_fft(queue,
    preprocessed_source_mpoles_view,
    inverse=False, wait_for=wait_for)
Owner

Suggested change
- evt_fft = self.run_opencl_fft(queue,
-     preprocessed_source_mpoles_view,
-     inverse=False, wait_for=wait_for)
+ preprocessed_source_mpoles_view, evt_fft = self.run_opencl_fft(queue,
+     preprocessed_source_mpoles_view,
+     inverse=False, wait_for=wait_for)
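
For readers unfamiliar with the out-of-place convention being suggested here, a minimal sketch follows. It does not use pyvkfft or pyopencl: `run_fft_out_of_place` and `DummyEvent` are hypothetical stand-ins, and the transform is a naive pure-Python DFT. The only point is the calling shape, where the function returns a fresh result together with an event and never mutates its input.

```python
import cmath


class DummyEvent:
    """Hypothetical stand-in for an OpenCL event (not a pyopencl class)."""

    def wait(self):
        # Nothing is asynchronous in this sketch, so waiting is a no-op.
        pass


def run_fft_out_of_place(input_vec, inverse=False, wait_for=None):
    """Naive O(n^2) DFT returning (new_output, event); input_vec is untouched."""
    for evt in wait_for or []:
        evt.wait()
    n = len(input_vec)
    sign = 1 if inverse else -1
    output = []
    for k in range(n):
        acc = 0j
        for j, x in enumerate(input_vec):
            acc += x * cmath.exp(sign * 2j * cmath.pi * j * k / n)
        # The inverse transform carries the 1/n normalization.
        output.append(acc / n if inverse else acc)
    return output, DummyEvent()


source = [1.0, 2.0, 3.0, 4.0]
# The caller rebinds a name to the returned result instead of relying on mutation.
transformed, evt_fft = run_fft_out_of_place(source, inverse=False)
roundtrip, evt_ifft = run_fft_out_of_place(
    transformed, inverse=True, wait_for=[evt_fft])
```

Because the input is never overwritten, a lazily evaluated caller can treat the call as a pure function of its arguments, which appears to be the motivation behind the review comment.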

sumpy/fmm.py Outdated
Comment on lines 546 to 547
def get_opencl_fft_app(self, queue, shape, dtype):
    return get_opencl_fft_app(queue, shape, dtype)
Owner

Suggested change
- def get_opencl_fft_app(self, queue, shape, dtype):
-     return get_opencl_fft_app(queue, shape, dtype)
+ def get_opencl_fft_app(self, shape, dtype):
+     with cl.CommandQueue(self.cl_context) as queue:
+         return get_opencl_fft_app(queue, shape, dtype)
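
The design choice in this suggestion (build the FFT plan from a short-lived queue derived from the stored context, so callers no longer pass a queue) can be sketched without OpenCL. Everything below is hypothetical: `FakeContext`, `FakeQueue`, `build_fft_app`, and `Wrangler` are illustrative stand-ins, not sumpy, pyopencl, or pyvkfft APIs.

```python
class FakeContext:
    """Hypothetical stand-in for an OpenCL context."""


class FakeQueue:
    """Hypothetical stand-in for a command queue usable as a context manager."""

    def __init__(self, context):
        self.context = context
        self.finished = False

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        # Mimic draining pending work when the with-block exits.
        self.finished = True
        return False


def build_fft_app(queue, shape, dtype):
    """Hypothetical planner: records what a real FFT plan would be built from."""
    return {"shape": tuple(shape), "dtype": dtype, "context": queue.context}


class Wrangler:
    def __init__(self, context):
        self.cl_context = context

    def get_fft_app(self, shape, dtype):
        # The queue exists only while the plan is being built.
        with FakeQueue(self.cl_context) as queue:
            return build_fft_app(queue, shape, dtype)


ctx = FakeContext()
app = Wrangler(ctx).get_fft_app((4, 8), "complex128")
```

With this shape, the plan depends only on the context, shape, and dtype, so it is not tied to whichever queue a particular caller happens to hold.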

sumpy/fmm.py Outdated (resolved)

isuruf (Collaborator, Author) commented Jun 22, 2022

Using the private API from pyvkfft, this passes.

inducer (Owner) commented Jun 23, 2022

Ready for review from your perspective? Or should we wait?

isuruf (Collaborator, Author) commented Jun 23, 2022

Ready for review

sumpy/tools.py Outdated (resolved)
sumpy/fmm.py Outdated
Comment on lines 556 to 558
def run_opencl_fft(self, queue, input_vec, inverse, wait_for, inplace):
    app = self.tree_indep.opencl_fft_app(input_vec.shape, input_vec.dtype,
        inplace)
Owner

Ditch inplace?

inducer (Owner) left a comment

🎉 LGTM (after inplace is gone)

inducer merged commit 9849af5 into inducer:main Aug 1, 2022

inducer (Owner) commented Aug 1, 2022

Thanks!
