FFT using pyvkfft and use loopy callables #114
@inducer, this is ready for another look.
sumpy/fmm.py
Outdated
evt_fft = self.run_opencl_fft(queue,
    preprocessed_source_mpoles_view,
    inverse=False, wait_for=wait_for)
Convert to out-of-place to be nice to our future, lazy-eval selves?
sumpy/fmm.py
Outdated
evt_fft = self.run_opencl_fft(queue,
    preprocessed_source_mpoles_view,
    inverse=False, wait_for=wait_for)
Suggested change:
-evt_fft = self.run_opencl_fft(queue,
-    preprocessed_source_mpoles_view,
-    inverse=False, wait_for=wait_for)
+preprocessed_source_mpoles_view, evt_fft = self.run_opencl_fft(queue,
+    preprocessed_source_mpoles_view,
+    inverse=False, wait_for=wait_for)
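For illustration only, here is a minimal host-side sketch of the out-of-place calling convention the suggestion above moves toward: the function returns a new array together with an event, and the caller rebinds the name instead of relying on the input being overwritten. It uses a naive DFT from the standard library rather than pyvkfft or OpenCL, and `dft` and `run_fft_out_of_place` are hypothetical names, not part of sumpy.

```python
import cmath


def dft(values, inverse=False):
    """Naive O(n^2) discrete Fourier transform that returns a new list."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(values[j] * cmath.exp(sign * 2j * cmath.pi * j * k / n)
                  for j in range(n))
        # The inverse transform carries the 1/n normalization.
        out.append(acc / n if inverse else acc)
    return out


def run_fft_out_of_place(input_vec, inverse=False):
    """Hypothetical stand-in for run_opencl_fft: returns (result, event).

    The input is never modified. This sketch runs synchronously, so the
    "event" is simply None.
    """
    return dft(input_vec, inverse=inverse), None


source = [complex(i, 0) for i in range(8)]
original = list(source)

# The caller rebinds the name to the returned array, as in the suggestion.
transformed, evt_fft = run_fft_out_of_place(source, inverse=False)
roundtrip, _ = run_fft_out_of_place(transformed, inverse=True)
```

Because nothing is mutated, `source` still holds its original values after the call, which is the property that tends to make a function easier to use from a lazily evaluated array framework.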
sumpy/fmm.py
Outdated
def get_opencl_fft_app(self, queue, shape, dtype):
    return get_opencl_fft_app(queue, shape, dtype)
Suggested change:
-def get_opencl_fft_app(self, queue, shape, dtype):
-    return get_opencl_fft_app(queue, shape, dtype)
+def get_opencl_fft_app(self, shape, dtype):
+    with cl.CommandQueue(self.cl_context) as queue:
+        return get_opencl_fft_app(queue, shape, dtype)
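With the queue dropped from the signature, the app depends only on `shape` and `dtype`. If the app is meant to be reused across calls (an assumption; the thread does not say so), those two arguments could serve as a memoization key. A rough sketch using only the standard library, where `get_fft_plan` and the dictionary it returns are hypothetical placeholders for a real FFT app:

```python
from functools import lru_cache

build_count = 0  # counts how often a plan is actually built


@lru_cache(maxsize=None)
def get_fft_plan(shape, dtype_name):
    """Hypothetical plan factory keyed only by shape and dtype name."""
    global build_count
    build_count += 1
    # A real implementation would construct the FFT app here.
    return {"shape": shape, "dtype": dtype_name}


plan_a = get_fft_plan((4, 16), "complex64")
plan_b = get_fft_plan((4, 16), "complex64")  # served from the cache
plan_c = get_fft_plan((4, 32), "complex64")  # new shape, so a new plan
```

The key has to be hashable, which is why the shape is passed as a tuple and the dtype as a string in this sketch.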
Using the private API from pyvkfft, this passes.
Ready for review from your perspective? Or should we wait?
Ready for review.
sumpy/fmm.py
Outdated
def run_opencl_fft(self, queue, input_vec, inverse, wait_for, inplace):
    app = self.tree_indep.opencl_fft_app(input_vec.shape, input_vec.dtype,
        inplace)
Ditch `inplace`?
🎉 LGTM (after `inplace` is gone)
Thanks!
- [ ] Needs vincefn/pyvkfft#17 ("Allow separate queue for fft and ifft") and a patched conda package