CUDA: Add overloads generated by specialization to the current dispatcher. #9106
base: main
Conversation
Dispatcher specialization takes actual arguments, not types. So although these tests did technically exercise the specialization mechanism, the way they did so was very odd: a kernel would never actually be launched with a type as an argument.
Adding overloads to the current dispatcher when specializing saves recreating the overload on subsequent calls to the unspecialized dispatcher that would have used it. This also has the side effect of making overloads available after a call to `ForAll()` on a dispatcher.
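For illustration, a minimal sketch of the behaviour described above (the kernel name `add_one` is hypothetical, and a CUDA-capable device is assumed). Note that `specialize()` is passed concrete arguments, not type signatures such as `float32[:]`:

```python
import numpy as np
from numba import cuda

@cuda.jit
def add_one(x):
    i = cuda.grid(1)
    if i < x.size:
        x[i] += 1

arg = np.zeros(16, dtype=np.float32)

# specialize() takes actual arguments; their Numba types key the overload.
specialized = add_one.specialize(arg)

# With this PR, the compiled overload is also registered on the original
# dispatcher, so a later launch of `add_one` with matching argument types
# reuses it rather than recompiling.
add_one[1, 16](arg)
```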
gpuci run tests

gpuci run tests
The PR still works with my cuDF example.
Thanks for the patch; a couple of minor suggestions re: variable naming, otherwise looks good.
f_f32a_f32a = f.specialize(f_arg, f_arg)
self.assertEqual(len(f.overloads), 1)
self.assertIs(f_f32a_f32a.overloads[f_arg_ty, f_arg_ty],
Suggested change:
f_f32f_f32f = f.specialize(f_arg, f_arg)
self.assertEqual(len(f.overloads), 1)
self.assertIs(f_f32f_f32f.overloads[f_arg_ty, f_arg_ty],
Suggest naming scheme consistency with the F-ordered input.
f_f32a_f32a = f.specialize(float32[:], float32[:])
# 'F' order specialization
f_arg = np.zeros((2, 2), order='F')
f_f32a_f32a = f.specialize(f_arg, f_arg)
Suggested change:
f_f32f_f32f = f.specialize(f_arg, f_arg)
Suggestion to match the 'F' ordering.
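For context on why the naming matters: array layout is part of the Numba type, so C- and F-ordered arguments produce distinct overload keys. A hedged sketch (the kernel `f` is hypothetical, and a CUDA-capable device is assumed):

```python
import numpy as np
from numba import cuda

@cuda.jit
def f(x, y):
    # The body is irrelevant here; only the argument types matter.
    pass

c_arg = np.zeros((2, 2), dtype=np.float32, order='C')
f_arg = np.zeros((2, 2), dtype=np.float32, order='F')

f_c = f.specialize(c_arg, c_arg)  # typed as array(float32, 2d, C)
f_f = f.specialize(f_arg, f_arg)  # typed as array(float32, 2d, F)

# Different type keys give different specializations, so both can be
# held by the dispatcher at once, hence the f_f32f_f32f naming suggestion.
assert f_c is not f_f
```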
This pull request is marked as stale as it has had no activity in the past 3 months. Please respond to this comment if you're still interested in working on this. Many thanks!

Still planning to finish this when I get a moment.

This pull request is marked as stale as it has had no activity in the past 3 months. Please respond to this comment if you're still interested in working on this. Many thanks!
Missed this due to being on PTO; reopening it to keep it relevant (to get round to it one day).
This provides an alternative to the implementation in #9057, which also aimed to make overloads available after a `ForAll()` call, but did so by changing the way `ForAll()` worked, potentially increasing latency for calls on specialized dispatchers (of which there are some use cases in cuDF).

Whilst looking into this, I also noticed that all the CUDA dispatcher specialize tests worked in an "unexpected" way and didn't provide a good pattern to follow for the test added in this PR, so they are all fixed up in this PR too.