Add optional stream/queue argument for all device math routines #1100

njansson · 2024-01-05T17:49:12Z

This adds an optional stream/queue argument to all device math routines, so that they can be run in a stream of the caller's choosing.

timofeymukha · 2024-02-04T16:08:59Z

With this kind of stuff, one really wishes one had CI on the device. I didn't check every function, of course, and this kind of refactoring is somewhat prone to copy-paste mistakes. Did you run some examples on CUDA and HIP to check that things still work?

njansson · 2024-02-04T16:11:43Z

Very soon we will have proper accelerator-based CI runners 🦫

Add optional stream/queue argument for all device math routines

5362e4b

njansson added the GPU, NVIDIA (NVIDIA GPUs and CUDA), AMD (AMD GPUs and HIP), OpenCL (OpenCL backend), and refactor labels Jan 5, 2024

timofeymukha approved these changes Feb 4, 2024

njansson requested a review from MartinKarp March 20, 2024 08:10

njansson self-assigned this Apr 4, 2024
