Add generic vector/matrix operations #1376

vbaconnet · 2024-07-18T10:35:35Z

This PR overloads some operators for the vector_t and matrix_t classes and adds math routines such as cadd2 and device_cadd2 (a(i) = b(i) + c) and device_add3 (a(i) = b(i) + c(i)).
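
To make the element-wise definitions above concrete, here is a minimal Python sketch of the two formulas. It only illustrates the arithmetic; the function signatures and argument order are assumptions for illustration and are not taken from the Neko sources.

```python
def cadd2(a, b, c, n):
    """Sketch of a(i) = b(i) + c: add the scalar c to each of the first n entries of b.

    The argument order (a, b, c, n) is an assumption made for this example.
    """
    for i in range(n):
        a[i] = b[i] + c


def add3(a, b, c, n):
    """Sketch of a(i) = b(i) + c(i): element-wise sum of the first n entries of b and c."""
    for i in range(n):
        a[i] = b[i] + c[i]


# Usage: the output list a must already hold n entries.
a = [0.0] * 3
cadd2(a, [1.0, 2.0, 3.0], 0.5, 3)
print(a)
add3(a, [1.0, 2.0, 3.0], [10.0, 20.0, 30.0], 3)
print(a)
```

Both routines write into an existing output array rather than returning a new one, which mirrors the in-place form of the formulas.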

This is to facilitate the use of vector_t and matrix_t, especially on GPUs. It should not break anything, since it only adds to what we have. The only thing I changed is the intent of some arguments in sub3 and add3. @njansson, will this be a problem?

Tested on CPUs and on NVIDIA and AMD GPUs. Not tested with OpenCL.

The assignment operator is the only operation that reallocates implicitly: for v = w, if v%x is already allocated, v is freed and re-initialized to have the same size as w. All other operations assume that, for v = a + b, an already allocated v has the same size as a and b.
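
The allocation rule above can be sketched in Python as follows. The Vector class and its method names are hypothetical stand-ins for vector_t and its overloaded operators. Two behaviours are assumptions of this sketch and are not stated in the description: raising an error on a size mismatch, and allocating v when it is not yet allocated.

```python
class Vector:
    """Hypothetical stand-in for vector_t; x is None until allocated."""

    def __init__(self, n=None):
        self.x = None if n is None else [0.0] * n

    def size(self):
        return 0 if self.x is None else len(self.x)

    def assign(self, w):
        # v = w: free any existing storage, then re-initialize to the size of w.
        self.x = None
        self.x = list(w.x)

    def set_sum(self, a, b):
        # v = a + b: no implicit reallocation when v is already allocated.
        if a.size() != b.size():
            raise ValueError("a and b must have the same size")
        if self.x is None:
            # Assumption: an unallocated v is given the size of the operands.
            self.x = [0.0] * a.size()
        elif self.size() != a.size():
            # Assumption: a mismatch is reported as an error in this sketch.
            raise ValueError("v is already allocated with a different size")
        for i in range(a.size()):
            self.x[i] = a.x[i] + b.x[i]


v, w = Vector(2), Vector(3)
w.x = [1.0, 2.0, 3.0]
v.assign(w)      # v is resized from 2 to 3 entries
a = Vector(3)
a.x = [1.0, 1.0, 1.0]
v.set_sum(a, w)  # sizes match, so v is reused as-is
print(v.x)
```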

vbaconnet · 2024-07-19T08:41:23Z

I'm not sure I understand what is wrong with the checks. If someone could take a look, that would be much appreciated :)

njansson · 2024-08-22T19:38:59Z

I would say we should merge this, and I can take it upon myself to fix the generic interface (and ensure correct inlining in key kernels).

vbaconnet added 20 commits July 17, 2024 16:17

edit imports

3501641

change intent

4029c1f

edit depends

06a5552

do the same for matrix

be454f4

doc

e523f0f

add device kernels for add3

8f915a5

fix bug

725e470

small changes

51d4702

fix bug

3ee0b7d

Merge branch 'develop' of github.com:ExtremeFLOW/neko into feature/vector_ops

54d5f3a

add cadd2

6eb5105

Merge branch 'feature/vector_ops' of github.com:ExtremeFLOW/neko into feature/vector_ops

a0ac6ee

add device cadd2

d3acb92

Reformat

d1af9dd

Add doc to device math

c2800f4

fix bug

f9b3ff5

fix deps

1d3919f

rewrite to not reallocate if already allocated

0529058

add cuda wrapper

0e500c6

fix doc

8a3b61f

vbaconnet added the enhancement, GPU, NVIDIA, AMD, and OpenCL labels Jul 18, 2024

vbaconnet added 5 commits July 18, 2024 12:40

check ncols and nrows in matrix

3190f5e

linter

52f2a8b

linter

53d66cd

reallocate for assignement

8dc9876

revert to matrix realloc in assign

87c8201

vbaconnet added 5 commits July 18, 2024 14:06

add unit tests for matrix

b239814

fix unit tests

02f76dd

fix unit test

0bf697c

add matrix_test

493e637

Merge branch 'develop' of github.com:ExtremeFLOW/neko into feature/vector_ops

72db1d8

vbaconnet requested review from timfelle and MartinKarp July 19, 2024 13:26

timofeymukha reviewed Jul 21, 2024

src/math/matrix.f90

src/math/vector.f90

src/math/vector.f90

tests/vector/vector_parallel.pf

vbaconnet added 2 commits July 21, 2024 21:45

fix tests and remove logic for allocation

deb0037

use intrinsic instead of copy

eeea319

njansson approved these changes Aug 22, 2024

njansson enabled auto-merge August 22, 2024 19:40

timfelle approved these changes Aug 23, 2024

njansson merged commit 768f423 into develop Aug 23, 2024
27 checks passed

njansson deleted the feature/vector_ops branch August 23, 2024 11:21
