Some CATKE performance optimizations #3453
Draft
simone-silvestri wants to merge 148 commits into main from ss/optimize-catke
ee1b34e
full interior map
simone-silvestri 31ba1d6
bugfix
simone-silvestri 1403161
bugfix
simone-silvestri 7d76203
bugfixes
simone-silvestri b5d6a42
hmmm
simone-silvestri ee62bec
disambiguate
simone-silvestri e880b2c
some organizing
simone-silvestri 6f4aaad
hmmm
simone-silvestri f0b59a3
improve speed
simone-silvestri 5dddbb9
now we get going
simone-silvestri 90b0f7a
check it out
simone-silvestri 6ce5b45
check bathymetry
simone-silvestri 2875984
fixit
simone-silvestri 6cd7444
rmove distributed
simone-silvestri 8319dbc
test it like this
simone-silvestri b78b042
I hope it works!
simone-silvestri 11e8143
bugfix
simone-silvestri 2a2f772
bugfix
simone-silvestri d91cd93
bugfix
simone-silvestri 8eaf808
bugfix
simone-silvestri 023e404
couple of bugfixes
simone-silvestri 81e70f6
bugfix
simone-silvestri 7ab646b
bugfixes
simone-silvestri a6ab840
changes
simone-silvestri 6c96131
try like this
simone-silvestri 8057909
some tests...
simone-silvestri 183ea90
show the coordinate
simone-silvestri 05593e2
bugfix
simone-silvestri 44dbee0
bugfix
simone-silvestri 8c81e15
test this hypothesis
simone-silvestri 3e88fb4
another test
simone-silvestri d74f5f5
bugfix
simone-silvestri 25a1dbb
other bugfix
simone-silvestri 188eedc
now we'll see...
simone-silvestri 7d418e0
now it will work hopefully
simone-silvestri 83b4d5b
all bugs fixed?
simone-silvestri 7a0df19
bugfix
simone-silvestri 2626579
remove the shows
simone-silvestri 95e90e7
unroll the loop
simone-silvestri 4b1f2cd
fully unrolled
simone-silvestri 20a12d1
split explicit loop unrolling
simone-silvestri 58e7acb
update
simone-silvestri 217e3af
annotations
simone-silvestri 8cf6453
using NVTX
simone-silvestri 3cc1468
add NVTX
simone-silvestri 1c1ff63
bugfix
simone-silvestri e0bedee
bugfix
simone-silvestri b2f92dd
utils
simone-silvestri 00458ab
try like this
simone-silvestri 08a86b5
text like this
simone-silvestri e402f5c
remove reduced fields
simone-silvestri 6cf89bc
small test
simone-silvestri 3769873
small change
simone-silvestri d60b643
nvtx on fill halos
simone-silvestri 1b0a440
all NVTX
simone-silvestri 6f9d400
fill it all
simone-silvestri ea5e56b
check it out
simone-silvestri 47dd569
bugfixxed
simone-silvestri 62dad92
bugfixed
simone-silvestri 9d5ada2
bugfix
simone-silvestri 76bfb5e
annotate the convert
simone-silvestri 3f645ce
bugfix
simone-silvestri 324aaef
bugfix
simone-silvestri 67df158
add cudaconvert
simone-silvestri 74d3bad
remove NVTX
simone-silvestri 955d2c1
model grid
simone-silvestri 15f60f7
try like this?
simone-silvestri 3c8e34f
bugfix
simone-silvestri 246c6d9
fix
simone-silvestri 837a119
should work?
simone-silvestri 148a2c8
add here
simone-silvestri 0cf5c77
add here
simone-silvestri d1f4f83
bugfix
simone-silvestri 41a0857
back to how it was
simone-silvestri ee97dde
try it like this maybe?
simone-silvestri 6f5e6b7
convert
simone-silvestri acd1a54
fixxing
simone-silvestri ca73268
try it now?
simone-silvestri 2d8ae26
bugfix
simone-silvestri 5341b71
add distributed
simone-silvestri 1ce6a5a
bugfix
simone-silvestri 53055d2
allow unrolling
simone-silvestri 4185152
convert in archs
simone-silvestri 0aa5b10
bugfix
simone-silvestri f282bbe
Merge branch 'main' into ss/no-immersed-cells2
simone-silvestri 7428374
just for testing
simone-silvestri ec911ce
Merge branch 'ss/no-immersed-cells2' of github.com:CliMA/Oceananigans…
simone-silvestri b48d00c
removed useless particles
simone-silvestri 4d36cc4
removed bacthed stuff
simone-silvestri 8842d05
tracer advetion type
simone-silvestri 7656643
Merge branch 'main' into ss/no-immersed-cells2
simone-silvestri 3454818
Merge branch 'main' into ss/no-immersed-cells2
navidcy 47ab44b
bugfix
simone-silvestri 881bdb5
bugfix
simone-silvestri f79a056
other bugfix
simone-silvestri c3a21a4
other small bugfix
simone-silvestri 782f247
first bugfix
simone-silvestri 7b92c64
correct error
simone-silvestri 5e6dcb9
some bugfixes
simone-silvestri 056def5
bugfix
simone-silvestri 630f0fa
slightly more optim
simone-silvestri e70a57d
simplifying more
simone-silvestri c1c3101
all tests should be ok
simone-silvestri ff66175
try it
simone-silvestri 469224b
correct for last time
simone-silvestri d09e5fe
try again
simone-silvestri 73f8b09
fixed
simone-silvestri 69b9b98
tests fixxed
simone-silvestri b42b115
finally tests fixed
simone-silvestri dcffb79
back to previous dt
simone-silvestri 9801ec0
bugfix
simone-silvestri c64f404
tests fixed?
simone-silvestri 359a083
ale
simone-silvestri f34a0e0
Merge remote-tracking branch 'origin/main' into ss/no-immersed-cells2
simone-silvestri 72f286e
Update src/Models/HydrostaticFreeSurfaceModels/update_hydrostatic_fre…
simone-silvestri 46ef24c
Update src/TimeSteppers/quasi_adams_bashforth_2.jl
simone-silvestri 6b74e7a
removed NVTX
simone-silvestri 950606f
Merge branch 'ss/no-immersed-cells2' of github.com:CliMA/Oceananigans…
simone-silvestri 2061300
remove one line
simone-silvestri f46f7a9
if inside
simone-silvestri a276111
better comment
simone-silvestri 4fedc37
some docstrings
simone-silvestri 8f342fa
remove NVTX
simone-silvestri b7c871a
test an hypothesis
simone-silvestri 7ff259d
test it now
simone-silvestri f3fa448
optimization
simone-silvestri 0f946df
bugfixes
simone-silvestri 27f1a28
bugfix
simone-silvestri 0c18521
bugfixxes
simone-silvestri e91a0c9
adapt
simone-silvestri ce9e49e
Merge branch 'ss/optimize-catke' of github.com:CliMA/Oceananigans.jl …
simone-silvestri 5fc071c
clipping to zero
simone-silvestri 3ef7984
shear is at faces in z
simone-silvestri b7efdd5
code alignment
navidcy 882706f
add capitalization
simone-silvestri 62c5621
Merge branch 'ss/optimize-catke' of github.com:CliMA/Oceananigans.jl …
simone-silvestri e27728f
Update src/Advection/tracer_advection_operators.jl
simone-silvestri dacd623
adding a minimum dissipation length scale
simone-silvestri de87ac4
Merge branch 'ss/optimize-catke' of github.com:CliMA/Oceananigans.jl …
simone-silvestri f316157
remove zero clipping
simone-silvestri a394670
conditional for `ϵ == Inf`
simone-silvestri ac08ab5
bugfix
simone-silvestri b13c728
should preserve positivity
simone-silvestri 9832f00
better comment
simone-silvestri 93a4932
correct the sign of implicit w'b'
simone-silvestri 3e5a0d9
bugfix
simone-silvestri 5958cb6
back to fully implicit dissipation
simone-silvestri 8ffbff0
Merge remote-tracking branch 'origin/main' into ss/optimize-catke
simone-silvestri
What does this stuff have to do with CATKE performance optimization?
That is code from another PR, #3404, which we have to merge before this one.