Complete redesign of "Observations" object enabling introduction of minibatching #384

odunbar · 2024-06-17T20:31:32Z

Purpose

Closes #382
Closes #383
Closes #385
Improves some of the getters in #386

Summarized by the docs here
and the API docs here

Content

Remove the unnecessary Module wrapping Observations object. And Replaces the old and largely useless Observation object
Create new Observation ObservationSeries and Minibatcher object. that can store y's Gamma's and minibatching framework to batch up epochs over them.
EKP now always internally stores an ObservationSeries rather than y and \Gamma separately.
Observations are now accessed using the get functions that pull the stacked-y and blocked-\Gamma from the ObservationSeries object for the current minibatch
update_minibatch!( is called at the end of each EKP step, updating the batch. At the end of the epoch, this also calls create_new_epoch!( for the minibatcher to create a new epoch of minibatches
compatible with EKI, ETKI, EKS and UKI - as Observation stores the inverse obs noise cov too.
learning rate schedulers compatible with minibatching
Back-compatible with old interface passing in y and Gamma to EKP
changes examples that used the old Observation object
unit tests for all new structs
added a docs page with small examples
API & docstrings
resolves the ETKI bug with the timestepper, and ensured scaling preserved in new update

I have read and checked the items on the review checklist.

removed Observations Module format Redesign of Observation Observation tests and format tests for minibatchers ObservationSeries tested build=true default for get_obs and get_obs_noise_cov interface for EKP add some more convenience functions for ObservationSeries test no_minibatching setup updated examples with MB UKI constructor remove build-bug where obs_noise_cov append flattens array typo format add vec typo added Dict to construct ObservationSeries, and added == operations added storage of observation inverses

…s now

docs/src/observations.md

src/EnsembleKalmanInversion.jl

src/EnsembleKalmanProcess.jl

src/EnsembleTransformKalmanInversion.jl

…scaling

eviatarbach

This is excellent, thank you for all the work on this!

I made a few comments in the code. Besides these comments, I was wondering why you removed the multiple samples of y in Cloudy_example_eki.jl and aerosol_activation.jl? It seems like it would be useful to have an example with multiple samples, especially now that they can be handled better.

docs/src/observations.md

src/EnsembleKalmanProcess.jl

eviatarbach · 2024-07-24T23:18:41Z

src/EnsembleTransformKalmanInversion.jl

    X = FT.((u .- mean(u, dims = 2)) / sqrt(m - 1))
    Y = FT.((g .- mean(g, dims = 2)) / sqrt(m - 1))
-    Ω = inv(I + Y' * Γ_inv * Y)
-    w = FT.(Ω * Y' * Γ_inv * (y .- mean(g, dims = 2)))
+    tmp = get_buffer(get_process(ekp)) # the buffer stores Y' * Γ_inv of [size(Y,2),size(Y,1)]


This whole section of code is quite difficult to read and verbose. Any way it can be simplified?

+ 1 . If it can't be simplified, some additional comments would also be helpful (and commented out code snippets removed)

I managed to allocate all the buffers up front, then I could remove the logic for the computation, I think it looks much cleaner now!

src/EnsembleKalmanProcess.jl

costachris · 2024-07-26T04:16:31Z

src/EnsembleTransformKalmanInversion.jl

    X = FT.((u .- mean(u, dims = 2)) / sqrt(m - 1))
    Y = FT.((g .- mean(g, dims = 2)) / sqrt(m - 1))
-    Ω = inv(I + Y' * Γ_inv * Y)
-    w = FT.(Ω * Y' * Γ_inv * (y .- mean(g, dims = 2)))
+    tmp = get_buffer(get_process(ekp)) # the buffer stores Y' * Γ_inv of [size(Y,2),size(Y,1)]


+ 1 . If it can't be simplified, some additional comments would also be helpful (and commented out code snippets removed)

src/LearningRateSchedulers.jl

costachris

Thanks for addressing the comments - Looks good to me

odunbar · 2024-07-30T23:26:30Z

@eviatarbach In response to your comment on the examples, before we were not actually using more than one sample even though we gathered many, so what was happening in those examples did not really make sense anyway. My replacement does not effect the functionality. However you are right - we could include the multiple samples in examples in future, but perhaps this can be left to a future PR?

eviatarbach · 2024-07-30T23:39:56Z

LGTM! Thank you.

odunbar changed the title ~~Clean-up "Observations" object and introducing minibatching~~ [WIP] Clean-up "Observations" object and introducing minibatching Jun 17, 2024

odunbar force-pushed the orad/minibatch branch from 1b4889f to 586c48a Compare June 25, 2024 20:37

odunbar added 4 commits June 25, 2024 13:58

remove arg from etki constructor

1c02b0e

commas to semicolons

50c018b

performance improvements and bug-fix, works well with new Observation…

5c3e30c

…s now

add comment

2358431

odunbar changed the title ~~[WIP] Clean-up "Observations" object and introducing minibatching~~ [WIP] Complete redesign of "Observations" object enabling introduction of minibatching Jun 26, 2024

odunbar added 13 commits June 25, 2024 20:16

format

de2a431

adds average update time

73bdd0d

new docs page

57cbd6d

example block fixed

1c493bb

new blocks

68507bf

new blocks

a5a137f

add kwargs...

0b7276d

format

81952ff

new API docstrings

460e797

format api

58a5b03

reduce test time, and convert all UniformScalings to Diagonals

cc33614

format

a2e293d

docs subtitle and move up observations

c13462a

odunbar changed the title ~~[WIP] Complete redesign of "Observations" object enabling introduction of minibatching~~ Complete redesign of "Observations" object enabling introduction of minibatching Jul 11, 2024

odunbar requested review from eviatarbach and costachris July 11, 2024 09:37

costachris reviewed Jul 15, 2024

View reviewed changes

docs/src/observations.md Outdated Show resolved Hide resolved

src/EnsembleKalmanInversion.jl Outdated Show resolved Hide resolved

src/EnsembleKalmanProcess.jl Outdated Show resolved Hide resolved

src/EnsembleTransformKalmanInversion.jl Show resolved Hide resolved

odunbar added 5 commits July 15, 2024 11:47

rm "noise"

c74716c

clearer N_obs in constructor

f1d90f2

note on identifiers

c8983b4

compatible learning rate schedulers

c124e3d

remove build of inverse covariance to accelerate ETKI back to linear …

09962fd

…scaling

odunbar added 2 commits July 17, 2024 13:39

add timer check to ETKI

8069dbe

format

d769eff

eviatarbach requested changes Jul 24, 2024

View reviewed changes

costachris reviewed Jul 26, 2024

View reviewed changes

odunbar added 4 commits July 26, 2024 11:26

another constructor option

a225c6b

addresses review comments

8e23b32

cleaned up ETKI look, by allocating buffers first and removing logic

e3a591c

add comment

d3d2a69

odunbar requested review from eviatarbach and costachris July 26, 2024 19:31

odunbar and others added 3 commits July 26, 2024 12:33

typo docs

c33c673

neaten format

9556020

Merge branch 'main' into orad/minibatch

3327533

costachris approved these changes Jul 30, 2024

View reviewed changes

eviatarbach approved these changes Jul 30, 2024

View reviewed changes

odunbar merged commit dab5aff into main Jul 31, 2024
11 of 12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complete redesign of "Observations" object enabling introduction of minibatching #384

Complete redesign of "Observations" object enabling introduction of minibatching #384

odunbar commented Jun 17, 2024 •

edited

Loading

eviatarbach left a comment

eviatarbach Jul 24, 2024

costachris Jul 26, 2024

odunbar Jul 26, 2024 •

edited

Loading

costachris Jul 26, 2024

costachris left a comment

odunbar commented Jul 30, 2024 •

edited

Loading

eviatarbach commented Jul 30, 2024

Complete redesign of "Observations" object enabling introduction of minibatching #384

Complete redesign of "Observations" object enabling introduction of minibatching #384

Conversation

odunbar commented Jun 17, 2024 • edited Loading

Purpose

Content

eviatarbach left a comment

Choose a reason for hiding this comment

eviatarbach Jul 24, 2024

Choose a reason for hiding this comment

costachris Jul 26, 2024

Choose a reason for hiding this comment

odunbar Jul 26, 2024 • edited Loading

Choose a reason for hiding this comment

costachris Jul 26, 2024

Choose a reason for hiding this comment

costachris left a comment

Choose a reason for hiding this comment

odunbar commented Jul 30, 2024 • edited Loading

eviatarbach commented Jul 30, 2024

odunbar commented Jun 17, 2024 •

edited

Loading

odunbar Jul 26, 2024 •

edited

Loading

odunbar commented Jul 30, 2024 •

edited

Loading