Implement Batching for input and output groups (i.e. Domain localization and observation localization) #377

odunbar · 2024-05-16T18:30:10Z

Purpose

To reduce computational complextity via blocking the dependence of parameters and data. The blocking would break and update of all parameters into several small updates of partitions of the parameter space. All this would take place within each single iteration of EKP.

Example

Prototype a user interface to pass

parameter partitioning (via e.g. names of the parameter), NB we may wish to enforce that no two parameters can be in different groups
data selection (not necessarily a partition, as data can be reused to update different parameter groups parameters)
maybe using different EK process/ localizer/ inflation etc. for these different update stages (all within the same iteration)

Simultaneously we should be thinking about how this interface could be used within a setting of mini-batching, where different partial observations are used in different EKP iterations

Discussed implementation

Create an array of structs defining a conditionally dependent parameters and , passed into EKP constructor update_groups = in addition to current arguments

struct UpdateGroup {VV <: AbstractVector}
    u_subset::VV 
    y_subset::VV
    # process::Process # in future
    # localizer::Localizer # in future
    # inflation::Inflation # in future
end

check all u_subsets form a partition of 1:p
if not provided create one UpdateGroup with u_subsets and y_subsets being the full indices
In the EnsembleKalmanProcess.jl update_ensemble!(...) function, Replace

u = update_ensemble!(ekp,...)

with

u = zeros(param_size,ens_size)
for group in update_groups # parallelizable!
    idx =group.u_subset
    u[idx, :] = update_ensemble!(ekp, group, ...) 
end

Possible hurdles

move push!(g,...) out of inner update_ensemble! call to be outside
where processes use priors, e.g. UKI/EKS (e.g. need to subset process.cov)
UKI generating initial ensemble
do we calculate the parameter dimension from a fixed quantity anywhere? or is it all deduced from passed in values

Example problem

Non-timescale separated coupled system with data

The text was updated successfully, but these errors were encountered:

odunbar mentioned this issue May 16, 2024

O3.7.5 Formulate and implement variant of EKI that scales linearly in output dimension and allows for sampling error correction #374

Open

odunbar assigned odunbar and eviatarbach May 16, 2024

odunbar linked a pull request May 23, 2024 that will close this issue

[WIP] update groups, i.e. domain and observation localization #380

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Batching for input and output groups (i.e. Domain localization and observation localization) #377

Implement Batching for input and output groups (i.e. Domain localization and observation localization) #377

odunbar commented May 16, 2024 •

edited

Loading

Implement Batching for input and output groups (i.e. Domain localization and observation localization) #377

Implement Batching for input and output groups (i.e. Domain localization and observation localization) #377

Comments

odunbar commented May 16, 2024 • edited Loading

Purpose

Example

Discussed implementation

Possible hurdles

Example problem

odunbar commented May 16, 2024 •

edited

Loading