Discrete Fully-Probabilistic Design (D-FPD)

Introduction and related publications

D-FPD is a numerical/discrete version of an algorithm from [1]: this latter algorithm can tackle both discrete and continuous control problems by finding an analytical solution. D-FPD instead finds a purely numerical solution. As the original algorithm, the numerical/discrete implementation of D-FPD can be used to compute policies from examples for constrained, possibly stochastic/nonlinear, systems.

Refer to the accompanying paper for more information:

E. Ferrentino, P. Chiacchio, G. Russo, "Discrete fully probabilistic design: a tool to design control policies from examples". 2021. E-print URL: https://arxiv.org/abs/2112.11210.

The code at a glance

This MATLAB repo is composed of two classes of scripts:

Proof-of-concept: set of scripts to demonstrate how D-FPD works and compare it with the continuous counterpart
Inverted pendulum example: set of scripts to demonstrate the effectiveness of D-FPD in generating a data-driven control policy on an inverted pendulum

Executing proof-of-concept scripts

Open up MATLAB and make the repo main folder the MATLAB current directory. To run the proof-of-concept (D-FPD on a made-up example), run

demo_discrete_fpd_base

Brief description of the other scripts:

demo_continuous_fpd provides a continuous implementation of FPD, meaning that state/input domains are continuous;
discrete_continuous_comparison performs a comparison between discrete and continuous FPD using data files generated from the scripts above. The user user might need to configure some parameters in the script before executing it.

Executing the pendulum example

The pendulum example is divided into four phases

Data generation
Probabilistic model generation
D-FPD optimization
Probabilistic policy validation

Most of the scripts for this use case are contained in the example folder.

Data generation

Data are generated with a model-based controller. The controller is given a time-varying trajectory reference bringing the pendulums to the unstable equilibrum state with randomized time parametrizations.

The actuated pendulum is made noisy through the introduction of a Gaussian noise acting at acceleration level.

Generate trajectories by running the script

demo_noisy_pendulum_data_generation

A data file is generated in the data folder. If you do not want to re-generate data files, you can use those already available in this repo.

Probabilistic model generation

The data files above can be used to generate a probabilistic model:

demo_probabilistic_model_generation

The script generates a data file containing the state evolution models and the reference's randomized control law. The output data file will also contain information about the discretization of states and input. If you do not want to re-generate this data file, you can use that already available in this repo.

D-FPD optimization

The probabilistic model can be used to generate an optimal control policy throguh D-FPD:

demo_dfpd_2states_1input

Being application independent, the script above is located in the top folder of the repo.

The script generates the randomized control law for the target system in a data file in the results folder. If you do not want to re-generate this data file, you can use that already available in this repo. You can analyze these results by launching the script

dfpd_2states_1input_results_analysis

Probabilistic policy validation

The control policy can be loaded in a data-driven controller acting on the target system through the script

demo_noisy_pendulum_validation

The script above will also launch the simulation showing the pendulum evolution subject to the probabilistic control policy.

Authors and contributors

Enrico Ferrentino (author)

References

[1] Davide Gagliardi and Giovanni Russo. On a probabilistic approach to synthesize control policies from example datasets. Automatica, (in press), 2021. URL: https://arxiv.org/pdf/2005.11191.pdf.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
example		example
results		results
.gitignore		.gitignore
LICENSE		LICENSE
Readme.md		Readme.md
compute_alpha.m		compute_alpha.m
compute_beta.m		compute_beta.m
compute_dkl.m		compute_dkl.m
compute_domain_from_pdf.m		compute_domain_from_pdf.m
compute_optimal_policy.m		compute_optimal_policy.m
compute_probabilities.m		compute_probabilities.m
compute_variance.m		compute_variance.m
demo_continuous_fpd.m		demo_continuous_fpd.m
demo_dfpd_2states_1input.m		demo_dfpd_2states_1input.m
demo_discrete_fpd_base.m		demo_discrete_fpd_base.m
dfpd_2states_1input_results_analysis.m		dfpd_2states_1input_results_analysis.m
discrete_continuous_comparison.m		discrete_continuous_comparison.m
is_within_threshold.m		is_within_threshold.m
non_linear_constraints.m		non_linear_constraints.m
normal_distribution.m		normal_distribution.m
objective_function.m		objective_function.m
paper.pdf		paper.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Discrete Fully-Probabilistic Design (D-FPD)

Introduction and related publications

The code at a glance

Executing proof-of-concept scripts

Executing the pendulum example

Data generation

Probabilistic model generation

D-FPD optimization

Probabilistic policy validation

Authors and contributors

References

About

Releases

Packages

Languages

License

unisa-acg/discrete-fpd

Folders and files

Latest commit

History

Repository files navigation

Discrete Fully-Probabilistic Design (D-FPD)

Introduction and related publications

The code at a glance

Executing proof-of-concept scripts

Executing the pendulum example

Data generation

Probabilistic model generation

D-FPD optimization

Probabilistic policy validation

Authors and contributors

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages