Add examples highlighting the learning process with state-of-the-art RL algorithms #51

Open
wallscheid opened this issue Jul 10, 2020 · 2 comments

@wallscheid
Contributor

Building on the available expert controller design examples (PI-based inner current/voltage control plus droop control for power sharing), it would be very interesting to highlight the shortcomings and advantages of applying state-of-the-art RL algorithms as a replacement for the expert-based controllers.

  • The task is therefore to implement a contemporary, model-free, completely data-driven RL control approach for continuous states and actions that learns an optimal control policy from scratch (e.g. DDPG).
  • As with the implementation of the expert controllers, the RL implementation should first be applied to the simplest problem (single inverter current control) and from there be extended to further, more complex tasks.
  • It can be assumed that safety-critical states (e.g. overcurrent) will occur during training of the RL controller, and perhaps even after it has converged to a local optimum. An essential subject of investigation should therefore be monitoring how often, and in which particular situations, the RL controller causes unsafe system states.
  • Finally, the converged/learned RL controller should be compared to the previous examples based on standard control approaches, using the usual performance metrics (integrated mean-squared control error, mean absolute control error, ...).
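
The monitoring and comparison described in the last two points could be sketched roughly as follows. This is only an illustrative outline in plain Python, not code from this repository: the names `EpisodeReport`, `evaluate_episode` and `current_limit` are hypothetical, the overcurrent check is assumed to be a simple magnitude threshold, and the squared error is averaged over samples rather than integrated over time.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class EpisodeReport:
    """Summary of one logged episode (hypothetical helper type)."""
    mse: float           # mean squared control error
    mae: float           # mean absolute control error
    unsafe_steps: int    # samples whose current magnitude exceeds the limit
    unsafe_ratio: float  # unsafe_steps divided by the number of samples


def evaluate_episode(reference: Sequence[float],
                     measured: Sequence[float],
                     current_limit: float) -> EpisodeReport:
    """Compute control-error metrics and count overcurrent samples.

    Assumption: `reference` and `measured` are equally long logs of the
    current setpoint and the measured current, sampled at a fixed rate.
    """
    if len(reference) != len(measured) or len(reference) == 0:
        raise ValueError("reference and measured must be non-empty and equally long")

    errors = [r - m for r, m in zip(reference, measured)]
    n = len(errors)

    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n

    # Assumed safety criterion: |i| above a fixed limit counts as unsafe.
    unsafe_steps = sum(1 for m in measured if abs(m) > current_limit)

    return EpisodeReport(mse, mae, unsafe_steps, unsafe_steps / n)


# Toy log: a step response that overshoots the limit once.
report = evaluate_episode(
    reference=[10.0, 10.0, 10.0, 10.0],
    measured=[0.0, 8.0, 12.0, 10.0],
    current_limit=11.0,
)
print(report)
```

Running the same function on logs from the PI/droop examples and from the trained RL agent would give directly comparable numbers, and calling it once per training episode would show how the unsafe-state count evolves during learning.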
@wallscheid
Contributor Author

Proposal: Use available, off-the-shelf RL toolboxes (PyTorch -> Stable-Baselines3, TensorFlow -> Tensorforce / Keras-RL2).

stheid pushed a commit that referenced this issue Oct 23, 2020
@stheid

stheid commented Nov 2, 2020

blocked by #101

Webbah pushed a commit that referenced this issue Dec 3, 2020
Webbah pushed a commit that referenced this issue Dec 18, 2020
stheid pushed a commit that referenced this issue Dec 18, 2020
Webbah pushed a commit that referenced this issue Jan 14, 2021
Webbah pushed a commit that referenced this issue Jan 18, 2021
Webbah pushed a commit that referenced this issue Jan 20, 2021
Webbah pushed a commit that referenced this issue Jan 26, 2021
Webbah pushed a commit that referenced this issue Feb 5, 2021
Webbah pushed a commit that referenced this issue Apr 7, 2021
Webbah pushed a commit that referenced this issue Apr 16, 2021
Webbah pushed a commit that referenced this issue Apr 27, 2021
Webbah pushed a commit that referenced this issue Apr 28, 2021
Webbah pushed a commit that referenced this issue May 17, 2021
Webbah pushed a commit that referenced this issue May 19, 2021