Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wrong answer 2.1 #3

Open
mantasu opened this issue Apr 5, 2023 · 1 comment
Open

Wrong answer 2.1 #3

mantasu opened this issue Apr 5, 2023 · 1 comment

Comments

@mantasu
Copy link

mantasu commented Apr 5, 2023

If we have 2 actions and $\varepsilon=0.5$, the probability of selecting the greedy action is $0.5 + 0.25=0.75$. We add $0.25$ because we may select a random action with $\varepsilon=0.5$ which can be the greedy action itself. Since there are 2 actions, the "random" probability will be distributed equally $\frac{0.5}{2}=0.25$.

@AbhiDu96
Copy link

Correct me if my following reasoning is wrong, but I would like to elaborate on your answer.
With probability 0.5 you select a greedy action and with probability 0.5 you select an action at random from the set {greed, non-greedy}. Thus the probability of selecting a greedy action becomes:
0.5 (from selecting greedy action) + 0.5*0.5 (probability of choosing from the set * probability to choose a greedy action from the set) = 0.75
We add the two probabilities because the two events are exclusive.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants