This project simulates the environment of a cleaning robot and develops an adaptive learning algorithm for it. The agent uses a probabilistic model to estimate the state of the environment when making decisions. The project also compares how the move penalty affects the behavior of two agents.
Developed with my teammates Mustafa Yanar and Oruç Berat Turan.
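
To make the idea of estimating the environment state and comparing move penalties concrete, here is a minimal sketch. It is not the project's actual code: the two-cell world, the dirt arrival rate, the decision rule, and all names (`update_belief`, `run_agent`, `Result`) are assumptions chosen for illustration. Each agent keeps a belief (probability of dirt) for the cell it cannot see and moves only when that belief exceeds its move penalty.

```python
import random
from dataclasses import dataclass


@dataclass
class Result:
    cleaned: int   # number of dirty cells cleaned
    moves: int     # number of moves made
    score: float   # +1 per cleaned cell, minus move_penalty per move


def update_belief(belief: float, dirt_rate: float) -> float:
    """Probability that a cell is dirty after one more unobserved step.

    Assumes dirt appears independently with probability dirt_rate per step.
    """
    return belief + (1.0 - belief) * dirt_rate


def run_agent(move_penalty: float, steps: int = 200,
              dirt_rate: float = 0.1, seed: int = 0) -> Result:
    """Simulate one agent in a hypothetical two-cell world."""
    rng = random.Random(seed)   # same seed gives every agent the same dirt sequence
    dirty = [False, False]      # true (hidden) state of each cell
    belief = [0.0, 0.0]         # agent's estimate that each cell is dirty
    pos = 0
    cleaned = moves = 0
    score = 0.0

    for _ in range(steps):
        # Environment step: dirt may appear; the agent's belief grows accordingly.
        for i in range(2):
            if rng.random() < dirt_rate:
                dirty[i] = True
            belief[i] = update_belief(belief[i], dirt_rate)

        # The agent observes and cleans its own cell, so that belief resets to 0.
        if dirty[pos]:
            dirty[pos] = False
            cleaned += 1
            score += 1.0
        belief[pos] = 0.0

        # Decision: move only if the expected reward in the other cell
        # (its dirt probability) is larger than the cost of moving.
        other = 1 - pos
        if belief[other] > move_penalty:
            pos = other
            moves += 1
            score -= move_penalty

    return Result(cleaned, moves, score)


if __name__ == "__main__":
    # Compare two agents that differ only in their move penalty.
    for penalty in (0.2, 0.6):
        print(penalty, run_agent(move_penalty=penalty))
```

Because the dirt sequence depends only on the seed and not on the agent's actions, the two agents face identical environments, so any difference in their results comes from the move penalty alone.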