Reinforcement learning series – getting the basics – part 3
In the previous part we’ve learned about the standard formulation setting of RL (MDP): https://g-stat.com/reinforcement-learning-series-getting-the-basics-part-2. This article would be about a basic method in RL - Monte-Carlo (MC). As stated in…