policy improvement
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars #WhereLearningNeverStops I recently presented a session on the “AlphaZero with Monte Carlo Tree Search” algorithm at the CellStrat AI Lab. AlphaZero was developed by Google DeepMind and builds on AlphaGo, which in 2016 mastered the game of Go and beat Lee Sedol, an 18-time world champion. Go is an ancient Chinese abstract strategy […]
This post discusses temporal difference (TD) methods used in Reinforcement Learning (RL). It contrasts TD methods with Monte Carlo (MC) methods and dynamic programming. You need a thorough understanding of Markov Decision Processes (MDPs) to follow this post. Prediction and Control: In general, RL methods have two components: 1) Prediction / Evaluation: where […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars We had fantastic presentations on advanced Deep Learning concepts at last Saturday's AI Lab. Reinforcement Learning (RL) with Dynamic Programming: First, Shubha M. started with a superb session on RL with Dynamic Programming. Dynamic Programming is the technique of breaking a problem into subproblems, solving them and then combining the […]