TD Learning
This post discusses temporal difference (TD) methods, used in Reinforcement Learning. It contrasts TD methods with Monte Carlo (MC) methods and dynamic programming. You need to have a thorough understanding of Markov Decision Process (MDP) to understand this post. Prediction and Control : In general, RL methods have two components 1) Prediction / Evaluation : where […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars #AlwaysUpskilling Reinforcement Learning (RL) refers to training agents with help of incentive-driven environments. RL typically involves a tuple of <state, action, reward> paradigm, which means that the agent has action choices to make in various states, and each action entails a potential reward. This also means that each state has a “value” […]
#CellStratAILab #disrupt4.0 #WeCreateAISuperstars The last meetup of Year 2019 at the CellStrat AI Lab saw incredible presentations by our AI Lab Members. First Sujith Kamath presented a superb session on Speech Recognition accompanied by a demo. Speech to Text Recognition is the buzz of the day. Though this area started 7 decades back, it has […]
Reinforcement Learning (RL) is set to be the next big thing in the world of AI and Machine Learning. RL models learn by accumulating rewards for designated actions in certain states. Essentially it is a (State, Action, Reward) optimization system. TD Learning is one type of RL algo which helps solve the problem when state […]