fbpx
Understanding the Temporal Difference Learning and its Predication 
The temporal difference learning algorithm was introduced by Richard S. Sutton in 1988.  The reason the temporal difference learning method became popular was that it combined the advantages of dynamic programming and the Monte Carlo method. But what are those advantages?  This article is an excerpt from the... Read more