TD-Gammon is a computer backgammon program developed in 1992 by Gerald Tesauro at IBM. It uses an artificial neural net trained by temporal-difference learning to achieve a level of play just below that of the top human players. It also led to advances in the theory of correct backgammon play.
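To make the idea of temporal-difference learning concrete, here is a minimal sketch. It is not Tesauro's implementation: it swaps the neural network for a plain table of values and swaps backgammon for a toy five-state random walk, where stepping off the right end pays 1 and stepping off the left end pays 0. The function name `td0_random_walk` and all of its parameters are assumptions made for this illustration. After each step, the estimate for the current state is nudged toward the reward plus the estimate for the next state.

```python
import random


def td0_random_walk(episodes=20000, alpha=0.01, gamma=1.0, n_states=5, seed=0):
    """Tabular TD(0) value estimation on a toy random walk (illustrative only)."""
    rng = random.Random(seed)
    # Initial value guesses for the non-terminal states.
    values = [0.5] * n_states
    for _ in range(episodes):
        state = n_states // 2  # every episode starts in the middle state
        while True:
            next_state = state + rng.choice((-1, 1))
            if next_state < 0:
                # Fell off the left end: terminal, reward 0.
                reward, next_value, done = 0.0, 0.0, True
            elif next_state >= n_states:
                # Fell off the right end: terminal, reward 1.
                reward, next_value, done = 1.0, 0.0, True
            else:
                reward, next_value, done = 0.0, values[next_state], False
            # TD(0) update: move the estimate toward the one-step bootstrapped target.
            td_error = reward + gamma * next_value - values[state]
            values[state] += alpha * td_error
            if done:
                break
            state = next_state
    return values


if __name__ == "__main__":
    print(td0_random_walk())
```

In this toy problem the true value of state i (counting from 1) is i/6, so the learned estimates should settle near 1/6, 2/6, and so on up to 5/6. The same update rule, applied to the output of a neural network rather than a table entry, is the general idea behind training an evaluation function by temporal-difference learning.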
UC Berkeley
Fall 2008
This advanced course focuses on applications of machine learning in robotics and control. It covers a wide range of topics, including Markov decision processes, control theory, estimation methods, and robotics principles. Recommended for graduate students.