TD-Gammon

TD-Gammon

TD-Gammon is a computer backgammon program developed in 1992 by Gerald Tesauro at IBM. It uses an artificial neural net trained by temporal-difference learning to achieve a level of play just below that of the top human players. It also led to advances in the theory of correct backgammon play.

1 courses cover this concept

CS 294-40: Learning for robotics and control

UC Berkeley

Fall 2008

This advanced course focuses on the applications of machine learning in the robotics and control field. It covers a wide range of topics including Markov Decision Processes, control theories, estimation methodologies, and robotics principles. Recommended for graduate students.

No concepts data

+ 27 more concepts