Unin

#temporal_difference_learning

Temporal difference learning

Computer programming concept

Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods.

Sat 27th

Provided by Wikipedia

Learn More

This keyword could refer to multiple things. Here are some suggestions:

Temporal difference learning Deep reinforcement learning Richard S. Sutton Q-learning Reinforcement learning Timeline of machine learning 2048 (video game) Outline of machine learning Backgammon Conference on Neural Information Processing Systems

0 searches

This keyword has never been searched before

This keyword has never been searched for with any other keyword.