Neuroscience and Reinforcement Learning
Reinforcement learning is similar to how humans learn:
- Learning by trial-and-error
- Reward is often delayed
TD errors, the reward-prediction errors used in temporal-difference (TD) learning, model the activity of dopamine neurons (Schultz, Dayan, and Montague 1997).
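To make the TD error concrete, here is a minimal sketch in Python. It uses the standard one-step form, delta = r + gamma * V(s') - V(s), and runs tabular TD(0) on a small chain in which the only reward arrives at the very end. The chain environment, the function names `td_error` and `run_td0`, and the parameter values are assumptions chosen for illustration; they do not come from the cited paper.

```python
def td_error(reward, value_next, value_current, gamma=0.9):
    """One-step TD error: delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_current


def run_td0(num_states=5, episodes=500, alpha=0.1, gamma=0.9):
    """Tabular TD(0) on a hypothetical chain 0 -> 1 -> ... -> terminal.

    The only reward (1.0) is given on the final transition, so the
    reward is delayed relative to the earlier states.
    Returns the learned values and the TD errors of the first and
    last episodes.
    """
    # One extra entry for the terminal state, whose value stays 0.
    values = [0.0] * (num_states + 1)
    first_errors, last_errors = [], []

    for episode in range(episodes):
        errors = []
        for s in range(num_states):
            reward = 1.0 if s == num_states - 1 else 0.0
            delta = td_error(reward, values[s + 1], values[s], gamma)
            values[s] += alpha * delta  # move V(s) toward the TD target
            errors.append(delta)
        if episode == 0:
            first_errors = errors
        last_errors = errors

    return values, first_errors, last_errors


if __name__ == "__main__":
    values, first, last = run_td0()
    print("TD errors, first episode:", first)
    print("TD errors, last episode: ", [round(e, 4) for e in last])
    print("Learned values:          ", [round(v, 4) for v in values[:-1]])
```

In the first episode the reward is unpredicted, so the TD error is large on the rewarded step. Once the values have been learned, the same reward is fully predicted and the TD error is close to zero everywhere. This loosely mirrors the reported finding that dopamine neurons respond to unexpected rewards but much less to predicted ones.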
Bibliography
Schultz, W., P. Dayan, and P. R. Montague. 1997. “A Neural Substrate of Prediction and Reward.” Science 275 (5306): 1593–99. https://doi.org/10.1126/science.275.5306.1593.