Jethro's Braindump

Neuroscience and Reinforcement Learning

tags: Reinforcement Learning ⭐, Neuroscience ⭐

Reinforcement Learning ⭐ is similar to how humans learn:

Learning by trial-and-error
Reward is often delayed

TD errors (Temporal Difference Learning) model the activity of dopamine neurons (Schultz, Dayan, and Montague, n.d.)

Bibliography

Schultz, W., P. Dayan, and P. R. Montague. n.d. “A Neural Substrate of Prediction and Reward” 275 (5306):1593–99. https://doi.org/10.1126/science.275.5306.1593.