Jethro's Braindump

Meta Learning

tags
Reinforcement Learning ⭐

Learning to learn: learn an update rule from related tasks

For example, tasks are related through a low-dimensional embedding.

Model-Agnostic Meta Learning (MAML)

Based on 2nd-order gradient descent:

2-stage gradient-based approach on batches of tasks T:

  1. Inner loop:

θi=θαθLT(fθ)

  1. Outer Loop:

θ=θβθTip(T)LJi(fθi)

Resources