Thank you for the code.
I have a question about the base_learner update.
- The base-learner is fast updated using 100 steps.
- then we return qry_logits and calculate the cross entropy loss for qry_set
- using sel.optimizer to update.
But what gradient is stored in base_learner? Because you use fast-model to calcualte qry loss