In our ICML paper, we study fine-tuning a generalist policy for multiple tasks. We ask, provided a pre-trained policy, how can we maximize multi-task performance with a minimal number of additional demonstrations?
📌 We are presenting a possible solution on Wed, 11am to 1.30pm at B2-B3 W-609!
2 months ago