Vision Language Models are In-Context Value Learners

- formulating value learning as an autoregressive prediction task over shuffled sequence of the input video
- suitable across different tasks and in-context learning

#ICLR2025 #manipulation #in_context

X Overview, Project Website
 
 
Back to Top