Vision Language Models are In-Context Value Learners- formulating value learning as an autoregressive prediction task over shuffled sequence of the input video- suitable across different tasks and in-context learning#ICLR2025 #manipulation #in_contextX Overview, Project Website

Vision Language Models are In-Context Value Learners

- formulating value learning as an autoregressive prediction task over shuffled sequence of the input video
- suitable across different tasks and in-context learning

#ICLR2025 #manipulation #in_context

X Overview, Project Website

🧵 Thread • FixupX

Jason Ma (@JasonMa2020)

Excited to finally share Generative Value Learning (GVL), my @GoogleDeepMind project on extracting universal value functions from long-context VLMs via in-context learning!

We discovered a simple method to generate zero-shot and few-shot values for 300+…