Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
- uses RL to train a residual policy that refines a large base policy online
- similar to TRANSIC
#ICLR2025 #RL #refinement
X Overview, Project Website, TRANSIC
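
A minimal sketch of the residual-policy idea noted above, assuming a frozen base policy whose action is corrected by a small learned residual. This is a generic illustration, not the paper's implementation: `base_policy`, `residual_policy`, `refined_action`, the linear residual, the scale `alpha`, and the [-1, 1] action bounds are all hypothetical choices made for the example.

```python
import numpy as np

def base_policy(obs):
    """Stand-in for a frozen, pretrained base policy (hypothetical)."""
    return np.tanh(obs[:2])

def residual_policy(obs, weights):
    """Small trainable correction: a linear map squashed into [-1, 1]."""
    return np.tanh(weights @ obs)

def refined_action(obs, weights, alpha=0.1):
    """Base action plus a scaled residual, clipped to the action bounds."""
    a_base = base_policy(obs)              # base policy is never updated
    a_res = residual_policy(obs, weights)  # only `weights` would be trained by RL
    return np.clip(a_base + alpha * a_res, -1.0, 1.0)

obs = np.array([0.5, -0.2, 0.1])
weights = np.zeros((2, 3))  # zero residual: refined action equals base action
action = refined_action(obs, weights)
```

Because the residual is squashed and scaled by `alpha`, the refined action can never move more than `alpha` away from the base action in this sketch, so an untrained residual cannot wreck the base behavior.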