MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning Paper • 2602.10575 • Published 6 days ago • 4
One Model for All Tasks: Leveraging Efficient World Models in Multi-Task Planning Paper • 2509.07945 • Published Sep 9, 2025 • 1
puyuan1996/unizero_mt_moco_dmc8_concat_task_embed_nlayer8_20250221 Preview • Updated Feb 21, 2025 • 6
puyuan1996/unizero_mt_moco_dmc8_concat_task_embed_nlayer8_20250221 Preview • Updated Feb 21, 2025 • 6
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 401