-
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Paper • 2501.03262 • Published • 104 -
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Paper • 2505.24864 • Published • 144 -
Reinforcement Learning in Vision: A Survey
Paper • 2508.08189 • Published • 30 -
AVATAR: Reinforcement Learning to See, Hear, and Reason Over Video
Paper • 2508.03100 • Published
Maarten Bussler
MaartenBussler
AI & ML interests
Machine Learning, Computer Vision, Cloud Computing
Recent Activity
updated
a collection
about 2 months ago
RL upvoted a paper about 2 months ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs updated
a collection
2 months ago
RL Organizations
None yet