Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning Paper • 2510.23473 • Published Oct 27, 2025 • 84
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 122
CoAct-1: Computer-using Agents with Coding as Actions Paper • 2508.03923 • Published Aug 5, 2025 • 14
CoAct-1: Computer-using Agents with Coding as Actions Paper • 2508.03923 • Published Aug 5, 2025 • 14
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback Paper • 2506.11930 • Published Jun 13, 2025 • 53