LLM-in-Sandbox Collection Data and models for the paper: LLM-in-Sandbox Elicits General Agentic Intelligence. Feel free to open an issue if you have any questions or problems! • 3 items • Updated 3 days ago • 1
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published 24 days ago • 37
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published 24 days ago • 40
Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning Paper • 2602.00759 • Published 27 days ago • 5
FlowRL: Matching Reward Distributions for LLM Reasoning Paper • 2509.15207 • Published Sep 18, 2025 • 116