BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks Paper • 2510.02418 • Published Oct 2, 2025 • 2
RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models Paper • 2510.03515 • Published Oct 3, 2025 • 2