E2CL: Exploration-based Error Correction Learning for Embodied Agents Paper • 2409.03256 • Published Sep 5, 2024 • 1
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning Paper • 2505.16782 • Published May 22 • 1
SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution Paper • 2505.20732 • Published May 27 • 1
STeCa: Step-level Trajectory Calibration for LLM Agent Learning Paper • 2502.14276 • Published Feb 20 • 1