OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG Paper • 2601.09028 • Published 4 days ago • 29
Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 13
Search-R1-v0.2 Collection Exploration with a more stable RL pipeline with outcome-only reward and scaled-up LLMs. https://arxiv.org/abs/2503.09516 • 26 items • Updated Aug 12, 2025 • 5
Search-R1-v0.3 Collection RL with outcome reward + format reward. https://arxiv.org/abs/2505.15117 • 12 items • Updated Aug 12, 2025 • 3
MIRIX: Multi-Agent Memory System for LLM-Based Agents Paper • 2507.07957 • Published Jul 10, 2025 • 79