view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 4 days ago • 37
AxBench Release Collection Open supervised dictionary learning models and datasets for Gemma 2 2B and 9B instruction-tuned models. • 13 items • Updated Feb 11, 2025 • 6
Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published Feb 18, 2025 • 37