koutch/short_paper_llama_0.json_train_grpo_v2_dev Text Generation • 8B • Updated about 21 hours ago • 33
koutch/short_paper_qwen_0.json_train_grpo_v2_dev Text Generation • 4B • Updated about 21 hours ago • 19