deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 707k • • 1.45k
Running 3.68k The Ultra-Scale Playbook 🌌 3.68k The ultimate guide to training LLM on large GPU Clusters
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 Reinforcement Learning • 8B • Updated Mar 26, 2025 • 1.02k • 227