Qwen/Qwen3-30B-A3B-Thinking-2507-FP8 Text Generation • 31B • Updated Jul 30, 2025 • 41.6k • 59
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 206k • 1.1k
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 5 items • Updated 2 days ago • 165
Model Memory Utility • Running on CPU Upgrade • Featured • 999 • Calculate vRAM needed for model training and inference
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5, 2024 • 71
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation Paper • 2410.09584 • Published Oct 12, 2024 • 48