deepseek-ai/DeepSeek-V3.2-Speciale • Text Generation • 685B • Updated about 1 month ago • 27.6k • 627
deepseek-ai/DeepSeek-V3.2 • Text Generation • 685B • Updated about 1 month ago • 115k • 1.05k
Article • Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What's Really Changing in Transformers? • Apr 4, 2025 • 16
Article • Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) • Jan 19, 2025 • 38
LLM Hallucination Leaderboard • Running on CPU Upgrade • 184 • View and filter LLM hallucination leaderboard
intfloat/multilingual-e5-large-instruct • Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.36M • 592