spectacle

spectaclecs

spectaclecs

AI & ML interests

Multimodal LLM, Agent

Recent Activity

upvoted a collection 14 days ago

DeepSeek-V3.2

liked a model 17 days ago

deepseek-ai/DeepSeek-R1-Zero

liked a model 21 days ago

openai/gpt-oss-120b

View all activity

Organizations

upvoted a collection 14 days ago

DeepSeek-V3.2

Collection

4 items • Updated Dec 1, 2025 • 513

liked a model 17 days ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • 685B • Updated Mar 27, 2025 • 3.1k • 940

liked a model 21 days ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.25M • • 4.32k

liked a model 24 days ago

PRIME-RL/P1-30B-A3B

Text Generation • 31B • Updated Oct 24, 2025 • 59 • 9

upvoted a paper about 1 month ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 247

liked 3 models about 1 month ago

upvoted 2 papers about 1 month ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 21

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 86

liked a model about 2 months ago

PRIME-RL/P1-235B-A22B

Text Generation • 235B • Updated Oct 24, 2025 • 16 • 18

upvoted a paper about 2 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

liked a Space 2 months ago

Qwen3 VL Demo

😻

347

Interact with a chatbot that handles text and images

upvoted a collection 2 months ago

CapRL

Collection

Data & Models for CapRL1.0 series &2.0 series • 10 items • Updated 16 days ago • 6

upvoted a paper 5 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

liked a dataset 5 months ago

FanqingM/MMK12

Viewer • Updated Apr 16, 2025 • 17.6k • 356 • 22

liked 2 models 6 months ago

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated Sep 17, 2025 • 90.2k • • 742

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 2.25M • • 1.42k

liked a dataset 6 months ago

craigwu/vstar_bench

Viewer • Updated May 2, 2024 • 191 • 8.1k • 37

upvoted an article over 1 year ago

Article

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

Oct 21, 2022

•