ZhangJin

Benjamin0

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

liked a dataset 2 months ago

meituan-longcat/AMO-Bench

liked a model 4 months ago

internlm/Intern-S1

View all activity

Organizations

None yet

upvoted a paper 4 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

741

upvoted a paper 6 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39

upvoted an article 6 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.31k

upvoted an article 7 months ago

Article

The Common Pile v0.1

Jun 6, 2025

•

upvoted an article 8 months ago

Article

PipelineRL

Apr 25, 2025

•

upvoted a paper 9 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

upvoted an article 9 months ago

Article

Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?

Apr 4, 2025

•

upvoted 2 articles 10 months ago

Article

What changed in the Transformer architecture

Mar 8, 2025

•

Article

Common AI Model Formats

Feb 27, 2025

•

upvoted a paper 10 months ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24, 2025 • 73

upvoted 2 articles 11 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

267

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Jan 16, 2025

•

ZhangJin

AI & ML interests

Recent Activity

Organizations

Benjamin0's activity

SmolLM3: smol, multilingual, long-context reasoner

Open-source DeepResearch – Freeing our search agents

The Common Pile v0.1

PipelineRL

Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?

What changed in the Transformer architecture

Common AI Model Formats

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference