M Saad Salman's picture

4 254

M Saad Salman

MSS444

·

MSS444

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

upvoted a paper 3 days ago

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs

upvoted a paper 3 days ago

Recursive Language Models

View all activity

Organizations

None yet

commented a paper 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190 •

commented 3 papers 6 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

New activity in huggingchat/chat-ui over 1 year ago

[MODELS] Discussion

#372 opened almost 2 years ago by

[MODELS] Discussion

#372 opened almost 2 years ago by

[MODELS] Discussion

#372 opened almost 2 years ago by