SEGAgentRL

non-profit

AI & ML interests

We target improved agent reinforcement learning in terms of stability (S), efficiency (E), and generalization (G).

Recent Activity

dwenlong updated a collection about 5 hours ago

dwenlong updated a collection about 16 hours ago

dwenlong updated a collection about 16 hours ago

View all activity

Collections 1

models 9

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 16 hours ago • 24

SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base

Reinforcement Learning • 3B • Updated about 16 hours ago • 22

SEGAgentRL/LLDS-R-GSPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 16 hours ago • 26

SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 16 hours ago • 32

SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated about 16 hours ago • 25

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base

Reinforcement Learning • 3B • Updated about 16 hours ago • 17

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base-MA

Reinforcement Learning • 3B • Updated about 16 hours ago • 28

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base

Reinforcement Learning • 8B • Updated about 16 hours ago • 56 • 2

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins

Reinforcement Learning • 8B • Updated about 16 hours ago • 81 • 1

datasets 0

None public yet