Yang Zhou
nbzy1995
0 followers · 5 following
nbzy1995
yang-zhou-524b51170
AI & ML interests
Artificial General Intelligence, AI for Science, AI for society
Recent Activity
Updated a model about 1 month ago: nbzy1995/Qwen2-0-5B-GRPO-vllm-trl
nbzy1995's models (16)
Sort: Recently updated
nbzy1995/Qwen2-0-5B-GRPO-vllm-trl • Updated Nov 17
nbzy1995/Qwen3-VL-4B-Instruct-trl-grpo • Updated Nov 13
nbzy1995/Reinforce-Cartpole-v1 • Reinforcement Learning • Updated Jun 7
nbzy1995/dqn_rl_zoo3_atari • Reinforcement Learning • Updated Jun 6 • 4
nbzy1995/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated Jun 4
nbzy1995/ppo-LunarLander-v2 • Reinforcement Learning • Updated Jun 1 • 3
nbzy1995/LunarLander-v2-scratch • Reinforcement Learning • Updated May 31
nbzy1995/poca-SoccerTwos • Reinforcement Learning • Updated May 2 • 14
nbzy1995/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated Apr 22 • 2
nbzy1995/ppo-PyramidsRND • Reinforcement Learning • Updated Apr 18 • 12
nbzy1995/ppo-SnowballTarget • Reinforcement Learning • Updated Apr 18 • 20
nbzy1995/Reinforce-PixelCopter • Reinforcement Learning • Updated Apr 17
nbzy1995/Taxi-v3 • Reinforcement Learning • Updated Apr 5
nbzy1995/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Apr 5
nbzy1995/ppo-Huggy • Reinforcement Learning • Updated Mar 14 • 56
nbzy1995/test • Updated Mar 13