Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MultiRL
non-profit
Activity Feed
Follow
3
AI & ML interests
None defined yet.
Recent Activity
KimSHine
updated
a model
2 days ago
MultiRL/qwen3_1.7b_sft_final_easy_reinforce_ours_adv_fixed_gamma_0.9
KimSHine
published
a model
2 days ago
MultiRL/qwen3_1.7b_sft_final_easy_reinforce_ours_adv_fixed_gamma_0.9
KimSHine
updated
a dataset
2 days ago
MultiRL/tower_of_hanoi_benchmark
View all activity
Team members
3
MultiRL
's datasets
23
Sort: Recently updated
MultiRL/tower_of_hanoi_benchmark
Viewer
•
Updated
2 days ago
•
30
•
15
MultiRL/rush_hour_benchmark
Viewer
•
Updated
2 days ago
•
150
•
16
MultiRL/rush_hour_hard_rl
Viewer
•
Updated
2 days ago
•
640
•
17
MultiRL/rush_hour_medium_rl
Viewer
•
Updated
2 days ago
•
640
•
17
MultiRL/rush_hour_easy_rl
Viewer
•
Updated
2 days ago
•
640
•
14
MultiRL/final_sudoku_hard_new_rl
Viewer
•
Updated
17 days ago
•
480
•
51
MultiRL/final_sudoku_hard_rl_hint_raw_new
Viewer
•
Updated
19 days ago
•
635
•
23
MultiRL/final_sudoku_hard_rl_hint_raw
Viewer
•
Updated
19 days ago
•
640
•
14
MultiRL/final_sudoku_benchmark_with_hint_solver_difficulty
Viewer
•
Updated
19 days ago
•
300
•
16
MultiRL/final_sudoku_benchmark
Viewer
•
Updated
24 days ago
•
680
•
417
MultiRL/Sudoku-Benchmark_new
Viewer
•
Updated
26 days ago
•
300
•
20
MultiRL/final_sudoku_sft_A
Viewer
•
Updated
26 days ago
•
399
•
12
MultiRL/sudoku_hard_solved_first_final
Viewer
•
Updated
28 days ago
•
640
•
21
MultiRL/sudoku_hard_solved_25_final
Viewer
•
Updated
28 days ago
•
640
•
25
MultiRL/sudoku_hard_solved_10_final
Viewer
•
Updated
28 days ago
•
640
•
10
MultiRL/sudoku_hard_with_hint
Viewer
•
Updated
29 days ago
•
640
•
28
MultiRL/sudoku_easy_rl_shuffle
Viewer
•
Updated
Nov 24, 2025
•
1.2k
•
3
MultiRL/sudoku_validation
Viewer
•
Updated
Nov 17, 2025
•
56
•
47
MultiRL/sudoku_4x4_sft_1_gpt
Viewer
•
Updated
Nov 12, 2025
•
133
•
13
MultiRL/sudoku_4x4_sft_1
Viewer
•
Updated
Nov 6, 2025
•
133
•
3
MultiRL/sudoku_knowledge
Viewer
•
Updated
Nov 6, 2025
•
79
•
34
MultiRL/annotation_dataset
Viewer
•
Updated
Oct 30, 2025
•
21.5k
•
84
MultiRL/Sudoku-Benchmark
Viewer
•
Updated
Oct 24, 2025
•
645
•
54