MultiRL

non-profit

AI & ML interests

None defined yet.

Recent Activity

iruno updated a dataset about 5 hours ago

MultiRL/new_sudoku_benchmark_2045_with_variants_extra_only

iruno updated a dataset about 5 hours ago

MultiRL/new_sudoku_benchmark_2045_with_variants

KimSHine updated a model about 5 hours ago

MultiRL/qwen3_1.7b_rush_hour_one_move_easy_short_rl_step_120

View all activity

MultiRL 's models 130

MultiRL/qwen3_4b_easy_rl_new

4B • Updated Dec 16, 2025 • 87

MultiRL/qwen3_1.7b_easy_rl_gspo

2B • Updated Dec 16, 2025 • 3

MultiRL/qwen3_4b_sft_new

4B • Updated Dec 15, 2025 • 61

MultiRL/qwen3_1.7b_easy_rl_final_step120

2B • Updated Dec 15, 2025 • 2.45k

MultiRL/qwen3_4b_medium_rl_final

4B • Updated Dec 15, 2025 • 345

MultiRL/qwen3_4b_sft_one_act

4B • Updated Dec 14, 2025 • 63

MultiRL/qwen3_1.7b_easy_rl_reinforce_ori

2B • Updated Dec 14, 2025 • 89

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_0.5

2B • Updated Dec 14, 2025 • 3

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_1

2B • Updated Dec 14, 2025 • 3

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_0

2B • Updated Dec 14, 2025 • 2

MultiRL/qwen3_1.7b_sft_one_act

2B • Updated Dec 14, 2025 • 106

MultiRL/qwen3_1.7b_easy_rl_final

2B • Updated Dec 13, 2025 • 869

MultiRL/qwen3_4b_easy_rl_final

4B • Updated Dec 13, 2025 • 64

MultiRL/qwen3_1.7b_sft_final

2B • Updated Dec 11, 2025 • 3.45k

MultiRL/qwen3_4b_sft_final

4B • Updated Dec 11, 2025 • 169

MultiRL/qwen3_1.7b_easy_rl_new

2B • Updated Dec 6, 2025 • 1

MultiRL/qwen3_4b_standard_medium_rl

4B • Updated Dec 6, 2025 • 53

MultiRL/qwen3_4b_standard_easy_rl

4B • Updated Dec 5, 2025 • 55

MultiRL/qwen3_4b_medium_rl_progress_C

4B • Updated Dec 5, 2025

MultiRL/qwen3_4b_medium_rl

4B • Updated Dec 4, 2025 • 52

MultiRL/qwen3_4b_easy_rl

4B • Updated Dec 2, 2025 • 33

MultiRL/qwen3_4b_instruct_sft

4B • Updated Dec 1, 2025 • 66

MultiRL/qwen3_1.7b_easy_rl_test_task_group

2B • Updated Dec 1, 2025

MultiRL/qwen3_1.7b_easy_rl_test

2B • Updated Nov 30, 2025 • 43

MultiRL/qwen3_8b_easy_rl

8B • Updated Nov 29, 2025 • 28

MultiRL/qwen3_8b_sudoku_sft

8B • Updated Nov 28, 2025 • 27

MultiRL/qwen3_1.7b_sudoku_sft

2B • Updated Nov 28, 2025 • 107

MultiRL/qwen3_1.7b_easy_reinforce_batch_32_by_pass

2B • Updated Nov 26, 2025 • 20

MultiRL/qwen3_1.7b_easy_reinforce_batch_64_by_pass

2B • Updated Nov 25, 2025

MultiRL/qwen3_1.7b_easy_reinforce_test

2B • Updated Nov 23, 2025 • 5