mlfoundations-dev/Qwen3-8B_exp-swd-r2egym-standard_glm_4.7_traces_locetash_save-strategy_steps Updated about 19 hours ago
mlfoundations-dev/Qwen3-8B_exp_tas_trajectory_minimal_traces_save-strategy_steps Updated about 21 hours ago
mlfoundations-dev/Qwen3-8B_exp_tas_summarize_threshold_4096_traces_save-strategy_steps Updated about 21 hours ago
mlfoundations-dev/Qwen3-8B_perturbed-docker-exp-taskmaster2-tasks_glm_4.7_traces_locetash_save-strategy_steps Updated about 22 hours ago
mlfoundations-dev/staqc-ot3-100k-code-subset-traces-terminus-2_save-strategy_steps_Qwen3-8B Updated 6 days ago
mlfoundations-dev/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_1e-05_Qwen3-32B Updated 13 days ago
mlfoundations-dev/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_6.0_Qwen3-32B Updated 15 days ago
mlfoundations-dev/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_4.0_Qwen3-32B Updated 16 days ago
mlfoundations-dev/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k_eval_8179 Updated 18 days ago • 2
SkillFactory: Self-Distillation For Learning Cognitive Behaviors Paper • 2512.04072 • Published Dec 3, 2025 • 4
mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-0-3k-traces-restore-hp_202578280dae Updated Nov 18, 2025 • 1
mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-3k-traces-restore-hp_20251103be0a79 Updated Nov 18, 2025 • 3