ML Foundations Development

non-profit

https://github.com/mlfoundations

AI & ML interests

None defined yet.

Recent Activity

marianna13 published a model about 19 hours ago

mlfoundations-dev/Qwen3-8B_exp-swd-r2egym-standard_glm_4.7_traces_locetash_save-strategy_steps

marianna13 published a model about 21 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_temp_2.0_traces_save-strategy_steps

marianna13 published a model about 21 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_trajectory_minimal_traces_save-strategy_steps

View all activity

marianna13

published a model about 19 hours ago

mlfoundations-dev/Qwen3-8B_exp-swd-r2egym-standard_glm_4.7_traces_locetash_save-strategy_steps

Updated about 19 hours ago

marianna13

published 4 models about 21 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_temp_2.0_traces_save-strategy_steps

Updated about 21 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_trajectory_minimal_traces_save-strategy_steps

Updated about 21 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps

Updated about 21 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_summarize_threshold_4096_traces_save-strategy_steps

Updated about 21 hours ago

marianna13

published 2 models about 22 hours ago

mlfoundations-dev/Qwen3-8B_perturbed-docker-exp-taskmaster2-tasks_glm_4.7_traces_locetash_save-strategy_steps

Updated about 22 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_temp_0.5_traces_save-strategy_steps

Updated about 22 hours ago

marianna13

published 2 models about 23 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_top_k_32_traces_save-strategy_steps

Updated about 23 hours ago

mlfoundations-dev/Qwen3-8B_exp_tas_tmux_large_traces_save-strategy_steps

Updated about 23 hours ago

marianna13

published a model 1 day ago

mlfoundations-dev/Qwen3-8B_exp_tas_temp_0_5_traces_save-strategy_steps

Updated 1 day ago

marianna13

published a model 6 days ago

mlfoundations-dev/staqc-ot3-100k-code-subset-traces-terminus-2_save-strategy_steps_Qwen3-8B

Updated 6 days ago

penfever

published a model 13 days ago

mlfoundations-dev/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_1e-05_Qwen3-32B

Updated 13 days ago

penfever

published a model 15 days ago

mlfoundations-dev/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_6.0_Qwen3-32B

Updated 15 days ago

penfever

published a model 16 days ago

mlfoundations-dev/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_4.0_Qwen3-32B

Updated 16 days ago

penfever

published a dataset 18 days ago

mlfoundations-dev/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k_eval_8179

Updated 18 days ago • 2

penfever

published a dataset 19 days ago

mlfoundations-dev/evalset_2444

Updated 19 days ago • 2

Zaynes

authored a paper about 1 month ago

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

Paper • 2512.04072 • Published Dec 3, 2025 • 4

sedrickkeh

authored a paper about 1 month ago

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

Paper • 2512.04072 • Published Dec 3, 2025 • 4

EtashGuha

published 2 datasets about 2 months ago

mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-0-3k-traces-restore-hp_202578280dae

Updated Nov 18, 2025 • 1

mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-3k-traces-restore-hp_20251103be0a79

Updated Nov 18, 2025 • 3