James-WYang/ICR_ANALYSIS_M1_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_with_t-1_reference_model 8B • Updated Aug 4 • 5
James-WYang/ICR_ANALYSIS_M0_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_wo_length_control 8B • Updated Aug 4 • 7
James-WYang/ICR_ANALYSIS_M0_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_each_language_5000_samples 8B • Updated Aug 4 • 6
James-WYang/ICR_ANALYSIS_M0_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr_each_language_1000_samples 8B • Updated Aug 4 • 4
James-WYang/LIDR_Multilingual_Reasoning_M1_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr 8B • Updated Aug 1 • 5
James-WYang/LIDR_Multilingual_Reasoning_M0_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr 8B • Updated Aug 1 • 6