Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published 3 days ago • 59
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-judge-mtbench_cot_truth Viewer • Updated Dec 1, 2024 • 6 • 43
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_reason Viewer • Updated Nov 30, 2024 • 61.1k • 46
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_safe Viewer • Updated Nov 29, 2024 • 61.1k • 41
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_hon Viewer • Updated Nov 28, 2024 • 61.1k • 41
simonycl/ultrafeedback_binarized_raw-annotate-judge-mtbench_cot_truth Viewer • Updated Nov 27, 2024 • 61.1k • 50
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-judge-mtbench_cot_helpsteer_complexity Viewer • Updated Nov 22, 2024 • 62k • 46
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-judge-mtbench_cot_helpsteer_verbose Viewer • Updated Nov 18, 2024 • 62k • 38
simonycl/Meta-Llama-3-8B-Instruct_ultrafeedback-annotate-judge-mtbench_cot_helpsteer_helpfulness Viewer • Updated Nov 18, 2024 • 62k • 38