RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-3B-Instruct-Q4-LoRA8-Batch-16-Tok-1024 Updated 4 days ago
sadra-barikbin/V3_llama-3.2-3b-query-understandings_prompt_short_r_64_epoch_2 Updated about 21 hours ago