Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes? Mar 5, 2024 • 4
hbXNov/llama3.1-8b_train_gpt_4o_verifications_e3_lr5e-7-add-special-true-len3072-19233-merged Updated 10 days ago • 5
hbXNov/llama3.1-8b_train_gpt_4o_verifications_e3_lr5e-7-add-special-true-31389-merged Updated 10 days ago • 5
hbXNov/llama3.1-8b_train_correct_verifications_gt_soln_in_context_consistent_verifications-32786-merged Updated 20 days ago • 7