tattrongvu committed: Update README.md

README.md CHANGED
```diff
@@ -37,11 +37,6 @@ Data is the same as the ColPali data described in the paper.
 
 ## Model Training
 
-### Dataset
-The dataset was extended from the original ColPali train set with Gemini 1.5 Flash generated QA on 35k images scraped from the internet.
-
-*Note: Multilingual data is present in the pretraining corpus of the language model and most probably in the multimodal training.*
-
 ### Parameters
 We train models using low-rank adapters ([LoRA](https://arxiv.org/abs/2106.09685))
 with `alpha=128` and `r=128` on the transformer layers from the language model,
```
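The retained `### Parameters` lines describe LoRA fine-tuning with `alpha=128` and `r=128` on the language model's transformer layers. A minimal sketch of what such an adapter does to a single linear layer is below; the `LoRALinear` class, its initialization constants, and the scaling convention `alpha / r` are assumptions drawn from the LoRA paper, not from this model card.

```python
# Minimal sketch of a LoRA-adapted linear layer (illustrative only; the card
# states just `alpha=128`, `r=128` -- module choice and init are assumptions).
import numpy as np

class LoRALinear:
    def __init__(self, weight: np.ndarray, r: int = 128, alpha: int = 128, seed: int = 0):
        self.weight = weight               # frozen base weight, shape (out, in)
        out_dim, in_dim = weight.shape
        rng = np.random.default_rng(seed)
        # A starts random, B starts at zero, so training begins at the base model.
        self.A = rng.normal(0.0, 0.02, size=(r, in_dim))
        self.B = np.zeros((out_dim, r))
        self.scaling = alpha / r           # alpha=128, r=128 gives scaling 1.0

    def __call__(self, x: np.ndarray) -> np.ndarray:
        # y = x W^T + scaling * x A^T B^T; only A and B would receive gradients.
        return x @ self.weight.T + self.scaling * (x @ self.A.T) @ self.B.T

W = np.eye(4)
layer = LoRALinear(W, r=2, alpha=2)
x = np.ones((1, 4))
# With B = 0 the adapter is a no-op: output equals the frozen base layer.
print(np.allclose(layer(x), x @ W.T))  # True
```

With `alpha == r` the `alpha / r` scaling is 1.0, so the chosen `alpha=128`, `r=128` applies the adapter update at full strength rather than damping it.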