|
--- |
|
license: mit |
|
datasets: |
|
- liuhaotian/LLaVA-Pretrain |
|
- lmms-lab/LLaVA-ReCap-558K |
|
- lmms-lab/LLaVA-ReCap-118K |
|
- lmms-lab/LLaVA-ReCap-CC3M |
|
- lmms-lab/LLaVA-OneVision-Mid-Data |
|
- lmms-lab/LLaVA-OneVision-Data |
|
- Zhiqiang007/MathV360K |
|
language: |
|
- en |
|
base_model: |
|
- Qwen/Qwen2-0.5B-Instruct |
|
- google/siglip-so400m-patch14-384 |
|
tags: |
|
- LLaVA-OneVision-Manager |
|
- LLaVA-OV-Manager |
|
- Manager |
|
--- |
|
|
|
|
|
Model weights for our submission to TCSVT, titled "Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs". |
|
|
|
Related materials can be found at https://looperxx.github.io/. |