---
license: mit
datasets:
- liuhaotian/LLaVA-Pretrain
- lmms-lab/LLaVA-ReCap-558K
- lmms-lab/LLaVA-ReCap-118K
- lmms-lab/LLaVA-ReCap-CC3M
- lmms-lab/LLaVA-OneVision-Mid-Data
- lmms-lab/LLaVA-OneVision-Data
- Zhiqiang007/MathV360K
language:
- en
base_model:
- Qwen/Qwen2-0.5B-Instruct
- google/siglip-so400m-patch14-384
tags:
- LLaVA-OneVision-Manager
- LLaVA-OV-Manager
- Manager
---
This repository contains the model weights for our submission to TCSVT, titled "Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs".
Related materials can be found at https://looperxx.github.io/.