File size: 583 Bytes
9a028ed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9bc8d73
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
license: mit
datasets:
- liuhaotian/LLaVA-Pretrain
- lmms-lab/LLaVA-ReCap-558K
- lmms-lab/LLaVA-ReCap-118K
- lmms-lab/LLaVA-ReCap-CC3M
- lmms-lab/LLaVA-OneVision-Mid-Data
- lmms-lab/LLaVA-OneVision-Data
- Zhiqiang007/MathV360K
language:
- en
base_model:
- Qwen/Qwen2-0.5B-Instruct
- google/siglip-so400m-patch14-384
tags:
- LLaVA-OneVision-Manager
- LLaVA-OV-Manager
- Manager
---


Model weights for our submission to TCSVT, titled "Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs".

Related materials can be found at https://looperxx.github.io/.